Skip to content

Extract disciplines from papers #20

@andreww

Description

@andreww

For some planned use-cases (e.g. #19) we need to be able to determine the research area (discipline) for each paper we process. Hopefully this is exposed in the data we gather from EuroPMC - in which case this can be passed into the URL processing part of the code so we can tag each processed URL with "used by research area" or similar. If not, we probably need some other way of gathering this information. Can we use the doi itself to say anything (e.g. by resolving the journal and going from there)?

One potentially important question is should we insist on each paper belonging to a single discipline, or do we need to allow each paper to belong to multiple disciplines? If we allow multiple disciplines how should we represent this in the "output" data?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions