Integrate metadata extraction from papers using GPT/ScholarAI #35

Open
Bankso opened this issue Jan 24, 2024 · 0 comments

Bankso commented Jan 24, 2024

Per this pubpub: https://sagebionetworks.pubpub.org/pub/vh1xcgd9/release/6

Discussed with Jineta on 1/24/24: we can use the framework described in the article linked above to request metadata extraction from papers using GPT/ScholarAI.

Proposed input: the article text, a prompt requesting metadata extraction, and a metadata template (could be JSON, CSV, etc.)
Output: metadata template, populated with information extracted by the model
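
A rough sketch of that input/output flow, assuming a JSON template. Everything named here (`build_request`, `populate_template`, `extract_metadata`, the `ask_model` callable, and the template fields) is hypothetical and not taken from the article; `ask_model` stands in for whatever GPT/ScholarAI call we end up using, and is stubbed so the example runs without network access.

```python
import json
from typing import Callable, Dict, Optional


def build_request(article_text: str, instruction: str, template: Dict[str, Optional[str]]) -> str:
    """Combine the three proposed inputs into one prompt string."""
    return (
        f"{instruction}\n\n"
        "Return only a JSON object with exactly these keys:\n"
        f"{json.dumps(template, indent=2)}\n\n"
        f"Article text:\n{article_text}"
    )


def populate_template(template: Dict[str, Optional[str]], model_reply: str) -> Dict[str, Optional[str]]:
    """Fill the template from the model's JSON reply.

    Keys the model did not return keep their template default;
    keys that are not in the template are dropped.
    """
    extracted = json.loads(model_reply)
    if not isinstance(extracted, dict):
        raise ValueError("model reply is not a JSON object")
    return {key: extracted.get(key, default) for key, default in template.items()}


def extract_metadata(
    article_text: str,
    instruction: str,
    template: Dict[str, Optional[str]],
    ask_model: Callable[[str], str],
) -> Dict[str, Optional[str]]:
    """Send the prompt to the model and return the populated template."""
    prompt = build_request(article_text, instruction, template)
    return populate_template(template, ask_model(prompt))


def fake_model(prompt: str) -> str:
    """Stub standing in for the real model call (assumption, for illustration only)."""
    return json.dumps({"title": "Example paper", "doi": "10.0000/example", "extra": "ignored"})


# Hypothetical template fields; the real ones would come from our metadata model.
template = {"title": None, "doi": None, "assay": None}
result = extract_metadata(
    "Full text of the article goes here.",
    "Extract the metadata fields below from the article.",
    template,
    fake_model,
)
print(result)
```

Restricting the output to the template's keys means a reply with extra or missing fields still yields a record of the same shape, which should make runs easier to compare.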

The article notes that scaling this approach was not feasible at the time of publication (Nov 2023), so we will need to work out how to implement this process consistently and reproducibly for MC2 resource curation.
