Integrate NLP Modality into EVA #739

kurbanrita · 2024-12-24T11:06:21Z

This is an epic ticket where I will document the integration of the NLP modality to eva.

High-Level Tasks:

Literature review of relevant datasets that can be used as eva tasks.
Adding a custom text dataset to a new language folder.
Define relevant metrics for text classification, free-form text generation, etc.
End-to-end pipeline set up, including relevant modificaitons to the trainer, configurations, etc.

kurbanrita · 2024-12-24T11:15:53Z

An initial literature review is available here: https://kaiko-ai.atlassian.net/l/cp/Xg1JNjZJ.
Its primary objective was picking the initial task but I will continue adding more datasets as I come across them.

I have preliminarily chosen to use PubMedQA as the first text task for eva. First, its classification task with yes/no/maybe questions maps naturally to eva’s classification task, ensuring seamless integration. PubMedQA provides over 1,000 high-quality, human-annotated questions, offering reliable and accurate benchmarks. Being fully open-source, it allows easy access and use without licensing issues. The QA format aligns perfectly with our internal focus on question–answer tasks, making it highly relevant to our needs. Additionally, PubMedQA is not too easy, providing a meaningful challenge that effectively differentiates between simple models and advanced LLMs. Finally, as a widely used and trusted dataset in the medical NLP community, PubMedQA ensures that our evaluations are based on a credible and respected benchmark. These factors make PubMedQA an ideal initial choice for expanding eva’s capabilities to include text-based evaluations.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Integrate NLP Modality into EVA #739

Integrate NLP Modality into EVA #739

kurbanrita commented Dec 24, 2024 •

edited

Loading

kurbanrita commented Dec 24, 2024 •

edited

Loading

Integrate NLP Modality into EVA #739

Integrate NLP Modality into EVA #739

Comments

kurbanrita commented Dec 24, 2024 • edited Loading

kurbanrita commented Dec 24, 2024 • edited Loading

kurbanrita commented Dec 24, 2024 •

edited

Loading

kurbanrita commented Dec 24, 2024 •

edited

Loading