Result File Dataclasses: IPA 7.X Updates #209

Merged Feb 11, 2025 (16 commits)

@mawelborn commented Feb 10, 2025

This PR adds support for new IPA 7.X models, including a new dataclass for summarization model predictions:

  • Supports GenAI Classification models using the existing Classification dataclass.
    • Identifiable by classification.model.type == ModelGroupType.GENAI_CLASSIFICATION if needed.
  • Supports GenAI Document Extraction models using the existing DocumentExtraction dataclass.
    • Identifiable by extraction.model.type == ModelGroupType.GENAI_EXTRACTION if needed.
  • Supports GenAI Summarization models using the new Summarization dataclass.
    • Summarization is a subclass of Extraction with confidence and text attributes.
    • PredictionList has been updated with a new predictions.summarizations property.
  • Supports partial failure of multifile submissions via new Document.failed, Document.error, and Document.traceback attributes.
  • Factored out repeated span and bounding box complexity using composition.
  • Cleaned up some technical debt.
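
For readers who want to see how the pieces above might fit together, here is a rough, self-contained sketch. It does not import the toolkit: every class below is a simplified stand-in that only borrows the names from this description. The field layouts, the enum values, the `GENAI_SUMMARIZATION` member, and the sample data are all assumptions made for illustration, so check the merged code for the real definitions.

```python
from dataclasses import dataclass
from enum import Enum


class ModelGroupType(Enum):
    # Stand-in enum. The string values are placeholders, and
    # GENAI_SUMMARIZATION is an assumed member not named in the PR text.
    GENAI_CLASSIFICATION = "genai_classification"
    GENAI_EXTRACTION = "genai_extraction"
    GENAI_SUMMARIZATION = "genai_summarization"


@dataclass(frozen=True)
class ModelGroup:
    name: str
    type: ModelGroupType


@dataclass(frozen=True)
class Prediction:
    model: ModelGroup
    label: str
    confidence: float


@dataclass(frozen=True)
class Classification(Prediction):
    pass


@dataclass(frozen=True)
class Extraction(Prediction):
    text: str


@dataclass(frozen=True)
class DocumentExtraction(Extraction):
    pass


@dataclass(frozen=True)
class Summarization(Extraction):
    # Inherits `confidence` and `text` from the classes above.
    pass


class PredictionList(list):
    # Stand-in list type exposing a filtered view by prediction class.
    @property
    def summarizations(self) -> "PredictionList":
        return PredictionList(p for p in self if isinstance(p, Summarization))


@dataclass(frozen=True)
class Document:
    # `failed`, `error`, and `traceback` mirror the new attribute names;
    # the defaults chosen here are assumptions.
    name: str
    failed: bool = False
    error: str = ""
    traceback: str = ""


# Hypothetical sample data.
extract_model = ModelGroup("Invoice Fields", ModelGroupType.GENAI_EXTRACTION)
summary_model = ModelGroup("Invoice Summary", ModelGroupType.GENAI_SUMMARIZATION)

predictions = PredictionList([
    DocumentExtraction(extract_model, "Vendor", 0.97, "Acme Corp"),
    Summarization(summary_model, "Summary", 0.88, "Invoice from Acme Corp."),
])

# GenAI extractions reuse the existing dataclass, so the model type is
# the only thing that tells them apart.
genai_extractions = [
    p for p in predictions
    if isinstance(p, DocumentExtraction)
    and p.model.type == ModelGroupType.GENAI_EXTRACTION
]

summary_texts = [s.text for s in predictions.summarizations]

# Partial failure: skip documents that failed and collect their errors.
documents = [
    Document("a.pdf"),
    Document("b.pdf", failed=True, error="Unreadable file", traceback="..."),
]
succeeded = [d.name for d in documents if not d.failed]
errors = {d.name: d.error for d in documents if d.failed}
```

Checking `model.type` is only needed when GenAI and non-GenAI predictions have to be handled differently, since both share the same dataclass. Summarizations have their own class, so a type-based filter like the `summarizations` property is enough.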

@mawelborn changed the title from "IPA 7.X Updates" to "Result File Dataclasses: IPA 7.X Updates" on Feb 10, 2025
@mawelborn merged commit a87da3d into main on Feb 11, 2025
9 checks passed
@mawelborn deleted the mawelborn/ipa-7-dataclasses branch on February 11, 2025 14:26