Prediction List Improvements for Result File Dataclasses #206
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Expanding the PredictionList API to simplify some common patterns seen in auto review and custom output.
Group by Set of Linked Labels / Pages
Predictions can now be grouped by mutable collections, including a document extractions's set of linked label groups and an unbundling's list of pages. Internally, mutable collections are converted to their immutable variant before being used as a dictionary key.
Before:
After:
Group by Individual Linked Labels / Pages
The new
.groupbyiter()
method groups each prediction with every key in an iterable individually. This is particularly useful for a document extraction's set of linked label groups. While it's sometimes desirable to group by the entire set as.groupby()
does, it's more often desirable to group by each linked label group individually.Before:
After:
The
.groupby()
and.groupbyiter()
unit tests are good examples of the difference in behavior:"Where Attr In" for Documents, Models, and Reviews
The
.where()
method has newdocument_in
,model_in
, andreview_in
keyword args to complement the existing single-value variants.Before:
After: