You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Similarly as for the query attribute mention which can list different mention recognizers to be applied, we could introduce a structure attribute that will specified the (unique) underlying structure to be considered for processing a complete document (usually a PDF).
For instance default would mean no structure is considered (recognize and disambiguate entities in any tokens extracted from a PDF), scientific-article would mean to use GROBID model for scientific publication, e-book would mean to use GROBID model for structuring monograph document, patent-st36 a patent in XML ST-36 format, etc.
For each structure type - except the default one - some structured might be ignored because they are not textual or it does not make sense to apply a generic entity extraction, and some will be relevant. Ideally this could be specified in some config files.
The text was updated successfully, but these errors were encountered:
Similarly as for the query attribute
mention
which can list different mention recognizers to be applied, we could introduce astructure
attribute that will specified the (unique) underlying structure to be considered for processing a complete document (usually a PDF).For instance
default
would mean no structure is considered (recognize and disambiguate entities in any tokens extracted from a PDF),scientific-article
would mean to use GROBID model for scientific publication,e-book
would mean to use GROBID model for structuring monograph document,patent-st36
a patent in XML ST-36 format, etc.For each structure type - except the default one - some structured might be ignored because they are not textual or it does not make sense to apply a generic entity extraction, and some will be relevant. Ideally this could be specified in some config files.
The text was updated successfully, but these errors were encountered: