You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hey @Sunnycheey, we decided that the quality of the tables was too low for practical usage & we decided not to include it as part of the release. We've been since working on how to improve table extraction so we that we might include it in future S2ORC releases. If you're looking for a S2ORC-like dataset that includes higher quality tables, you can check out https://github.com/allenai/cord19 in which we used IBM Research's table parsing software.
I want to know why you remove the table content while processing since the table content is structured and important in many situtation.
The text was updated successfully, but these errors were encountered: