Add something like parent_id to TSV specification #14
Comments
Yes, it does make sense, thanks. At the moment I'm working on a few modifications to this TSV spec based on other feedback, and will add this to that list. It should be up for comment within a few days.
I've labeled the existing TSV spec as v0.1, created a draft v0.2, and modified the examples. These are labeled "for comment": before coding the parsing in WHG, I'd like to hear comments, corrections, etc. Many thanks for weighing in.
Just a request for clarification: Now the spec for parent_id states:
How would I describe that an entity in the same file is the parent, which would have an id, but not necessarily a web-published record (yet)? Would something like
Ah, good point. When dataset files are uploaded to WHG, records are assigned a placeid in our system that will remain constant through any future updating. So they are effectively published and web-accessible. If parents are uploaded separately and first, then their URIs can be used in files that follow, but it is unreasonable to expect that workflow. So would it work to allow (and parse) values like "#2345" for parent_id? On import, rows having a "#" in that position would be processed last, after placeids had been assigned to the preceding rows.
Yes, that sounds reasonable. In practice, I assume processing might become a bit more complex when more than two levels of hierarchy are included. But I guess this could be solved.
Yes, solvable. Probably simplest as a database operation after all rows have been inserted. Settling this spec now means modifying a bunch of code and sample datasets, so I need to make upgrades to the spec as seldom as possible. Thanks again.
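For readers following the thread, here is a minimal sketch of the "resolve after all rows have been inserted" idea discussed above. It is not WHG's actual import code. The `title` column, the `base_uri` value, the placeid numbering, and the function name `import_rows` are all assumptions made for illustration; only `id`, `parent_id`, and the "#" prefix come from the discussion.

```python
import csv
import io
from itertools import count


def import_rows(tsv_text, base_uri="https://example.org/places/"):
    """Assign placeids to all rows, then resolve '#'-prefixed parent_id values.

    base_uri and the placeid counter are hypothetical stand-ins for whatever
    the real system uses.
    """
    next_placeid = count(1000)  # hypothetical placeid sequence
    rows = list(csv.DictReader(io.StringIO(tsv_text), delimiter="\t"))

    # Pass 1: "insert" every row and remember which placeid its local id got.
    placeid_by_local_id = {}
    for row in rows:
        row["placeid"] = next(next_placeid)
        placeid_by_local_id[row["id"]] = row["placeid"]

    # Pass 2: rewrite "#<local id>" references as URIs. Because every row
    # already has a placeid, row order and hierarchy depth do not matter.
    for row in rows:
        parent = row.get("parent_id") or ""
        if parent.startswith("#"):
            local_id = parent[1:]
            if local_id not in placeid_by_local_id:
                raise ValueError(f"parent_id {parent!r} not found in this file")
            row["parent_id"] = f"{base_uri}{placeid_by_local_id[local_id]}"
    return rows
```

Since references are only resolved once every row has a placeid, a three-level file (village, district, province) works even when children are listed before their parents. A reference to an id that is missing from the file raises an error rather than being silently dropped.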
Currently, the TSV specification allows only a free-text entry for parent. This makes it impossible to link to parent entries that are part of the same dataset. Would it make sense to add a column like parent_id, which would allow specifying an id corresponding to the parent entry?