-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
generating row number while mapping from CSV source #5
Comments
Hm, my concern would be in this case how persistent your URIs would be.. Row numbers is a work around but are you sure they would solve your problem without generating another one? What if a new row is inserted? Wouldn't that affect your whole dataset? I'm against using row numbers as identifiers and in particular in the case of CSV files, but of course this is also a personal opinion. Why not blank nodes? In this case it's the same as row-based generated URIs without causing the expectation that they are persistent. That's at least what I'm doing in these cases and you can do that in RML by defining the subject map's term type to blank node.
|
Hi thank you for the reply and the suggestion! :-) I'm against the row number as well, but in this particular case it was a Anyway: this make me integrate a bit one of the previous question: is there 2016-10-13 13:25 GMT+02:00 andimou [email protected]:
|
Hi a good default option seems to be having the chance to adopt well-known uris for replacing blank nodes automatically. as suggested in https://www.w3.org/TR/rdf11-concepts/#rdf-documents (section "3.5 Replacing Blank Nodes with IRIs") However the possible adoption of user-mad functions to produce ID/URI/IRI suing custom criteria would be very useful in some different practical scenarios. |
Hi
I need to implement a prototype mapping on a CSV source which does not have a cell containing an identifier which I can use for constructing uri. A possibile workaround for my case is to generate a simple id column (with its values) during an ETL pre-processing phase, but I wonder if there can be more proper solution inside the specification.
For example looking at CSVW it's possible to describe data introducing a
_row
map designed for extracting row number while processing. Is it possible to use this approach with RML over CSV? If so can someone provide me a simple example?Otherwise is it possible to provide those functionalities as an extension in Java? What are the interfaces involved?
thank you in advance for any directions or suggestions
The text was updated successfully, but these errors were encountered: