-
-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Migration ARFF to Parquet on the OpenML server #50
Comments
A couple of remarks:
|
Also it would be great if the types |
Also it seems that the parquet urls from the test server are wrong. Edit: more info |
Do you have an example? We want to see if this issue was with the ARFF file or specifically introduced in the conversion. |
Parquet URLS from the test server have been disabled for now, until we have a separate minio (or bucket) for the test server. |
We'll look into that, and for the conversion scripts we'll have a closer look to preserve the feature data, or encode it into correct data types where ARFF was previously not expressive enough (e.g., boolean, 8-bit integers). |
This was a confusion from my side, sorry! |
The following will be changed for the conversion script:
Additionally feature meta-data needs to be updated:
|
This is a centralised discussion about the server side changes (being) made to the datasets in their conversion from ARFF to Parquet. Related on-going discussions that reference the server state of different datasets:
Let's keep the relevant information about the migration as it relates to server data in this thread.
This is not for connector specific discussions (for example, how
openml-python
handles this).@joaquinvanschoren @prabhant @sebffischer
The text was updated successfully, but these errors were encountered: