-
Notifications
You must be signed in to change notification settings - Fork 13
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
OMERO table space in column name #57
Comments
The issue with spaces in column names has been mentioned several times. As far as I understand, the investigation seemed to indication the limitation comes from PyTables i.e. the underlying storage mechanism for OMERO.tables. Trying to find a few pointers, from the source code, do we know if the querying issues is related to the https://github.com/PyTables/PyTables/blob/0eed850b9031fb540edd2c1ff5c81b91efeba9d6/tables/path.py#L21 If this is the underlying problem, other characters commonly used in column headers like /cc @will-moore |
An option could be to also add the CSV alongside the table. |
I can definitely see having the CSV attached as a workaround, but to some extent, it's saying that the tables services does not suffice. |
The CSV is a workaround but can be a valid option depending on the language used to access the data e.g. R due to the data manipulation java <-> R. |
ome/omero-py#287 starts exploring solutions for searching tables using columns with space in names. The underlying problem is that you cannot write a valid PyTables condition e.g. Currently blocked on passing this |
The |
The corresponding change has been merged upstream in OMERO.py - https://github.com/ome/openmicroscopy/pull/6283/files brings a proof of concept of how to write a query against a column with space in its name. I have not retested in the IDR context but I assume this issue can either can be closed (as we decided it was not an issue specific to the metadata plugin) and/or moved as a documentation issue? |
Tables in IDR have spaces in most of the columns' name. This implies that it is not possible to retrieve specifying the value in a given column e.g. give me the row with Remdesivir in the
Compound Name
column.To filter one needs to load the full table (~15mins loading time) to retrieve few relevant rows, in the remdesivir example, 24/9792 rows are relevant.
The text was updated successfully, but these errors were encountered: