Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Limited number of arguments for queries #30

Open
153957 opened this issue Aug 10, 2014 · 10 comments
Open

Limited number of arguments for queries #30

153957 opened this issue Aug 10, 2014 · 10 comments

Comments

@153957
Copy link
Member

153957 commented Aug 10, 2014

Due to some (arbitrary?) limit in NumPy (numpy/numpy#4398) there is a limit to the number of inputs in a query in PyTables (PyTables/PyTables#286).

This mostly affects the coincidence_queries module where queries may include many station columns.
For instance, if looking for coincidences that include all stations in the Amsterdam cluster, more than 32 columns will be queried. This will fail.

Example using all station columns:

>>> coln = data.root.coincidences.coincidences.colnames
>>> c = data.root.coincidences.coincidences.read_where(' & '.join(coln[11:]))
........
ValueError: too many inputs

We could work around this by evaluating the query in python instead of in-kernel.
However that would slow execution..
Perhaps we can work around this limit or see if we can also vote for an increase in the limit.

@153957
Copy link
Member Author

153957 commented May 11, 2015

Some more discussion at numpy/numpy#4840

@153957
Copy link
Member Author

153957 commented Apr 18, 2016

See this as well numpy/numpy#4398

@tomkooij
Copy link
Member

Won't fix.

@153957
Copy link
Member Author

153957 commented Sep 12, 2016

Why not?

@153957 153957 reopened this Sep 12, 2016
@tomkooij
Copy link
Member

@153957: We cannot change the numpy query limit and a work-around (in kernel query for large querys) is too much hassle.

I'm happy with leaving this open, but I was trying to close issues that seem too much hassle to fix. (Too low return on investment)

@153957
Copy link
Member Author

153957 commented Sep 12, 2016

It would actually be nice if this is fixed, currently simply selecting 'Amsterdam' as cluster in the Coincidence download form returns the 'to many stations' error:
http://data.hisparc.nl/data/network/coincidences/?end=2016-09-11+00%3A01&start=2016-09-11&stations=None&n=2&cluster=Amsterdam&download=False

Is it to much trouble to convince NumPy to change it on their side? I know various attempts have been made, with little progress..

@tomkooij
Copy link
Member

Ok, that make sense. Let's at least keep this open and hope numpy fixes this. (Or do the workaround thing...).

@153957
Copy link
Member Author

153957 commented Sep 12, 2016

I think this should at least be kept open as long as the numpy issue above is still open.

@153957
Copy link
Member Author

153957 commented Jun 24, 2024

The value was increased in NumPy 2, you can even manually set it higher it seems.
https://numpy.org/devdocs/release/2.0.0-notes.html#larger-npy-maxdims-and-npy-maxargs-npy-ravel-axis-introduced

However, PyTables does not fully support NumPy 2 yet PyTables/PyTables#1172

@153957
Copy link
Member Author

153957 commented Oct 11, 2024

Awaiting the next release of PyTables which includes this fix; PyTables/PyTables#1208

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Development

No branches or pull requests

2 participants