Performance regression of n_unique
in group_by/agg
context since 1.3.0.
#18661
Labels
bug
Something isn't working
needs triage
Awaiting prioritization by a maintainer
performance
Performance issues or improvements
python
Related to Python Polars
Checks
Reproducible example
Log output
Issue description
Since polars>=1.3.0, there appears to be a significant performance regression when using
n_unique
ingroup_by/agg
context. The bottleneck inn_unique
causes thegroup_by/agg
context to take mins or even hours to finish in my probem setting, while it takes only seconds in polars<=1.2.1.Expected behavior
Performance should be roughly the same as polars<=1.2.1.
Installed versions
The text was updated successfully, but these errors were encountered: