Values being shifted around in pl.write_parquet
or pl.read_parquet
between rows (VERY BAD!)
#16109
Closed
2 tasks done
Labels
A-io-parquet
Area: reading/writing Parquet files
accepted
Ready for implementation
bug
Something isn't working
P-critical
Priority: critical
python
Related to Python Polars
regression
Issue introduced by a new release
Checks
Reproducible example
Log output
No response
Issue description
A basic dataframe, written to a parquet, and then read back, should equal the original dataframe (especially when sorted).
This is currently not the case.
Any data lake using parquet files relying on this library is broken if they upgraded. This needs to be added as a unit test.
This broke when upgrading to v0.20.24 from v0.20.23.
Expected behavior
A basic dataframe, written to a parquet, and then read back, should equal the original dataframe (especially when sorted).
Installed versions
The text was updated successfully, but these errors were encountered: