ENH (string dtype): convert string_view columns to future string dtype instead of object dtype in Parquet/Feather IO #60235
…e instead of object dtype in Parquet IO
Thanks @jorisvandenbossche
Owee, I'm MrMeeseeks, Look at me. There seem to be a conflict, please backport manually. Here are approximate instructions:
And apply the correct labels and milestones. Congratulations — you did some good work! Hopefully your backport PR will be tested by the continuous integration and merged soon! Remember to remove the If these instructions are inaccurate, feel free to suggest an improvement.
…e instead of object dtype in Parquet/Feather IO (pandas-dev#60235) (cherry picked from commit f307a0a)
Manual backport -> #60291
This is a follow-up to #60222, which allows passing string_view data to the string dtype constructor. This PR ensures we also use that capability when reading Parquet (or Feather, ORC) files that might contain string_view columns.
PyArrow does not yet support writing string_view to Parquet, so we can't test it yet with Parquet, only with Feather.