-
Notifications
You must be signed in to change notification settings - Fork 1.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support Apache ORC File Format, and Use Sparse Index #4707
Comments
Is this feature planned? @alamb @andygrove |
Hi @hrh007 -- I do not know of any plans to support ORC at this time. I think we would welcome a contribution if you would like to do so. |
I will contribute to this issue, but it may take a lot time; because there is no official ORC implementation of the Rust language |
That does sound like an important dependency to implement first 🤔 |
There is https://github.com/DataEngineeringLabs/orc-format but it builds on arrow2 rather than arrow-rs |
Hi @hrh007 just wondering if there's been progress on this? If not then I'm interested in picking this up |
I have not made any progress yet; Glad you can participate, Thanks for your contribution! |
Great job, thanks for your efforts !!! |
BTW I think we need some more help to get ORC implemented: apache/arrow-rs#4980 (comment) |
Following discussion in apache/arrow-rs#4980 We will focus on implementing ORC file format support for Arrow in https://github.com/datafusion-contrib/datafusion-orc first Which when ready could be used by DataFusion to query Eventually we hope https://github.com/datafusion-contrib/datafusion-orc could be merged into arrow-rs which DataFusion could then use directly |
Drafted a short-term roadmap for datafusion-orc |
The text was updated successfully, but these errors were encountered: