Fast full-text search or LIKE query on Parquet file on S3 #6545
collimarco started this conversation in General
Replies: 1 comment 15 replies
-
You could try enabling filter pushdown (late materialization) for Parquet: https://docs.rs/datafusion/latest/datafusion/config/struct.ParquetOptions.html
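As a rough sketch of what that could look like from SQL (for example in `datafusion-cli`), the relevant session options can be set before running the query. The table name `logs` and column `message` are hypothetical placeholders, and this assumes the Parquet files on S3 are already registered as a table:

```sql
-- Evaluate filters while decoding Parquet, so other columns are only
-- materialized for rows that match (off by default).
SET datafusion.execution.parquet.pushdown_filters = true;

-- Optionally let DataFusion reorder pushed-down filters heuristically.
SET datafusion.execution.parquet.reorder_filters = true;

-- Hypothetical table and column names, for illustration only.
SELECT *
FROM logs
WHERE message LIKE '%timeout%';
```

Note that a pattern with a leading wildcard generally cannot be answered from min/max statistics, so the `message` column itself likely still has to be read. The saving, if any, would come from not decoding the remaining columns for rows that do not match.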
-
I am trying to filter some logs that are stored in Parquet format on S3.
I use a query like this:
Is this the right approach to search for keywords in a text field? Or is there a better/faster function to use instead of SQL LIKE?
In particular I would like to optimize the speed of the query and reduce the data downloaded (e.g. leveraging the condition pushdown).