Skip to content

How to change the row_count of RecordBatch returned from reading a parquet file #5250

Closed Answered by alamb
twitu asked this question in Q&A
Discussion options

You must be logged in to vote

I think the batch_size https://docs.rs/datafusion/17.0.0/datafusion/config/struct.ExecutionOptions.html#structfield.batch_size is the correct setting. I don't think the size is guaranteed. The 1000 rows is likely coming from the underlying reader (how are you registering alltypes_plain?)

Replies: 1 comment 6 replies

Comment options

You must be logged in to vote
6 replies
@alamb
Comment options

alamb Feb 12, 2023
Collaborator

@twitu
Comment options

@alamb
Comment options

alamb Feb 14, 2023
Collaborator

@tustvold
Comment options

@twitu
Comment options

Answer selected by twitu
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants