You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In ShfufleWriterExec, we copy data from each input RecordBatch into array builders for each partition and once a partition reaches the specified row count, we create an output batch.
Arrow provides an interleave_record_batch kernel that we could potentially use instead of the builder approach.
What is the problem the feature request solves?
In ShfufleWriterExec, we copy data from each input RecordBatch into array builders for each partition and once a partition reaches the specified row count, we create an output batch.
Arrow provides an
interleave_record_batch
kernel that we could potentially use instead of the builder approach.https://docs.rs/arrow/latest/arrow/compute/fn.interleave_record_batch.html
Describe the potential solution
Let's POC this and see if it helps with performance.
Additional context
No response
The text was updated successfully, but these errors were encountered: