First version of bulk export capability #3446
Labels
enhancement
New feature or request
parent-issue
An issue that is or should be split into multiple sub-issues
Background
This is split off from #1393. This issue is to specify what's needed for a minimum viable version of bulk export.
Description
We want a user to be able to submit a request for an entire Sleeper table to be written out to Parquet files. There should be one output file per leaf partition. Each file contains all the data for its leaf partition, in sorted order.
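To make the intended shape of the output concrete, here is a minimal sketch of how a request might be broken down into one unit of work per leaf partition. It is illustrative only: `Partition`, `LeafExportTask`, `plan_bulk_export`, `sorted_rows_for_leaf` and the output file naming are all hypothetical names made up for this example, not Sleeper's actual classes or file layout, and it does not write any Parquet.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Partition:
    """Hypothetical stand-in for one node in a table's partition tree."""
    partition_id: str
    child_ids: tuple = ()

    @property
    def is_leaf(self) -> bool:
        # A leaf partition has no children.
        return not self.child_ids


@dataclass(frozen=True)
class LeafExportTask:
    """One unit of work: export a single leaf partition to a single file."""
    partition_id: str
    output_file: str


def plan_bulk_export(partitions, output_dir):
    """Return one export task per leaf partition, ordered by partition id."""
    leaves = sorted(
        (p for p in partitions if p.is_leaf),
        key=lambda p: p.partition_id,
    )
    # Assumed naming scheme: <output_dir>/<partition id>.parquet
    return [
        LeafExportTask(p.partition_id, f"{output_dir}/{p.partition_id}.parquet")
        for p in leaves
    ]


def sorted_rows_for_leaf(rows, key_field):
    """Sort the rows of one leaf by key (in memory, for illustration only)."""
    return sorted(rows, key=lambda row: row[key_field])
```

For a tree with a root and two leaves `L` and `R`, `plan_bulk_export` would yield two tasks, one per leaf. A real implementation would presumably merge the already-sorted files in each leaf partition rather than sorting in memory.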
Analysis
There will need to be sub-issues for the different components of this. The following list describes some of the things that will need to be done:
There will be future improvements to this capability, such as the ability to specify additional filters to restrict the data that is returned, and executing the export using DataFusion. Those will be added once the basic functionality exists.
Sub tasks