Support for serving disk persisted RDDs to enable efficient Spark DRA #2778

pinireisman · 2024-10-02T07:54:18Z

pinireisman
Oct 2, 2024

When using in Spark the ExternalShuffleService - it is able to serve RDDs that have been persisted to disk from the shuffle service (as is described in SPARK-27677 ) thus allowing the executor that was responsible for that persisted data (rdd partitions) to be removed by the Dynamic Resource Allocation mechanism without losing access to the persisted RDD partitions.

Can Celeborn be used to do the same thing? It would be very beneficial if these RDD partitions can be served by Celeborn similarly as is done by the External Shuffle Service.

Does such a feature exist in Celeborn? Is anyone working on such a features?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support for serving disk persisted RDDs to enable efficient Spark DRA #2778

{{title}}

Replies: 0 comments

Select a reply

Support for serving disk persisted RDDs to enable efficient Spark DRA #2778

pinireisman Oct 2, 2024

Replies: 0 comments

pinireisman
Oct 2, 2024