@mccheah all of those have merged I think.

For the 2.3 code-freeze, this seems to me like a reasonable (maybe slightly ambitious) target -
I expect we can speed up after the first PR.

PR 1: Basic scheduler backend PR (Spark on Kubernetes - basic scheduler backend #498, [SPARK-18278] [Scheduler] Spark on Kubernetes - Basic Scheduler Backend apache/spark#19468)
- Unit testing for executorpodfactory (Add unit-testing for executorpodfactory #491)
PR 2: Basic submission client PR - covers spark-submit changes and Client.scala + the minimum set of submission steps and orchestrator ([SPARK-22646] [Submission] Spark on Kubernetes - basic submission client apache/spark#19717, Spark on Kubernetes - basic submission client #545)
- Improve Unit testing for SparkSubmit changes
- Unit tests for Client.scala
- Reference Dockerfiles for driver and executor.
PR 2.5: Docker images
- ASF procedure for pre-built images
PR 3: Init-container setup + make-distribution setup
- Improve Unit testing (TBD)
PR 4: Dynamic allocation PR
- Improve Unit testing (TBD)
PR 6: Documentation
PR ? Parallel effort on Integration Tests
- Discovery - RiseLAB CI
- Will the ASF allow us to run on external GKE/GCE cluster?

Thoughts? If we all agree, I'll post this on the JIRA.

Upstreaming and pull request strategy for Spark on Kubernetes #441

Description