This PR is long, but it does not affect the core functionality of DataFusion for Ray and does not differ from `0.1.0rc1`, which I have used extensively in benchmarking from `test.pypi`.

This PR, when merged, should be tagged `0.1.0rc2`, and I can push the artifacts created by CI to `test.pypi`. Upon successful tests, these same artifacts can be used for the actual `0.1.0` release.

Also, fixes #85
This PR squash-merges the work of building out a repeatable k8s benchmarking setup. It is far from perfect, but it has enabled significant benchmarking, which has produced intuition around execution, latency, and price trade-offs that I think will help us communicate about the project. The scripts and instructions are written to start from a fresh Ubuntu 24.04 AMI on Amazon, and will work on `arm` or `x86_64` architectures.

Summary of changes
`worker` and replace with `processor` where appropriate to indicate `DFRayProcessors`
doc/benchmarks