-
Notifications
You must be signed in to change notification settings - Fork 204
with datafusion comet,no performance improvement. #1084
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Hi @shaileneF Are you testing with the 0.3.0 release or the latest from the main branch? I am going to be running benchmarks today and tomorrow in preparation for the 0.4.0 release so will share my results with you. |
Yes,0.3.0,I download the release jar from https://datafusion.apache.org/comet/user-guide/installation.html. |
One more question @shaileneF ... is your data set partitioned by date? |
|
Here is the dataset generation shell. |
We likely need to resolve #1123 to get better performance results |
This issue has no updates in a long time so I will close. @shaileneF you may want to try again with Comet 0.7.0 since there have been significant improvements in performance since 0.3.0 |
env:
data:TPCDS 100G/1T
with datafusion comet, spark-submit shell:
without datafusion comet, spark-submit shell:
description :
The text was updated successfully, but these errors were encountered: