-
-
Notifications
You must be signed in to change notification settings - Fork 4
Investigate and potentially add support for spark connect #284
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
We have started exploring Spark Connect at GBIF.org. Our primary use case is to explore having a long running spark cluster hold an in memory cached table, for apps to do filtered data egress with minimal startup cost. |
@razvan - thank you for your work. When you are ready, we will be interested in helping to test. |
@timrobertson100 - we merged preliminary support for spark connect deployments in the main branch. Looking forward for your feedback! |
Spark Connect
Spark 3.5 introduces a new client called Spark Connect.
The use case seems to be thin clients that connect to a running spark driver.
This probably means that the operator needs to be able to start spark connect servers without spark applications and publish a service for "connect" clients.
Roadmap
Rough roadmap to GA:
integrate with the history serverSee: doc: comment on spark history integration #559Related PRs
The text was updated successfully, but these errors were encountered: