Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

docs: Update DataFusion introduction to clarify that DataFusion does provide an "out of the box" query engine #12666

Open
wants to merge 2 commits into
base: main
Choose a base branch
from

Conversation

andygrove
Copy link
Member

Which issue does this PR close?

N/A

Rationale for this change

I have recently seen confusion online about whether DataFusion is an end user tool or library for building new systems.

When people arrive at this repository and read the README, the second sentence states that DataFusion's target users are developers building fast and feature rich database and analytic systems, customized to particular workloads..

I think that most people would move on at this point if they are just looking for a query engine to use, which is a shame because we do have DataFusion subprojects specifically targetted at users.

What changes are included in this PR?

Add a section linking to packaged versions of DataFusion targeting end users.

Are these changes tested?

Are there any user-facing changes?

@github-actions github-actions bot added the documentation Improvements or additions to documentation label Sep 28, 2024
Comment on lines +50 to +55
- [DataFusion Python](https://github.com/apache/datafusion-python/) offers a Python interface for SQL and DataFrame
queries.
- [DataFusion Ray](https://github.com/apache/datafusion-ray/) provides a distributed version of DataFusion that scales
out on Ray clusters.
- [DataFusion Comet](https://github.com/apache/datafusion-comet/) is an accelerator for Apache Spark based on
DataFusion.
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Note that I intentionally did not include Ballista here since it is not very active. If the project does become active again then we could add it to this list.

@timsaucer
Copy link
Contributor

In addition to the readme, should we update the site documentation?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
documentation Improvements or additions to documentation
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants