Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature] Spark Load #220

Open
3 tasks done
gnehil opened this issue Jul 29, 2024 · 0 comments · May be fixed by #214
Open
3 tasks done

[Feature] Spark Load #220

gnehil opened this issue Jul 29, 2024 · 0 comments · May be fixed by #214

Comments

@gnehil
Copy link
Contributor

gnehil commented Jul 29, 2024

Search before asking

  • I had searched in the issues and found no similar issues.

Description

Backgroud

Currently, Spark Load is integrated into the Doris core, which brings the following problems:

  1. Due to the reliance on Spark-related dependencies, the security issues of Spark itself will cause security issues in Doris
  2. When modifying the content of Spark ETL, it also depends on the version release of Doris, which is not conducive to rapid expansion and problem repair
  3. The reliance on Hadoop ecosystem also increases the complexity of the Doris system
    Therefore, separating the task submission, management, and Spark DPP processing of Spark Load from the Doris core will help reduce the complexity of the Doris core and also help unify Doris's Spark ecosystem tools.

Use case

No response

Related issues

No response

Are you willing to submit PR?

  • Yes I am willing to submit a PR!

Code of Conduct

@gnehil gnehil linked a pull request Jul 29, 2024 that will close this issue
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant