Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Improvement]: Optimize the upsert mode in the stream ingestion scenario and reduce redundant deleted records in data files #964

Open
2 of 3 tasks
YesOrNo828 opened this issue Dec 26, 2022 · 2 comments

Comments

@YesOrNo828
Copy link
Contributor

Search before asking

  • I have searched in the issues and found no similar issues.

What would you like to be improved?

Arctic already supports an upsert table in the stream pipeline, Flink writer would write a delete record into delete files before writing each inserted record into insert files. This causes many redundant deleted records in the deleted data files, slowing down the OLAP query.

How should we improve?

No response

Are you willing to submit PR?

  • Yes I am willing to submit a PR!

Subtasks

No response

Code of Conduct

@majin1102
Copy link
Contributor

can you explain your idea of improvement?

Copy link

This issue has been automatically marked as stale because it has been open for 180 days with no activity. It will be closed in next 14 days if no further activity occurs. To permanently prevent this issue from being considered stale, add the label 'not-stale', but commenting on the issue is preferred when possible.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants