Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[core] support custom commitUser #3474

Merged

Conversation

wwj6591812
Copy link
Contributor

@wwj6591812 wwj6591812 commented Jun 4, 2024

Purpose

In our business scenario, the full data are written to the paimon table through bound stream job. Using of bound stream job instead of batch job is to enable downstream build index faster and further reduce the inc data of catchup. When bound stream job failover, the random commitUser will recover from Flink state, this is no problem.

image

But In our business scenario, we use Apache Airflow schedule Flink jobs. A Airflow node will fail due to lots of reasons. Then user clear the airflow node, a new Flink bound stream job will start with no state, so a new CommitUser will be generated. This will affect downstream judgment on whether to use a new build index processor.

So I submit this pr, we can assign a fixed commitUser to an Airflow node.

image

Tests

Add ut

API and Format

Documentation

@wwj6591812 wwj6591812 force-pushed the support_custom_commitUser_0603 branch from e4b11cf to 615d2a6 Compare June 4, 2024 15:39
Copy link
Contributor

@JingsongLi JingsongLi left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is a dangerous option, please write clearly its purpose.

@JingsongLi
Copy link
Contributor

It is not so simple... The commitIdentifier should be incremental too.

@wwj6591812 wwj6591812 force-pushed the support_custom_commitUser_0603 branch from 615d2a6 to 4bd1e3c Compare June 6, 2024 01:20
@wwj6591812
Copy link
Contributor Author

It is not so simple... The commitIdentifier should be incremental too.

Yes,I change the config from customCommitUser to customCommitUserPrefix.
pls cc,thx

@wwj6591812 wwj6591812 force-pushed the support_custom_commitUser_0603 branch from 4bd1e3c to c253c4e Compare June 6, 2024 07:50
Copy link
Contributor

@JingsongLi JingsongLi left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

+1

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants