[Enhancement] Reduce lock time of schema change in scenarios with a large number of columns (backport #52800) #52842

mergify · 2024-11-12T11:28:46Z

Why I'm doing:

This check compatible has a complexity of N^2. In a scenario with many columns (>10000), it will be very slow, causing the table lock to be held for a long time and the import will be stuck.

What I'm doing:

Using hash map to replace the link list.

For a table with 20,000 columns, the lock holding time can be optimized from 12s to 0.8s.

TODO: 0.8s is not reasonable. There is still some optimization points and continuous optimization is needed.

What type of PR is this:

Does this PR entail a change in behavior?

Yes, this PR will result in a change in behavior.
No, this PR will not result in a change in behavior.

If yes, please specify the type of change:

Interface/UI changes: syntax, type conversion, expression evaluation, display information
Parameter changes: default values, similar parameters but with different default values
Policy changes: use new policy to replace old one, functionality automatically enabled
Feature removed
Miscellaneous: upgrade & downgrade compatibility, etc.

Checklist:

I have added test cases for my bug fix or my new feature
This pr needs user documentation (for new or modified features or behaviors)
- I have added documentation for my new feature or new function
This is a backport pr

Bugfix cherry-pick branch check:

This is an automatic backport of pull request #52800 done by [Mergify](https://mergify.com). ## Why I'm doing:

This check compatible has a complexity of N^2. In a scenario with many columns (>10000), it will be very slow, causing the table lock to be held for a long time and the import will be stuck.

What I'm doing:

Using hash map to replace the link list.

For a table with 20,000 columns, the lock holding time can be optimized from 12s to 0.8s.

TODO: 0.8s is not reasonable. There is still some optimization points and continuous optimization is needed.

What type of PR is this:

Does this PR entail a change in behavior?

Yes, this PR will result in a change in behavior.
No, this PR will not result in a change in behavior.

If yes, please specify the type of change:

Interface/UI changes: syntax, type conversion, expression evaluation, display information
Parameter changes: default values, similar parameters but with different default values
Policy changes: use new policy to replace old one, functionality automatically enabled
Feature removed
Miscellaneous: upgrade & downgrade compatibility, etc.

Checklist:

I have added test cases for my bug fix or my new feature
This pr needs user documentation (for new or modified features or behaviors)
- I have added documentation for my new feature or new function
This is a backport pr

…arge number of columns (#52800) Signed-off-by: trueeyu <[email protected]> (cherry picked from commit f004276)

sonarcloud · 2024-11-12T11:38:27Z

Quality Gate passed

Issues
1 New issue
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

[Enhancement] Reduce lock time of schema change in scenarios with a l…

949603e

…arge number of columns (#52800) Signed-off-by: trueeyu <[email protected]> (cherry picked from commit f004276)

mergify bot mentioned this pull request Nov 12, 2024

[Enhancement] Reduce lock time of schema change in scenarios with a large number of columns #52800

Merged

24 tasks

github-actions bot assigned trueeyu Nov 12, 2024

github-actions bot added the automerge label Nov 12, 2024

wanpengfei-git enabled auto-merge (squash) November 12, 2024 11:29

trueeyu approved these changes Nov 13, 2024

View reviewed changes

wanpengfei-git merged commit f2abc0e into branch-3.3 Nov 13, 2024
41 checks passed

wanpengfei-git deleted the mergify/bp/branch-3.3/pr-52800 branch November 13, 2024 02:13

github-actions bot added the version:3.3.6 label Nov 13, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Enhancement] Reduce lock time of schema change in scenarios with a large number of columns (backport #52800) #52842

[Enhancement] Reduce lock time of schema change in scenarios with a large number of columns (backport #52800) #52842

mergify bot commented Nov 12, 2024 •

edited by trueeyu

Loading

sonarcloud bot commented Nov 12, 2024

[Enhancement] Reduce lock time of schema change in scenarios with a large number of columns (backport #52800) #52842

[Enhancement] Reduce lock time of schema change in scenarios with a large number of columns (backport #52800) #52842

Conversation

mergify bot commented Nov 12, 2024 • edited by trueeyu Loading

Why I'm doing:

What I'm doing:

What type of PR is this:

Checklist:

Bugfix cherry-pick branch check:

What I'm doing:

What type of PR is this:

Checklist:

sonarcloud bot commented Nov 12, 2024

Quality Gate passed

mergify bot commented Nov 12, 2024 •

edited by trueeyu

Loading