-
Notifications
You must be signed in to change notification settings - Fork 3.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[opt](paimon)Optimize the storage location of the serialized paimon table #44274
Conversation
Thank you for your contribution to Apache Doris. Please clearly describe your PR:
|
run buildall |
clang-tidy review says "All clean, LGTM! 👍" |
TeamCity be ut coverage result: |
run buildall |
clang-tidy review says "All clean, LGTM! 👍" |
TeamCity be ut coverage result: |
4ebaebd
to
4bcf742
Compare
run buildall |
clang-tidy review says "All clean, LGTM! 👍" |
TeamCity be ut coverage result: |
run buildall |
clang-tidy review says "All clean, LGTM! 👍" |
TeamCity be ut coverage result: |
run p0 |
run cloud_p0 |
run external |
@@ -152,6 +158,9 @@ private void setPaimonParams(TFileRangeDesc rangeDesc, PaimonSplit paimonSplit) | |||
if (split != null) { | |||
// use jni reader | |||
fileDesc.setPaimonSplit(encodeObjectToString(split)); | |||
if (!serializedTable.isPresent()) { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I suggest to init this serializedTable
in doInitialize()
.
56e3cc5
to
ed34249
Compare
run buildall |
run buildall |
clang-tidy review says "All clean, LGTM! 👍" |
1 similar comment
clang-tidy review says "All clean, LGTM! 👍" |
TeamCity be ut coverage result: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
PR approved by at least one committer and no changes requested. |
PR approved by anyone and no changes requested. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
+1
…able (apache#44274) Related PR: apache#43167 Previously, the serialized paimon table was placed in `TPaimonFileDesc`, which resulted in each split storing a serialized paimon table. It will wasted a lot of space if a table has lots of split. Now change the storage location of the serialized paimon table so that all splits of a table can share the data.
…able (apache#44274) Related PR: apache#43167 Previously, the serialized paimon table was placed in `TPaimonFileDesc`, which resulted in each split storing a serialized paimon table. It will wasted a lot of space if a table has lots of split. Now change the storage location of the serialized paimon table so that all splits of a table can share the data.
### What problem does this PR solve? Related PR: #44274 Problem Summary: We have already set the serialization table in `TFileScanRangeParams`, so there is no need to repeat it for each split.
### What problem does this PR solve? Related PR: #44274 Problem Summary: We have already set the serialization table in `TFileScanRangeParams`, so there is no need to repeat it for each split.
### What problem does this PR solve? Related PR: apache#44274 Problem Summary: We have already set the serialization table in `TFileScanRangeParams`, so there is no need to repeat it for each split.
What problem does this PR solve?
Related PR: #43167
Previously, the serialized paimon table was placed in
TPaimonFileDesc
, which resulted in each split storing a serialized paimon table. It will wasted a lot of space if a table has lots of split.Now change the storage location of the serialized paimon table so that all splits of a table can share the data.
Check List (For Author)
Test
Behavior changed:
Does this need documentation?
Check List (For Reviewer who merge this PR)