[Enhancement] Optimize iceberg mor performance of iceberg equality delete #51050
Conversation
@@ -578,6 +581,29 @@ void HdfsScannerContext::append_or_update_partition_column_to_chunk(ChunkPtr* ch
    ck->set_num_rows(row_count);
}

void HdfsScannerContext::append_or_update_extended_column_to_chunk(ChunkPtr* chunk, size_t row_count) {
If this is the same as the partition column, why not merge them and make it more general instead of rewriting the same logic again?
Yes, please consider merging this into that function; it will be less error-prone.
done
didn't see you change anything?
this commit b918629
@@ -0,0 +1,88 @@
// Copyright 2021-present StarRocks, Inc. All rights reserved.
I think if we cannot abstract this to leverage the connector API, we won't be able to abstract the ConnectorScanNode in the future.
yes, we will do this in the future.
long limit = scanOperator.getLimit();
ColumnRefFactory columnRefFactory = context.getColumnRefFactory();
boolean hasPartitionEvolution = deleteSchemas.stream().map(x -> x.specId).distinct().count() > 1;
If the timeline of operation is as follows:
T1: insert data
T2: partition evolution
T3: delete data
then the distinct spec id count of the delete schemas is 1, but the table has still undergone partition evolution.
this only indicates whether this query needs to add spec_id to the extended columns.
So you mean the eq delete files generated in T3 can delete data in T1?
no. the eq delete files generated in T3 won't be matched by any data files.
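For context, here is a minimal Java sketch of the check under discussion (the DeleteSchema shape is an assumed stand-in, not the PR's exact class): more than one distinct spec id among the delete schemas is what triggers adding spec_id to the extended columns, nothing more.

import java.util.List;
import java.util.Set;

class SpecIdCheckSketch {
    // Hypothetical stand-in for the PR's DeleteSchema.
    record DeleteSchema(int specId, List<Integer> equalityIds) {}

    // True when the delete schemas span multiple partition specs; only then must
    // the query materialize spec_id as an extended column for the join.
    static boolean needsSpecIdColumn(Set<DeleteSchema> deleteSchemas) {
        return deleteSchemas.stream()
                .map(DeleteSchema::specId)
                .distinct()
                .count() > 1;
    }
}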
    } else {
        _materialize_slots.push_back(slots[i]);
        _materialize_index_in_chunk.push_back(i);
    }
}

if (_scan_range.__isset.delete_column_slot_ids && !_scan_range.delete_column_slot_ids.empty()) {
Can we just remove these now? I think it may cause problems when users upgrade.
I have tested it; we will throw an exception in the scanner thread:
ERROR 1064 (HY000): Unsupported iceberg file content: 2
Relatively few users use the MOR scenario.
.map(schema -> schema.equalityIds)
.flatMap(List::stream)
.distinct()
.map(fieldId -> nativeTable.schema().findColumnName(fieldId))
The native table schema may not have a column from the delete schema if the table has had a schema change, like dropping a column?
iceberg doesn't allow dropping an identifier column.
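For reference, a self-contained sketch of the resolution step quoted above, with assumed stand-ins for the PR's classes; it relies on exactly this guarantee, that Iceberg never drops identifier columns, so every equality field id still resolves against the current schema.

import java.util.List;
import java.util.Set;
import java.util.stream.Collectors;

class EqualityColumnSketch {
    // Hypothetical stand-in for the PR's DeleteSchema.
    record DeleteSchema(int specId, List<Integer> equalityIds) {}

    // Minimal stand-in for org.apache.iceberg.Schema#findColumnName.
    interface IcebergSchema {
        String findColumnName(int fieldId);
    }

    static List<String> identifierColumnNames(Set<DeleteSchema> deleteSchemas, IcebergSchema schema) {
        return deleteSchemas.stream()
                .map(DeleteSchema::equalityIds)
                .flatMap(List::stream)
                .distinct()
                // Safe: identifier (equality) columns cannot be dropped in Iceberg.
                .map(schema::findColumnName)
                .collect(Collectors.toList());
    }
}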
    return Utils.createCompound(CompoundPredicateOperator.CompoundType.AND, onOps);
}

private LogicalIcebergScanOperator buildNewScanOperatorWithUnselectedField(Set<DeleteSchema> deleteSchemas,
buildNewScanOperatorWithExtendedField?
Yes, it includes not only the extended columns, but also the identifier columns that are not selected in the user's query.
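A minimal sketch of the ON-condition construction this feeds into (hypothetical textual stand-ins; the real code builds ScalarOperators and combines them with Utils.createCompound as quoted above): one equality conjunct per identifier column, AND-ed together.

import java.util.List;
import java.util.stream.Collectors;

class OnConditionSketch {
    // Hypothetical textual stand-in for a column-equality ScalarOperator.
    record EqConjunct(String left, String right) {}

    // One t.<col> = d.<col> conjunct per identifier column; per the discussion
    // above, spec_id is also compared when partition evolution is involved.
    static List<EqConjunct> buildOnOps(List<String> identifierColumns) {
        return identifierColumns.stream()
                .map(c -> new EqConjunct("t." + c, "d." + c))
                .collect(Collectors.toList());
    }
}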
@@ -1420,12 +1435,15 @@ public PlanFragment visitPhysicalIcebergScan(OptExpression optExpression, ExecPl
        .add(ScalarOperatorToExpr.buildExecExpression(predicate, formatterContext));
}

icebergScanNode.preProcessIcebergPredicate(node.getPredicate());
ScalarOperator icebergPredicate = !isEqDeleteScan ? node.getPredicate() :
        ((PhysicalIcebergEqualityDeleteScanOperator) node).getOriginPredicate();
What's the difference between originPredicate and predicate?
The schema of the iceberg_equality_table is a subset of the iceberg_table. E.g., the iceberg table schema is [c1, c2, c3] and the identifier column is c1. If the query predicate is c1 > 1 and c2 < 3, it can't be used as the predicate of the equality_table, so we keep it as originPredicate on the equality_table.
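In other words, only conjuncts whose columns all exist in the equality-delete table's schema can be pushed down to it. A minimal sketch of that split, with hypothetical types:

import java.util.List;
import java.util.Set;
import java.util.stream.Collectors;

class PredicateSplitSketch {
    // Hypothetical stand-in for one conjunct of the query predicate.
    record Conjunct(String text, Set<String> referencedColumns) {}

    // With identifier column c1: "c1 > 1" survives while "c2 < 3" is dropped,
    // because c2 does not exist in the equality-delete table's schema.
    static List<Conjunct> pushableToEqualityTable(List<Conjunct> conjuncts,
                                                  Set<String> identifierColumns) {
        return conjuncts.stream()
                .filter(c -> identifierColumns.containsAll(c.referencedColumns()))
                .collect(Collectors.toList());
    }
}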
LogicalIcebergEqualityDeleteScanOperator eqScanOp = new LogicalIcebergEqualityDeleteScanOperator(
        equalityDeleteTable, colRefToColumn.build(), columnToColRef.build(), -1, null,
        scanOperator.getTableVersionRange());
eqScanOp.setOriginPredicate(newScanOp.getPredicate());
Why does the eq scan operator need this? Isn't OnPredicateMoveAroundRule enough?
We need to get the scan range of the equality_table from the query-level cache by originPredicate.
Quality Gate passed
[Java-Extensions Incremental Coverage Report] ✅ pass: 0 / 0 (0%)
[FE Incremental Coverage Report] ✅ pass: 386 / 407 (94.84%)
[BE Incremental Coverage Report] ✅ pass: 70 / 74 (94.59%)
boolean hasPartitionEvolution = deleteSchemas.stream().map(x -> x.specId).distinct().count() > 1;
if (hasPartitionEvolution && !context.getSessionVariable().enableReadIcebergEqDeleteWithPartitionEvolution()) {
    throw new StarRocksConnectorException("Equality delete files aren't supported for tables with partition evolution. " +
            "You can execute `set enable_read_iceberg_equality_delete_with_partition_evolution = true` then rerun it");
Why do we need this enable_read_iceberg_equality_delete_with_partition_evolution variable? Can we just support it by default?
because there is a semantic inconsistency.
double rowCount = 0;
Set<String> seenFiles = new HashSet<>();
for (FileScanTask fileScanTask : remoteFileDesc.getIcebergScanTasks()) {
Is it worth getting all file scan tasks just to get the row count? Maybe we can just set the row count to a small number.
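For reference, a sketch of what the quoted loop appears to compute (task/file shapes are assumed): because one delete file can be attached to many scan tasks after planning, each file's record count is added only once.

import java.util.HashSet;
import java.util.List;
import java.util.Set;

class DeleteRowCountSketch {
    record DeleteFile(String path, long recordCount) {}
    record ScanTask(List<DeleteFile> deleteFiles) {}

    static double estimateDeleteRows(List<ScanTask> tasks) {
        double rowCount = 0;
        Set<String> seenFiles = new HashSet<>();
        for (ScanTask task : tasks) {
            for (DeleteFile file : task.deleteFiles()) {
                // Set#add returns false for duplicates, so each file counts once.
                if (seenFiles.add(file.path())) {
                    rowCount += file.recordCount();
                }
            }
        }
        return rowCount;
    }
}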
hdfsScanRange.setOffset(file.content() == FileContent.DATA ? task.start() : 0);
hdfsScanRange.setLength(file.content() == FileContent.DATA ? task.length() : file.fileSizeInBytes());
// For iceberg table we do not need partition id
if (!idToPartitionSlots.containsKey(partitionId)) {
Is this for an unpartitioned iceberg table? The comment looks weird.
I will remove this comment in the next patch.
Why I'm doing:
The current implementation of Iceberg MOR (reading eq-delete files) does a local left anti join in each scanner thread, in units of a scan range.
There are three problems with this:
What I'm doing:
This patch implements Iceberg equality deletes as a plan-level join, rather than reading the data file as the left table and the delete file as the right table for a local left anti join. This optimization replaces the previous solution of using a local hash joiner in each scanner thread. Compared to the previous solution, the main purpose is to reduce the overhead of repeatedly reading delete files and repeatedly building hash tables, since an Iceberg equality delete file may be matched by many data files after Iceberg planning. The rule must strictly meet its check requirements before the plan can be rewritten.
There are three conditions that must be met for the rewrite:
We'll rewrite three patterns:
The first, common case: the iceberg identifier columns (also the same as the pk) are identifier_col and p1.
The second case, with a mutable pk column: the pk column before altering the table is k1; the pk column after altering the table is k1.
The third case, with partition evolution: a partitioned table with 1 delete schema: [k1, p1] and partition column [p1]. Write some records to this table. Then alter the table's partition field (partition evolution): (p1 -> bucket(5, p1)), and write some more records to this table.
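Across all three patterns, the rewrite evaluates Iceberg MOR semantics as a left anti join. A simplified row-level sketch (hypothetical shapes; the strict sequence-number comparison is standard Iceberg equality-delete behavior, assumed here rather than quoted from this PR):

import java.util.List;

class EqDeleteSemanticsSketch {
    record Row(long dataSequenceNumber, List<Object> identifierValues) {}

    // A data row survives unless some equality-delete row with a strictly newer
    // data sequence number matches it on the identifier columns; this is what
    // LEFT ANTI JOIN ... ON t.id_cols = d.id_cols AND t.$seq < d.$seq computes.
    static boolean survives(Row dataRow, List<Row> deleteRows) {
        return deleteRows.stream().noneMatch(d ->
                d.identifierValues().equals(dataRow.identifierValues())
                        && dataRow.dataSequenceNumber() < d.dataSequenceNumber());
    }
}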
Fixes #issue
some poc tests
TODO: