
Bump pulsar version to 3.2.0-SNAPSHOT #5902

Closed
wants to merge 60 commits

Conversation

streamnativebot

This is a PR created by snbot to trigger the check suite in each repository.

RobertIndie and others added 30 commits August 30, 2023 00:23
…che#21070)

### Motivation

Currently, when the producer resends a chunked message like this:
- M1: UUID: 0, ChunkID: 0
- M2: UUID: 0, ChunkID: 0 // Resend the first chunk
- M3: UUID: 0, ChunkID: 1

When the consumer receives M2, it finds that it is already tracking the chunked message with UUID:0, and it then discards both M1 and M2. As a result, the whole chunked message can never be consumed, even though it is fully persisted in the Pulsar topic.

Here is the code logic:
https://github.com/apache/pulsar/blob/44a055b8a55078bcf93f4904991598541aa6c1ee/pulsar-client/src/main/java/org/apache/pulsar/client/impl/ConsumerImpl.java#L1436-L1482

The bug can be easily reproduced using the test case `testResendChunkMessages` introduced by this PR.


### Modifications

- When receiving a duplicated first chunk of a chunked message, the consumer discards the current chunked-message context and creates a new context to track the following chunks, as in the sketch below. For the case mentioned in Motivation, M1 is released and the consumer assembles M2 and M3 into the chunked message.
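A minimal sketch of that behavior (hypothetical `ChunkedMessageTracker`/`ChunkedMessageCtx` names, not the actual `ConsumerImpl` code):

```java
import java.util.HashMap;
import java.util.Map;

class ChunkedMessageTracker {
    // Hypothetical per-UUID context for the chunks received so far.
    static class ChunkedMessageCtx {
        int lastChunkId = -1;
    }

    private final Map<String, ChunkedMessageCtx> contexts = new HashMap<>();

    void onChunk(String uuid, int chunkId) {
        ChunkedMessageCtx ctx = contexts.get(uuid);
        if (ctx != null && chunkId == 0) {
            // Duplicated first chunk: the producer restarted this chunked
            // message. Discard the stale context (releasing M1) and track the
            // resent chunks (M2, M3) in a fresh context.
            contexts.remove(uuid);
            ctx = null;
        }
        if (ctx == null) {
            ctx = new ChunkedMessageCtx();
            contexts.put(uuid, ctx);
        }
        ctx.lastChunkId = chunkId;
        // ... buffer the chunk payload ...
    }
}
```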
)

Motivation: When deleting a namespace, the znode under the path `/loadbalance/bundle-data` needs to be deleted from the `local metadata store`, not the `global metadata store`.

Modifications: Delete the bundle-data znode from the local metadata store.
…ache#20948)

## Motivation
Make chunked messages work properly when deduplication is enabled.
## Modification
### Only check and store the sequence ID of the last chunk in a chunk message.
 For example:
 ```markdown
     Chunk-1 sequence ID: 0, chunk ID: 0, total chunk: 2
     Chunk-2 sequence ID: 0, chunk ID: 1
     Chunk-3 sequence ID: 1, chunk ID: 0, total chunk: 3
     Chunk-4 sequence ID: 1, chunk ID: 1
     Chunk-5 sequence ID: 1, chunk ID: 1
     Chunk-6 sequence ID: 1, chunk ID: 2
```
Only check and store the sequence IDs of Chunk-2 and Chunk-6.
**Add a property to the publishContext to determine whether this chunk is the last chunk when persistence completes.**
```java
publishContext.setProperty(IS_LAST_CHUNK, Boolean.FALSE);
```
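A minimal sketch of how a deduplicator could apply this rule (hypothetical `ChunkAwareDeduplicator` name and fields, not the actual `MessageDeduplication` code):

```java
import java.util.HashMap;
import java.util.Map;

class ChunkAwareDeduplicator {
    private final Map<String, Long> highestSequencePerProducer = new HashMap<>();

    /** Returns true if the chunk should be persisted, false if it duplicates
     *  an already-completed chunked message. */
    boolean shouldPersist(String producerName, long sequenceId, boolean isLastChunk) {
        if (!isLastChunk) {
            // Intermediate chunks are neither checked nor recorded; only the
            // last chunk of a chunked message takes part in deduplication.
            return true;
        }
        Long highest = highestSequencePerProducer.get(producerName);
        if (highest != null && sequenceId <= highest) {
            return false;
        }
        highestSequencePerProducer.put(producerName, sequenceId);
        return true;
    }
}
```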
### Filter and ack duplicated chunks in a chunk message instead of discarding the chunked-message context.
 For example:
 ```markdown
     Chunk-1 sequence ID: 0, chunk ID: 0, msgID: 1:1
     Chunk-2 sequence ID: 0, chunk ID: 1, msgID: 1:2
     Chunk-3 sequence ID: 0, chunk ID: 2, msgID: 1:3
     Chunk-4 sequence ID: 0, chunk ID: 1, msgID: 1:4
     Chunk-5 sequence ID: 0, chunk ID: 2, msgID: 1:5
     Chunk-6 sequence ID: 0, chunk ID: 3, msgID: 1:6
```
We should filter out and ack Chunk-4 and Chunk-5.
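A minimal sketch of that filter-and-ack behavior (hypothetical `ChunkFilter`/`ChunkedMessageCtx` names; only `Consumer#acknowledge` is the real client API):

```java
import org.apache.pulsar.client.api.Consumer;
import org.apache.pulsar.client.api.MessageId;
import org.apache.pulsar.client.api.PulsarClientException;

class ChunkFilter {
    // Hypothetical context tracking the highest chunk ID seen for a UUID.
    static class ChunkedMessageCtx {
        int lastChunkId = -1;
    }

    void onChunk(ChunkedMessageCtx ctx, int chunkId, MessageId msgId,
                 Consumer<?> consumer) throws PulsarClientException {
        if (chunkId <= ctx.lastChunkId) {
            // Duplicated chunk (Chunk-4/Chunk-5 above): ack it so it does not
            // linger unacked, then drop it without touching the context.
            consumer.acknowledge(msgId);
            return;
        }
        ctx.lastChunkId = chunkId;
        // ... append the chunk payload to the message buffer ...
    }
}
```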
Signed-off-by: tison <[email protected]>
Co-authored-by: Alexander Preuß <[email protected]>
Co-authored-by: tison <[email protected]>
… was failed (apache#20935)

The process of persisting the mark-deleted position is as follows:
- Persist to BK
- If persisting to BK fails, try to persist to ZK

But in the current implementation, if creating the cursor ledger fails, Pulsar does not try to persist to ZK. So when cursor-ledger creation fails, a lot of ack records cannot be persisted, and we get a lot of repeated consumption after BK recovers.

Modifications: Try to persist the mark-deleted position to ZK if creating the cursor ledger fails, as in the sketch below.
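A minimal sketch of the intended fallback (hypothetical method names, not the actual `ManagedCursorImpl` code):

```java
class MarkDeletePersistence {
    void persistMarkDeletedPosition(long ledgerId, long entryId) {
        try {
            persistToBookKeeper(ledgerId, entryId); // normal path
        } catch (Exception bkFailure) {
            // Runs for write failures AND for cursor-ledger creation failures;
            // previously the creation-failure case skipped this fallback.
            persistToMetadataStore(ledgerId, entryId);
        }
    }

    private void persistToBookKeeper(long ledgerId, long entryId) throws Exception {
        throw new Exception("cursor ledger creation failed"); // simulate failure
    }

    private void persistToMetadataStore(long ledgerId, long entryId) {
        // write the mark-deleted position to ZK
    }
}
```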
…and schema. (apache#21093)

Fixes apache#21075 

### Motivation

When a topic is deleted while it is loaded, its topic-level policies are deleted as well (if topic-level policies are enabled). But if the topic is not loaded, it is deleted directly through the managed ledger factory, which leaves the topic policies behind. The next time a topic with the same name is created, it picks up the old policies.

### Modifications

When deleting the topic, delete the schema and topic policies even if the topic is not loaded.
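A minimal sketch of the deletion flow after this change (hypothetical helper names, async error handling omitted):

```java
class TopicDeleter {
    void deleteTopic(String topic, boolean topicLoaded) {
        if (topicLoaded) {
            closeAndDeleteLoadedTopic(topic);
        } else {
            deleteManagedLedger(topic); // previously this branch stopped here
        }
        deleteSchema(topic);        // now always cleaned up...
        deleteTopicPolicies(topic); // ...so a recreated topic starts fresh
    }

    private void closeAndDeleteLoadedTopic(String topic) {}
    private void deleteManagedLedger(String topic) {}
    private void deleteSchema(String topic) {}
    private void deleteTopicPolicies(String topic) {}
}
```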
## Motivation
Handle the ack-hole case:
For example:
```markdown
     Chunk-1 sequence ID: 0, chunk ID: 0, msgID: 1:1
     Chunk-2 sequence ID: 0, chunk ID: 1, msgID: 1:2
     Chunk-3 sequence ID: 0, chunk ID: 0, msgID: 1:3
     Chunk-4 sequence ID: 0, chunk ID: 1, msgID: 1:4
     Chunk-5 sequence ID: 0, chunk ID: 2, msgID: 1:5
```
The consumer acks a chunked message via a `ChunkMessageIdImpl` that consists of all the chunks tracked for this chunked message (Chunk-3, Chunk-4, Chunk-5). Chunk-1 and Chunk-2 are not included in the `ChunkMessageIdImpl`, so we need to handle them here.
## Modification
Ack Chunk-1 and Chunk-2, as in the sketch below.
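A minimal sketch of that handling (hypothetical `AckHoleCloser` name; only `Consumer#acknowledge` is the real client API):

```java
import java.util.List;
import org.apache.pulsar.client.api.Consumer;
import org.apache.pulsar.client.api.MessageId;
import org.apache.pulsar.client.api.PulsarClientException;

class AckHoleCloser {
    // orphanedChunkIds would hold the message IDs of chunks that were replaced
    // by a resend (Chunk-1 and Chunk-2 above) and therefore never made it into
    // the ChunkMessageIdImpl that the application acknowledges.
    void ackOrphanedChunks(List<MessageId> orphanedChunkIds, Consumer<?> consumer)
            throws PulsarClientException {
        for (MessageId orphan : orphanedChunkIds) {
            consumer.acknowledge(orphan); // close the ack hole
        }
    }
}
```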
### Modifications
After upgrading Pulsar from 2.9.2 to 2.10.3, the isolated group feature stopped working.
We eventually found the problem: in IsolatedBookieEnsemblePlacementPolicy, when it gets the bookie rack from the metadata store cache, it uses future.isDone() to avoid a sync operation and returns empty blacklists if the future is incomplete.
The cache may expire due to the Caffeine cache's `getExpireAfterWriteMillis` config; once the cache expires, the future may be incomplete. (apache#21095 will correct that behavior.)

In 2.9.2 the data was fetched from the metadata store synchronously; we should keep that behavior, as in the sketch below.
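A minimal sketch of the synchronous behavior (hypothetical `RackResolver` name and 30s timeout, not the actual policy code):

```java
import java.util.Collections;
import java.util.Set;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.TimeUnit;

class RackResolver {
    Set<String> getBlacklistedBookies(CompletableFuture<Set<String>> cachedRacks) {
        try {
            // Wait for the metadata-store read instead of checking
            // future.isDone(), so an expired cache entry no longer
            // silently yields an empty blacklist.
            return cachedRacks.get(30, TimeUnit.SECONDS);
        } catch (Exception e) {
            return Collections.emptySet(); // fall back only on a real failure
        }
    }
}
```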
Technoboy- and others added 27 commits September 6, 2023 22:06
…n per broker (apache#21144)

Motivation: Pulsar has two mechanisms to guarantee that even if a producer connects to the broker multiple times, the result is still correct.

- In a connection, the second connection waits for the first connection to complete.
- In a topic, the second connection will override the previous one.

However, if a producer can use different connections to connect to the broker, these two mechanisms will not work.

When the `connectionsPerBroker` config of `PulsarClient` is larger than `1`, a producer may use more than one connection, and the two mechanisms above no longer apply. You can reproduce this issue with the test `testSelectConnectionForSameProducer`.

Modifications: Make the same producer/consumer always use the same connection, as in the sketch below.
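A minimal sketch of one way to pin a producer to a connection (hypothetical helper; the real fix lives in the client's connection pool):

```java
class ConnectionSelector {
    // Deterministic index derived from a per-producer key: every (re)connect
    // of the same producer maps to the same pooled connection, even when
    // connectionsPerBroker > 1.
    int selectConnectionIndex(long producerKey, int connectionsPerBroker) {
        return (int) Math.floorMod(producerKey, (long) connectionsPerBroker);
    }
}
```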
…21051)

pip: apache#21052
### Motivation

Introduce the `getLastMessageIds` API to Reader.

### Modifications

Implement the `getLastMessageIds` API for Reader.
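A usage sketch, assuming the new Reader API mirrors `Consumer#getLastMessageIds()` and returns one `TopicMessageId` per topic the reader is attached to:

```java
import java.util.List;
import org.apache.pulsar.client.api.MessageId;
import org.apache.pulsar.client.api.PulsarClient;
import org.apache.pulsar.client.api.Reader;
import org.apache.pulsar.client.api.TopicMessageId;

public class ReaderLastIdsExample {
    public static void main(String[] args) throws Exception {
        try (PulsarClient client = PulsarClient.builder()
                .serviceUrl("pulsar://localhost:6650")
                .build();
             Reader<byte[]> reader = client.newReader()
                .topic("persistent://public/default/my-topic")
                .startMessageId(MessageId.earliest)
                .create()) {
            // Fetch the last message ID of each topic without a consumer.
            List<TopicMessageId> lastIds = reader.getLastMessageIds();
            lastIds.forEach(id -> System.out.println(id.getOwnerTopic() + " -> " + id));
        }
    }
}
```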
apache#21155)

#### Issue 1
The client assumed the connection was inactive, but the broker assumed the connection was fine. The client tried to use a new connection to reconnect a producer, then got the error `Producer with name 'st-0-5' is already connected to topic`.

#### Issue 2
- In a connection, the second producer registration waits for the first one to complete, but a bug causes this mechanism to fail.
- If a producer uses a default name, the second registration overrides the first one, but it cannot override the first one if a producer name is specified. This mechanism is meant to prevent a client from creating two producers with the same name. However, the method `Producer.isSuccessorTo` already checks the `producer-id`, and the `producer-id`s of multiple producers created by the same client are different, so this mechanism can be deleted.

### Modifications

- For `issue 1`: If a producer with the same name tries to use a new connection, asynchronously check whether the old connection is still available. Producers related to a connection that is no longer available are cleaned up automatically (see the sketch after this list).

- For `issue 2`:
  - Fix the bug that caused a completed producer future to be removed from `ServerCnx`.
  - Remove the mechanism that prevents a producer with a specified name from overriding the previous producer.
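A minimal sketch of the issue-1 handling (hypothetical `ServerConnection` stand-in, not the real `ServerCnx` API):

```java
import java.util.concurrent.CompletableFuture;

class DuplicateProducerHandler {
    // Hypothetical stand-in for the broker-side connection object.
    interface ServerConnection {
        CompletableFuture<Boolean> checkAlive(); // e.g. a ping/pong probe
        void cleanupProducers();
    }

    CompletableFuture<Boolean> canRegisterOnNewConnection(ServerConnection oldCnx) {
        return oldCnx.checkAlive().thenApply(alive -> {
            if (!alive) {
                // The old connection is dead: drop its producers so the new
                // registration does not hit "producer is already connected".
                oldCnx.cleanupProducers();
            }
            return !alive; // true = accept the producer on the new connection
        });
    }
}
```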
…the fatal exception (apache#21143)

PIP: apache#21079 

### Motivation

Currently, the connector and function framework cannot terminate the function instance when fatal exceptions are thrown
outside the function instance thread: the existing exception handlers for connectors and Pulsar Functions only cover
exceptions raised on that thread.

For example, suppose we have a sink connector that uses its own threads to batch-sink the data to an external system. If
any fatal exceptions occur in those threads, the function instance thread will not be aware of them and will
not be able to terminate the connector. This will cause the connector to hang indefinitely. There is a related issue
here: apache#9464

The same problem exists for the source connector. The source connector may also use a separate thread to fetch data from
an external system. If any fatal exceptions happen in that thread, the connector will also hang forever. This issue has
been observed for the Kafka source connector: apache#9464. We have fixed it by adding
the notifyError method to the `PushSource` class in PIP-281: apache#20807. However, this
does not solve the same problem that all source connectors face because not all connectors are implemented based on
the `PushSource` class.

The problem is the same for Pulsar Functions. Currently, a function can't throw fatal exceptions to the function
framework. We need to provide a way for function developers to do so.

We need a way for the connector and function developers to throw fatal exceptions outside the function instance
thread. The function framework should catch these exceptions and terminate the function accordingly.

### Modifications

Introduce a new method `fatal` to the context. All connector implementation code and function code
can use this context and call the `fatal` method to terminate the instance while raising a fatal exception.

After the connector or function raises the fatal exception, the function instance thread is interrupted.
The function framework then catches the exception, logs it, and terminates the function instance.
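A usage sketch, assuming the `fatal(Throwable)` signature described in the PIP; `BatchingSink` and `flushToExternalSystem` are hypothetical:

```java
import java.util.Map;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import org.apache.pulsar.functions.api.Record;
import org.apache.pulsar.io.core.Sink;
import org.apache.pulsar.io.core.SinkContext;

public class BatchingSink implements Sink<byte[]> {
    private final ExecutorService flusher = Executors.newSingleThreadExecutor();
    private SinkContext context;

    @Override
    public void open(Map<String, Object> config, SinkContext context) {
        this.context = context;
    }

    @Override
    public void write(Record<byte[]> record) {
        // Sink work happens on the sink's own thread, not the instance thread.
        flusher.submit(() -> {
            try {
                flushToExternalSystem(record); // hypothetical helper
            } catch (Exception e) {
                // Report the failure from outside the instance thread; the
                // framework interrupts the instance, logs, and terminates it.
                context.fatal(e);
            }
        });
    }

    private void flushToExternalSystem(Record<byte[]> record) throws Exception {}

    @Override
    public void close() {
        flusher.shutdown();
    }
}
```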
@github-actions github-actions bot added the PIP label Sep 18, 2023
@streamnativebot streamnativebot deleted the branch-3.2.0-SNAPSHOT branch September 25, 2023 21:05