[improve][broker] Do-not-merge #251
Closed
Conversation
(cherry picked from commit e42faff)
…ion. (apache#20597)" This reverts commit aa565c2. (cherry picked from commit 35b449b)
(cherry picked from commit f3bb89d)
…eption. (apache#20818) (cherry picked from commit 9d580b4)
### Motivation

- The task `trim ledgers` runs in the thread `BkMainThreadPool.choose(ledgerName)`.
- The task `write entries to BK` runs in the thread `BkMainThreadPool.choose(ledgerId)`.

So the two tasks above may run concurrently.

The task `trim ledgers` works as follows:
- Find the ledgers that are no longer needed for reading; the result is `{Ledgers before the slowest read}`.
- Check whether the `{Ledgers before the slowest read}` fall outside the retention policy; the result is `{Ledgers to be deleted}`:
  - If the create time of the ledger is earlier than the earliest retention time, mark it for deletion.
  - If, after deleting this ledger, the remaining ledgers are still larger than the retention size, mark it for deletion.
- Delete the `{Ledgers to be deleted}`.

**(Highlight)** There is a scenario that causes the task `trim ledgers` to delete a discontinuous range of ledgers, making message consumption discontinuous:
- Context:
  - Ledgers: `[{id=1, size=100}, {id=2, size=100}]`
  - Retention size: 150
  - No cursor on the topic
- Check `ledger 1`: skipped by the retention check, `(200 - 100) < 150`.
- One in-flight write finishes, so `calculateTotalSizeWrited()` now returns `300`.
- Check `ledger 2`: the retention check `(300 - 100) > 150` marks ledger-2 for deletion.
- Delete `ledger 2`.
- Create a new consumer. It will receive messages from `[ledger-1, ledger-3]`, but `ledger-2` is skipped.

### Modifications

Once the retention constraint has been met, break the loop.

(cherry picked from commit 782e91f)
(cherry picked from commit b87c0fb)
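To make the modification concrete, here is a minimal, hedged sketch of the retention loop with the early break. The names (`LedgerInfo`, `retentionSizeBytes`, `totalSize`) are illustrative assumptions, not Pulsar's actual managed-ledger API:

```java
// Sketch only: once a ledger is retained (neither expired nor over the size budget),
// stop scanning instead of re-evaluating later ledgers against a total size that a
// concurrently finished write may have just increased.
import java.util.ArrayList;
import java.util.List;

class RetentionSketch {
    record LedgerInfo(long id, long size, long createTimeMs) {}

    static List<LedgerInfo> ledgersToDelete(List<LedgerInfo> candidates,
                                            long totalSize,
                                            long retentionSizeBytes,
                                            long earliestRetentionTimeMs) {
        List<LedgerInfo> toDelete = new ArrayList<>();
        long remaining = totalSize;
        for (LedgerInfo ledger : candidates) {
            boolean expired = ledger.createTimeMs() < earliestRetentionTimeMs;
            boolean overSize = (remaining - ledger.size()) > retentionSizeBytes;
            if (expired || overSize) {
                toDelete.add(ledger);
                remaining -= ledger.size();
            } else {
                // Fix: retention is satisfied here; break so a later ledger cannot be
                // deleted while this one is kept, which would leave a hole in the topic.
                break;
            }
        }
        return toDelete;
    }

    public static void main(String[] args) {
        List<LedgerInfo> ledgers = List.of(
            new LedgerInfo(1, 100, 0), new LedgerInfo(2, 100, 0));
        // With totalSize=200 and retention=150, neither ledger is deleted.
        System.out.println(ledgersToDelete(ledgers, 200, 150, -1));
    }
}
```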
…pache#21035)

Motivation: After [PIP-118: reconnect broker when ZooKeeper session expires](apache#13341), the broker no longer shuts down after losing the connection to the local metadata store in the default configuration. However, any BK online/offline events that occur before the ZK client reconnects are lost, leaving incorrect BK info in memory. You can reproduce the issue with the test `BkEnsemblesChaosTest.testBookieInfoIsCorrectEvenIfLostNotificationDueToZKClientReconnect` (roughly 90% probability of reproducing the issue; run it again if it does not occur).

Modifications: Refresh the BK info in memory after the ZK client reconnects.

(cherry picked from commit db20035)
(cherry picked from commit 5f99925)
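A minimal sketch of the idea behind the modification, assuming a metadata store that exposes session events and a registry object with a refresh method; the interfaces and names below are illustrative, not Pulsar's actual classes:

```java
// Sketch: when the metadata store signals that the session was re-established,
// re-read the bookie list so that online/offline events missed while disconnected
// are not lost.
import java.util.function.Consumer;

class BookieInfoRefreshSketch {
    enum SessionEvent { SessionLost, SessionReestablished, Reconnected }

    interface MetadataStore {
        void registerSessionListener(Consumer<SessionEvent> listener);
    }

    interface BookieRegistry {
        void refreshBookieList(); // re-reads bookie metadata from the store
    }

    static void wireRefresh(MetadataStore store, BookieRegistry registry) {
        store.registerSessionListener(event -> {
            if (event == SessionEvent.SessionReestablished || event == SessionEvent.Reconnected) {
                // Notifications may have been dropped while the ZK client was
                // disconnected, so rebuild the in-memory bookie info from scratch.
                registry.refreshBookieList();
            }
        });
    }
}
```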
…he (apache#20763)" This reverts commit 111c14d. (cherry picked from commit ab9384b)
…ed on topic, when dedup is enabled and no producer is there (apache#20951) (cherry picked from commit 30073db) (cherry picked from commit f68589e)
…ata when doing lookup (apache#21063)

Motivation: If we set `allowAutoTopicCreationType` to `PARTITIONED`, the topic-creation flow looks like this:
1. `Client-side`: Look up the topic to get the partitioned topic metadata in order to create a producer.
2. `Broker-side`: Create the partitioned topic metadata.
3. `Broker-side`: Respond with `{"partitions":3}`.
4. `Client-side`: Create separate connections for each partition of the topic.
5. `Broker-side`: Receive 3 connect requests and create 3 partition topics.

In `step 2` above, the flow is:
1. Check the topic auto-creation policy (the policy is `{allowAutoTopicCreationType=PARTITIONED, defaultNumPartitions=3}`).
2. Check whether the partitioned topic metadata already exists.
3. Try to create the partitioned topic metadata if it does not exist.
4. If creation fails because the partitioned topic metadata already exists (perhaps another broker is creating it at the same time), read the partitioned topic metadata from the metadata store and respond to the client.

There is a race condition that makes the client get non-partitioned metadata for the topic:

| time | `broker-1` | `broker-2` |
| --- | --- | --- |
| 1 | Get policy: `PARTITIONED, 3` | Get policy: `PARTITIONED, 3` |
| 2 | Check whether the partitioned topic metadata already exists | Check whether the partitioned topic metadata already exists |
| 3 | Partitioned topic metadata does not exist; the metadata cache caches an empty optional for the path | Partitioned topic metadata does not exist; the metadata cache caches an empty optional for the path |
| 4 | | Successfully create the partitioned topic metadata |
| 5 | Receive a ZK node changed event to invalidate the cache of the partitioned topic metadata | |
| 6 | Creating the metadata fails because it already exists | |
| 7 | Read the partitioned topic metadata again | |

If `step-5` is executed later than `step-7`, `broker-1` will get an empty optional from the cache of the partitioned topic metadata and respond with non-partitioned metadata to the client.

**What would make `step-5` execute later than `step-7`?** One scenario: as in the issue fixed by PR apache#20303, `zk operations` and `zk node changed notifications` are executed in different threads: the `main-thread of the ZK client` and the `metadata store thread`. Therefore, the partitioned topic metadata lookup mechanism is fragile and needs to be optimized.

Modifications: Before reading the partitioned topic metadata again, refresh the cache first.

(cherry picked from commit d099ac4)
(cherry picked from commit 2e534d2)
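A hedged sketch of the modification, assuming a cache abstraction with a `refresh` operation; the `MetadataCache` interface and method names below are illustrative, not Pulsar's exact metadata API:

```java
// Sketch: if creating the partitioned-topic metadata fails because it already exists,
// refresh the cache entry before reading it back, so a stale "empty" cache entry cannot
// be returned to the client as non-partitioned metadata.
import java.util.concurrent.CompletableFuture;

class LookupSketch {
    record PartitionedTopicMetadata(int partitions) {}

    interface MetadataCache {
        CompletableFuture<Void> create(String path, PartitionedTopicMetadata value);
        CompletableFuture<Void> refresh(String path); // drop any cached value and re-read
        CompletableFuture<PartitionedTopicMetadata> get(String path);
    }

    static CompletableFuture<PartitionedTopicMetadata> createOrGet(
            MetadataCache cache, String path, PartitionedTopicMetadata desired) {
        return cache.create(path, desired)
            .thenApply(ignore -> desired)
            .exceptionallyCompose(alreadyExists ->
                // Fix: refresh first, then read, instead of trusting a possibly stale
                // (empty) cache entry populated before the other broker's create.
                cache.refresh(path).thenCompose(v -> cache.get(path)));
    }
}
```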
(cherry picked from commit 597ba82)
…pache#21169) (cherry picked from commit 65706c6) (cherry picked from commit faee123)
This reverts commit 53665ee.
… many requests (apache#21216)

Motivation: The Pulsar client will close the socket if it receives a `ServiceNotReady` error while doing a lookup. The broker responds to the client with a `TooManyRequests` error when there are too many lookup requests in progress, but the Pulsar Proxy responds with a `ServiceNotReady` error in the same scenario.

Modifications: Make the Pulsar Proxy respond to the client with a `TooManyRequests` error when there are too many lookup requests in progress.

(cherry picked from commit d6c3fa4)
(cherry picked from commit 9a7c4bb)
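An illustrative sketch of the behavior change, assuming a semaphore-based limit on in-flight lookups in the proxy; the class, field, and method names are assumptions, not the proxy's actual code:

```java
// Sketch: map "too many pending lookup requests" to TooManyRequests rather than
// ServiceNotReady, because the client only closes the connection on ServiceNotReady;
// TooManyRequests lets it back off and retry on the same socket.
import java.util.Optional;
import java.util.concurrent.Semaphore;

class ProxyLookupSketch {
    enum ServerError { ServiceNotReady, TooManyRequests }

    private final Semaphore lookupRequestSemaphore = new Semaphore(50000);

    // Returns the error to send back, or empty if the lookup may proceed.
    Optional<ServerError> tryAcquireLookupPermit() {
        if (!lookupRequestSemaphore.tryAcquire()) {
            // Before the fix the proxy answered ServiceNotReady here, which made the
            // client drop the whole connection instead of simply retrying later.
            return Optional.of(ServerError.TooManyRequests);
        }
        return Optional.empty();
    }
}
```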
…er id when switch ledger (apache#21201)

### Modifications
- Print a warning log on SSL handshake errors.
- Print the ledger ID when switching ledgers.

(cherry picked from commit 8485d68)
(cherry picked from commit f243925)
…ing queue is not empty (apache#21259)

Reproduce steps:
- Create a reader.
- The reader pulls messages into the `incoming queue`; do not call `reader.readNext` yet.
- The trim-ledger task deletes the ledgers, so there is nothing left in the topic.
- Now, you can get messages if you call `reader.readNext`, but the method `reader.hasMessageAvailable` returns `false`.

Note: a similar issue in `MultiTopicsConsumerImpl` was fixed by apache#13332; the current PR only fixes the issue in `ConsumerImpl`.

Make `reader.hasMessageAvailable` return `true` when the `incoming queue` is not empty.

(cherry picked from commit 6d82b09)
(cherry picked from commit 38c3f0c)
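A minimal sketch of the consumer-side change, assuming an incoming-messages queue field on the consumer; the names below are illustrative rather than the exact `ConsumerImpl` internals:

```java
// Sketch: hasMessageAvailable() should report true whenever messages are already
// buffered in the local incoming queue, even if the broker-side ledgers holding
// them were trimmed afterwards.
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.LinkedBlockingQueue;

class ReaderSketch<T> {
    private final BlockingQueue<T> incomingMessages = new LinkedBlockingQueue<>();

    CompletableFuture<Boolean> hasMessageAvailableAsync() {
        // Fix: short-circuit on locally buffered messages before comparing positions
        // against the (possibly trimmed) topic on the broker.
        if (!incomingMessages.isEmpty()) {
            return CompletableFuture.completedFuture(true);
        }
        return checkBrokerForNewMessages();
    }

    private CompletableFuture<Boolean> checkBrokerForNewMessages() {
        // Placeholder for the position comparison against the broker's last message id.
        return CompletableFuture.completedFuture(false);
    }
}
```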
### Motivation

After trimming ledgers, the variable `lastConfirmedEntry` of the managed ledger might point to a deleted ledger (the latest ledger that contained data). There is a bug that allows users to set the start read position to a non-existent or deleted ledger when creating a subscription, which makes the `backlog` and `markDeletedPosition` wrong.

### Modifications

Fix the bug.

(cherry picked from commit 4ee5cd7)
(cherry picked from commit dd28bb4)
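A hedged sketch of the property the fix needs to guarantee, using an illustrative position/ledger-map representation rather than the actual managed-ledger API:

```java
// Sketch: when a subscription is created with an explicit start position, do not accept
// a position whose ledger no longer exists (e.g. it was trimmed); fall back to a valid
// position instead of corrupting backlog/markDeletedPosition.
import java.util.NavigableMap;
import java.util.TreeMap;

class StartPositionSketch {
    record Position(long ledgerId, long entryId) {}

    static Position sanitizeStartPosition(Position requested,
                                          NavigableMap<Long, Long> ledgers, // ledgerId -> entry count
                                          Position lastConfirmedEntry) {
        if (!ledgers.containsKey(requested.ledgerId())) {
            // The requested ledger was deleted or never existed: use the last
            // confirmed entry rather than trusting the stale position.
            return lastConfirmedEntry;
        }
        return requested;
    }

    public static void main(String[] args) {
        NavigableMap<Long, Long> ledgers = new TreeMap<>();
        ledgers.put(3L, 10L);
        System.out.println(sanitizeStartPosition(
            new Position(2, 0), ledgers, new Position(3, 9))); // falls back to ledger 3
    }
}
```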
…gers (apache#21250)

### Background
- After trimming ledgers, `ml.lastConfirmedPosition` points to a deleted ledger when the current ledger of the ML is empty.
- The cursor prevents setting `markDeletedPosition` to a value larger than `ml.lastConfirmedPosition`, even though there are no entries to read<sup>[1]</sup>.
- The code description in the method `advanceCursors` says: do not make `cursor.markDeletedPosition` larger than `ml.lastConfirmedPosition`<sup>[2]</sup>.

### Issue

If there is no durable cursor, the `markDeletedPosition` might be set to `{current_ledger, -1}`, and the `async mark delete` will be prevented by `rule-2` above. So the `backlog`, `readPosition`, and `markDeletedPosition` of the cursor will be left at incorrect positions after trimming the ledger. You can reproduce it with the test `testTrimLedgerIfNoDurableCursor`.

### Modifications

Do not make `cursor.markDeletedPosition` larger than `ml.lastConfirmedPosition` when advancing non-durable cursors.

(cherry picked from commit ca77982)
(cherry picked from commit 6895919)
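A minimal sketch of the modification under illustrative names (not the exact ManagedCursor/ManagedLedger API): clamp the advanced mark-delete position at `lastConfirmedEntry`.

```java
// Sketch: when advancing non-durable cursors after a trim, never move the mark-delete
// position past the managed ledger's last confirmed entry.
class AdvanceCursorsSketch {
    record Position(long ledgerId, long entryId) implements Comparable<Position> {
        @Override
        public int compareTo(Position o) {
            int c = Long.compare(ledgerId, o.ledgerId);
            return c != 0 ? c : Long.compare(entryId, o.entryId);
        }
    }

    static Position clampMarkDelete(Position proposed, Position lastConfirmedEntry) {
        // Fix: never advance the cursor beyond the last confirmed entry, otherwise
        // backlog/readPosition/markDeletedPosition end up pointing at a deleted ledger.
        return proposed.compareTo(lastConfirmedEntry) > 0 ? lastConfirmedEntry : proposed;
    }
}
```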
…ne faster (apache#21183)

There is an issue similar to the one fixed by apache#21155: the client assumed the connection was inactive, but the broker assumed the connection was fine. The client tried to use a new connection to reconnect an exclusive consumer and got the error `Exclusive consumer is already connected`.

- Check whether the connection of the old consumer is still available when the new one tries to subscribe.

(cherry picked from commit 29db8f8)
(cherry picked from commit b796f56)
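A hedged sketch of the check described above, with assumed `Consumer`/`Cnx` shapes for illustration only:

```java
// Sketch: before rejecting a new exclusive consumer with "Exclusive consumer is already
// connected", verify that the existing consumer's connection is actually still active;
// if it is not, evict it and let the new consumer take over.
class ExclusiveSubscribeSketch {
    interface Cnx { boolean isActive(); }
    interface Consumer { Cnx cnx(); void close(); }

    static boolean canAcceptNewExclusiveConsumer(Consumer existing) {
        if (existing == null) {
            return true;
        }
        if (!existing.cnx().isActive()) {
            // The client already considers this connection dead; drop the stale
            // consumer instead of failing the new subscription.
            existing.close();
            return true;
        }
        return false;
    }
}
```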
(cherry picked from commit 3875864)
…ptions caused by wrong topicName (apache#21997)

Similar to apache#20131. The master branch has fixed the issue via apache#19841. Since apache#19841 would prevent users from receiving messages that were created by mistake, we did not cherry-pick it into other branches; see the details in apache#19841.

It works like this:
1. createSubscription(`tp1`)
2. Is it a partitioned topic?
   - `no`: return subscriptions
   - `yes`: createSubscription(`tp1-partition-0`) ... createSubscription(`tp1-partition-n`)

---

```java
String partitionedTopic = "tp1-partition-0-DLQ";
TopicName partition0 = TopicName.get(partitionedTopic).getPartition(0);
// Highlight: partition0.toString() will be "tp1-partition-0-DLQ" (which is wrong).
// The correct value is "tp1-partition-0-DLQ-partition-0".
```

Therefore, if there is a partitioned topic named `tp1-partition-0-DLQ`, the method `PersistentTopics.createSubscription` works like this:
1. Call the Admin API `PersistentTopics.createSubscription("tp1-partition-0-DLQ")`.
2. Is it a partitioned topic?
3. Yes, so call `TopicName.getPartition(0)` to get partition 0, which returns `tp1-partition-0-DLQ`, and loop back to step 1.

The infinite HTTP calls to `PersistentTopics.createSubscription` then make the broker crash.

If the issue that makes the topic name wrong is hit, do not loop back to step 1. The PR apache#19841 fixes the issue that makes the topic name wrong, but it introduces an unfriendly compatibility change; PIP 263 (apache#20033) will make the compatibility good.

(cherry picked from commit 4386401)
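A hedged sketch of the guard the fix describes ("do not loop back to step 1"), using schematic helper names rather than the real `PersistentTopics`/`TopicName` code:

```java
// Sketch: when expanding a partitioned topic into its partitions, stop if the derived
// partition name is the same as the name we started from, so a topic whose name already
// looks like "xxx-partition-0-DLQ" cannot make createSubscription recurse forever.
class CreateSubscriptionSketch {
    static String getPartitionName(String topic, int partitionIndex) {
        // Mirrors the buggy behaviour described above: a name that already contains
        // "-partition-" is returned unchanged instead of gaining a new suffix.
        return topic.contains("-partition-") ? topic : topic + "-partition-" + partitionIndex;
    }

    static void createSubscriptionOnPartitions(String topic, int partitions) {
        for (int i = 0; i < partitions; i++) {
            String partitionName = getPartitionName(topic, i);
            if (partitionName.equals(topic)) {
                // Guard: the derived name did not change, so recursing would call
                // createSubscription on the same topic again and never terminate.
                throw new IllegalStateException("Refusing to recurse into " + topic);
            }
            createSubscription(partitionName);
        }
    }

    static void createSubscription(String topic) { /* admin call placeholder */ }
}
```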
(cherry picked from commit fd3d8b9)
…to the admin (apache#22169) (cherry picked from commit 21660bd)
…og.error` (apache#22140) (apache#22213) (cherry picked from commit dc035f5)
…ma method. (apache#22204) (cherry picked from commit 7130248)
(cherry picked from commit 15a658e)
… reads (apache#22295) (cherry picked from commit 2803ba2) (cherry picked from commit fde7c49)
…o same topics (apache#22255) (cherry picked from commit c616b35) (cherry picked from commit 5ab0e93)
(cherry picked from commit 1c46877)
…he#21274) Co-authored-by: Baodi Shi <[email protected]> (cherry picked from commit 5d18ff7) (cherry picked from commit 000ee66)
…c loading wouldn't timeout (apache#22479) (cherry picked from commit 837f8bc) (cherry picked from commit 0fbcbb2)
…ks (apache#22463) (cherry picked from commit f3d14a6) (cherry picked from commit 976399c)
…paths or disabling it (apache#22370) (cherry picked from commit 15ed659) (cherry picked from commit cdc5af1)
…d fix race conditions in metricsBufferResponse mode (apache#22494) (cherry picked from commit 7009071) (cherry picked from commit 5f9d7c5)
…pache#22034) (cherry picked from commit d0ca983) (cherry picked from commit 54042df)
…xception and testIncorrectClientClock (apache#22489) (cherry picked from commit d9a43dd) (cherry picked from commit c590198)
Co-authored-by: hoguni <[email protected]>
(cherry picked from commit 20915d1)
# Conflicts:
#	pom.xml
(cherry picked from commit ef9b28f)
apache#22475) (cherry picked from commit 767f0b2)
…umberOfEntriesInStorage" This reverts commit e3531e8.
mukesh-ctds force-pushed the 3.1_ds_cherrypicks_from_3.0-fix branch 8 times, most recently from 2e65ead to 6070b4d on April 19, 2024 at 11:08
…onstructor (apache#22264) (cherry picked from commit 16cf199)
mukesh-ctds force-pushed the 3.1_ds_cherrypicks_from_3.0-fix branch from c0b175c to dde20bf on April 19, 2024 at 13:40
Motivation
This PR syncs all commits from apache/branch-3.0 into 3.1_ds that are not already present.
Modifications
Verifying this change
This change is a trivial rework / code cleanup without any test coverage.