Hey Tommy, The way we deal with partitions migrating during the lifetime of an outstanding FetchRequest is a per-partition fetch-state version: a running counter on each partition that is incremented each time there's a change to the fetcher state, whether that's a seek, an offset reset, or a leader change. When we send a FetchRequest we store this per-partition version along with the request object, and when the FetchResponse comes back we verify, for each partition in the response, that the version stored on the request object matches the partition's current version: if not, we simply discard the response MessageSets for that partition. https://github.com/edenhill/librdkafka/blob/master/src/rdkafka_broker.c#L4548-L4575
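A minimal sketch of that version-check idea, in TypeScript for readability. The types and names (`PartitionState`, `InflightFetch`, `sendFetch`, etc.) are illustrative only, not librdkafka's actual structures:

```typescript
// Sketch of a per-partition fetch-state version, as described above.
// All names here are hypothetical, simplified stand-ins.

interface PartitionState {
  version: number;  // bumped on every seek, offset reset, or leader change
  queue: string[];  // batches delivered to the application queue
}

interface InflightFetch {
  // partition id -> version captured when the request was sent
  versions: Map<number, number>;
}

const partitions = new Map<number, PartitionState>();

function bumpVersion(p: number): void {
  // Called on seek(), offset reset, or partition migration to a new leader.
  partitions.get(p)!.version += 1;
}

function sendFetch(ids: number[]): InflightFetch {
  // Snapshot each partition's current version alongside the request.
  const versions = new Map<number, number>();
  for (const p of ids) versions.set(p, partitions.get(p)!.version);
  return { versions };
}

function handleResponse(req: InflightFetch, batches: Map<number, string[]>): void {
  for (const [p, msgs] of batches) {
    // If the partition state changed while the request was in flight,
    // the response data is stale: drop it silently.
    if (req.versions.get(p) !== partitions.get(p)!.version) continue;
    partitions.get(p)!.queue.push(...msgs);
  }
}
```

The key property is that the check is purely local and lock-free from the response handler's point of view: no coordination with other fetchers is needed, because any state change (including migration) invalidates in-flight responses by construction.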
It definitely happens in production, where all the rarities and corner cases can't wait to show their pretty faces. Hope that helps!
Hey there! I read in the FAQ that the high-level behavior of partition fetching in librdkafka is that there's a thread per broker, and that thread will issue fetch requests for a given set of partitions based on a few criteria (having an offset to fetch from, and the fetch queue being within its min and max bounds).
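For concreteness, the eligibility check described above could be sketched roughly like this (names and the bound constant are made up, not librdkafka or KafkaJS API):

```typescript
// Hypothetical per-partition fetch-eligibility predicate.

interface PartitionFetchState {
  nextOffset: number | null; // null until an offset to fetch from is known
  queuedBatches: number;     // batches fetched but not yet consumed
}

const MAX_QUEUED_BATCHES = 10; // stand-in for the configured queue bound

function shouldFetch(p: PartitionFetchState): boolean {
  // Fetch only when an offset is resolved and the local queue
  // has not yet hit its upper bound.
  return p.nextOffset !== null && p.queuedBatches < MAX_QUEUED_BATCHES;
}
```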
We are currently in the process of redesigning the consumer concurrency model in KafkaJS, which now uses a fairly similar model. We don't use multithreading, but the high-level behavior is similar due to the async, non-blocking nature of NodeJS.
One question that came up is what happens when a partition is reassigned from one broker to another while this is going on. To give an example, imagine that we have 3 brokers, and a topic with 3 partitions.
Assuming that we had not committed offsets for p2 yet by the time we got to point 5, and that the partitions were small enough that b3 got in sync before then, my expectation would be that we have now fetched messages from p2 in two different threads, and that the fetch queue potentially contains data for the same offsets more than once.
Our current solution is to have some synchronization between the different fetchers: we filter out batches for partitions that another fetcher is currently fetching, or for which there are still batches in the queue. While this is a rather rare case, I'm not entirely happy with this solution, as it causes us to still fetch data that we then just discard. So I was curious how librdkafka handles this case, and whether you have a better solution in place, or perhaps know something that makes this a case we don't need to worry about. I couldn't find any KIP discussing this case, but it's possible I missed one.
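The cross-fetcher filter described above could look roughly like this; a shared registry records which fetcher currently owns each partition, and batches from anyone else (or for partitions with data still queued) are discarded. All names here are illustrative, not the actual KafkaJS internals:

```typescript
// Hypothetical sketch of cross-fetcher batch filtering via shared ownership.

const owner = new Map<number, string>();         // partition -> owning fetcher id
const queuedBatches = new Map<number, number>(); // partition -> batches still queued

function claim(partition: number, fetcherId: string): void {
  // Record that this fetcher is now responsible for the partition.
  owner.set(partition, fetcherId);
}

function shouldKeepBatch(partition: number, fetcherId: string): boolean {
  // Drop the batch if another fetcher owns the partition...
  if (owner.get(partition) !== fetcherId) return false;
  // ...or if earlier batches for it are still in the processing queue.
  if ((queuedBatches.get(partition) ?? 0) > 0) return false;
  return true;
}
```

Compared to a per-partition version counter, this needs shared state between fetchers, which is presumably part of the dissatisfaction expressed above.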