
Auto offset commits (the right way) #782

Merged · 23 commits · Jan 2, 2024
Conversation

@zcox (Collaborator) commented Sep 8, 2023

The problem with using the underlying Java Kafka consumer's auto offset commits is that if processing a record fails (i.e. a runtime exception is thrown), the Resource we wrap the consumer in closes it, and on close that consumer commits the offsets of the record(s) that were not processed. This leads to at-most-once processing, which is the worst.

kafka4s already provides a readProcessCommit operation, but that commits offsets after every single successfully processed record, which can put a lot of load on Kafka brokers. It is still at-least-once processing (at least for the last record, the one that failed), since the consumer does not commit that offset on close.

This PR introduces another operation, that is like readProcessCommit, but will only commit offsets after either some number of records is processed, or some amount of time has passed since the last offset commit. We get less offset commit load on Kafka, and still get at-least-once processing (all your consumers are idempotent, right?).

This new operation will also commit offsets of successfully processed records on a failure. This is still at-least-once, but minimizes the reprocessing required after restart.

A future PR should add a batch version of this operation.
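The commit cadence described above (commit after either some number of records or some elapsed time, whichever comes first) can be sketched as a small piece of state. This is a hypothetical illustration of the idea, not kafka4s's actual internals; `CommitState` and its field names are made up for this sketch:

```scala
import scala.concurrent.duration._

// Illustrative sketch only: tracks how many records have been processed and
// when offsets were last committed, and decides when the next commit is due.
final case class CommitState(recordCount: Long, lastCommitNanos: Long) {

  // Called once per successfully processed record. Returns the next state
  // and whether offsets should be committed now.
  def record(
      nowNanos: Long,
      maxRecordCount: Long,
      maxElapsedTime: FiniteDuration
  ): (CommitState, Boolean) = {
    val count   = recordCount + 1
    val elapsed = (nowNanos - lastCommitNanos).nanos
    if (count >= maxRecordCount || elapsed >= maxElapsedTime)
      (CommitState(0L, nowNanos), true)   // commit now and reset the window
    else
      (copy(recordCount = count), false)  // keep accumulating
  }
}
```

Either trigger resets both the count and the clock, so a slow topic still gets periodic commits and a busy topic never accumulates more than `maxRecordCount` uncommitted records.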

@zcox (Collaborator, Author) commented Sep 10, 2023

Added an idea in 9309a2d to help minimize reprocessing after a failure. Could also add a boolean argument to enable/disable that behavior.

@zcox zcox marked this pull request as ready for review December 30, 2023 02:15
@zcox zcox requested a review from a team as a code owner December 30, 2023 02:15
*/
def processingAndCommitting[A](
pollTimeout: FiniteDuration,
maxRecordCount: Long = 1000L,
Collaborator commented:
If someone passes in a negative number, or a negative duration, would anything unexpected happen? I'm not sure how consumer.recordStream(-2.seconds) would deal with it, but in the other two cases it would mean we'd always commit for every record. The question then becomes whether that is desirable, and whether we think it is obvious enough at the call site.

Collaborator (Author) replied:

Good point. This probably needs more validation of inputs.
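One minimal way to validate the inputs, per the review comment above. This is an illustrative sketch, not the actual kafka4s change; `validateInputs` and its parameter names mirror the signature shown in the diff but are assumptions here:

```scala
import scala.concurrent.duration._

// Hypothetical guard: reject non-positive settings up front so that a
// negative maxRecordCount or maxElapsedTime can't silently degrade into
// committing after every record.
def validateInputs(
    pollTimeout: FiniteDuration,
    maxRecordCount: Long,
    maxElapsedTime: FiniteDuration
): Unit = {
  require(pollTimeout > Duration.Zero, s"pollTimeout must be positive: $pollTimeout")
  require(maxRecordCount > 0L, s"maxRecordCount must be positive: $maxRecordCount")
  require(maxElapsedTime > Duration.Zero, s"maxElapsedTime must be positive: $maxElapsedTime")
}
```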

* offsets for failed records when the consumer is closed. The consumer must
* be configured to disable offset auto-commits.
*/
def processingAndCommitting[A](
Contributor commented:

Does Stream.groupWithin help here? It emits chunks on the maxRecordCount-and-maxElapsedTime cadence.

Collaborator (Author) replied:

TIL Stream.groupWithin. Reading its docs, IIUC it buffers up elements of the input stream until it has accumulated a certain number of them or the timeout has expired. For a Kafka consumer, I don't think we'd want that behavior, since it would introduce delay.

We want elements to flow from the input stream as they're read from Kafka, and to take an action (i.e. commit offsets) after a number of records or a timeout. Stream.groupWithin is close to that, but if it delays records reaching the process function, we wouldn't want it.
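The distinction being debated can be sketched with fs2. This is a hedged illustration, assuming fs2 and cats-effect; `process` and `commit` are hypothetical stand-ins, and the chunk size and timeout are arbitrary:

```scala
import scala.concurrent.duration._
import cats.effect.IO
import cats.syntax.all._
import fs2.Stream

// Buffering variant: groupWithin BEFORE processing holds records back
// until a chunk fills or the timeout fires -- the delay objected to above.
def buffered(
    records: Stream[IO, String],
    process: String => IO[Unit],
    commit: IO[Unit]
): Stream[IO, Unit] =
  records
    .groupWithin(1000, 10.seconds)                   // records wait in the chunk
    .evalMap(chunk => chunk.traverse_(process) *> commit)

// Pass-through variant: records are processed immediately as they arrive;
// groupWithin AFTER processing only gates when offsets get committed.
def passThrough(
    records: Stream[IO, String],
    process: String => IO[Unit],
    commit: IO[Unit]
): Stream[IO, Unit] =
  records
    .evalTap(process)                                // no delay before processing
    .groupWithin(1000, 10.seconds)                   // batching gates only the commit
    .evalMap(_ => commit)
```

The pass-through shape is consistent with the contributor's follow-up below that groupWithin could work if it came after the processing step.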

Contributor replied:

Not if it came after L390, but I would need to think about the onError on line 385. Don't let this block it, but I may play type tetris later.

@zcox zcox merged commit 37b46b2 into Banno:main Jan 2, 2024
5 checks passed
@zcox zcox deleted the commit-less branch January 2, 2024 21:24
amohrland pushed a commit that referenced this pull request Jan 16, 2024
Auto offset commits (the right way)
4 participants