Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Logic to batch up compaction commits #3904

Open
patchwork01 opened this issue Dec 11, 2024 · 0 comments
Open

Logic to batch up compaction commits #3904

patchwork01 opened this issue Dec 11, 2024 · 0 comments

Comments

@patchwork01
Copy link
Collaborator

patchwork01 commented Dec 11, 2024

Background

Split from:

Description

We'd like to write the logic to sit in a lambda that will take a batch of compaction commits, organise them by Sleeper table, write them to S3 in one file for each Sleeper table, then send those files to the state store committer queue.

Analysis

The lambda will take a batch of compaction commit SQS messages and pass it to this new class.

Because we split up the batch into Sleeper tables and send each table's batch independently, that means that each Sleeper table can fail independently. We'll need to report which messages did and did not fail, with some way for the lambda to know which SQS messages it refers to (e.g. a message handle like in ingest and compaction tasks).

We can assume the serialisation/deserialisation of the requests and batches will be covered in the following issue:

We will follow up with the Lambda, SQS and S3 code in a separate issue:

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant