Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Issue 1668 - Check files have the correct jobId assigned to them in compaction job state store update #1684

Merged

Conversation

kr565370
Copy link
Collaborator

@kr565370 kr565370 commented Jan 15, 2024

Rerunning functional system tests

Make sure you have checked all steps below.

Issue

Tests

  • My PR adds the following tests OR does not need testing for this extremely good reason:
    • Updating existing tests
    • Added new test for jobId not assigned to files in each FileReferenceStore implementation

Documentation

  • In case of new functionality, my PR adds documentation that describes how to use it, or I have linked to a
    separate issue for that below.
  • If I have added, removed, or updated any external dependencies used in the project, I have updated the
    NOTICES file to reflect this.

@kr565370 kr565370 changed the title Issue 1668 - Check job files have the correct jobId assigned to them in compaction job state store updates Issue 1668 - Check job files have the correct jobId assigned to them in compaction job state store update Jan 15, 2024
@kr565370 kr565370 changed the title Issue 1668 - Check job files have the correct jobId assigned to them in compaction job state store update Issue 1668 - Check files have the correct jobId assigned to them in compaction job state store update Jan 15, 2024
@kr565370 kr565370 added the pr-stacked-top A stacked pull request that we don't want to merge into its target until the target PR is merged label Jan 15, 2024
Base automatically changed from 1666-rename-fileinfo-to-filereference to develop January 17, 2024 09:50
@patchwork01 patchwork01 removed the pr-stacked-top A stacked pull request that we don't want to merge into its target until the target PR is merged label Jan 17, 2024
@patchwork01 patchwork01 removed their assignment Jan 17, 2024
…statestore-update

# Conflicts:
#	java/compaction/compaction-job-execution/src/main/java/sleeper/compaction/jobexecution/CompactSortedFiles.java
#	java/core/src/test/java/sleeper/core/statestore/inmemory/InMemoryFileReferenceStoreTest.java
#	java/statestore/src/test/java/sleeper/statestore/dynamodb/DynamoDBFileReferenceStoreIT.java
@patchwork01 patchwork01 assigned gaffer01 and unassigned kr565370 Jan 17, 2024
@@ -147,7 +148,8 @@ private RecordsProcessed compactNoSplitting() throws IOException, IteratorExcept
// Create writer
LOGGER.debug("Creating writer for file {}", compactionJob.getOutputFile());
Path outputPath = new Path(compactionJob.getOutputFile());
ParquetWriter<Record> writer = ParquetRecordWriterFactory.createParquetRecordWriter(outputPath, tableProperties, conf);
ParquetWriter<Record> writer = ParquetRecordWriterFactory.createParquetRecordWriter(
outputPath, tableProperties, conf, ParquetFileWriter.Mode.OVERWRITE);
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can you add a comment here saying why we've chosen to use OVERWRITE mode?

@gaffer01 gaffer01 removed their assignment Jan 17, 2024
@patchwork01 patchwork01 added the pr-base-for-stacking Base for stacked pull requests (a dependency for others, where this PR's branch will be the base) label Jan 17, 2024
@kr565370 kr565370 merged commit 5561f87 into develop Jan 17, 2024
15 checks passed
@kr565370 kr565370 deleted the 1668-check-job-files-have-job-ids-before-statestore-update branch January 17, 2024 16:52
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
pr-base-for-stacking Base for stacked pull requests (a dependency for others, where this PR's branch will be the base)
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Compaction jobs can fail to process if one has been picked up by a task before it has been created
3 participants