Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Bug] Dedicated Compaction for write-only table cann't clean up small files after running for a long time #2675

Open
1 of 2 tasks
bridgeDream opened this issue Jan 11, 2024 · 5 comments
Labels
bug Something isn't working

Comments

@bridgeDream
Copy link
Contributor

bridgeDream commented Jan 11, 2024

Search before asking

  • I searched in the issues and found nothing similar.

Paimon version

0.6

Compute Engine

flink 1.16

Minimal reproduce step

  1. start a flink job for writing to paimon table with mode "write-only" with checkpoint as 5s
  2. start a Dedicated Compaction flink to to compact paimon table
  3. after running over 1 days
  4. I found some small files yesterday still exist

What doesn't meet your expectations?

When writing and compact job runs for more than 2 days, I found small files with timestamp as "2023-12-28 19:43:36" still exist in 2023-12-29。
image

Anything else?

No response

Are you willing to submit a PR?

  • I'm willing to submit a PR!
@bridgeDream bridgeDream added the bug Something isn't working label Jan 11, 2024
@wg1026688210
Copy link
Contributor

Hi~ @bridgeDream did you set the snapshot expiration config.

@bridgeDream
Copy link
Contributor Author

bridgeDream commented Jan 11, 2024

Hi~ @bridgeDream did you set the snapshot expiration config.
@wg1026688210

no, just using default config;
image

@AnemoneIndicum
Copy link

I also have same problems as well.

@wg1026688210
Copy link
Contributor

You can try cleaning up the orphan file and confirming whether this file can be cleaned up. If it is not cleaned up, it is possible that the snapshot corresponding to this file has not expired.

@dierbei
Copy link

dierbei commented Mar 21, 2024

@bridgeDream @AnemoneIndicum I had the same problem, did you guys solve it?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

4 participants