You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I am deploying a Robinhood instance to scan the filesystem and purge unused "old files." We define old files as those that have not been read or written in the last 90 days.
Since BeeGFS is not Lustre (thank you, Captain Obvious 👑), Robinhood needs to scan the entire filesystem to update its information instead of parsing filesystem logs.
Enabling last file access time is inefficient because the file system needs to update the time stamp by writing data to the disk even in cases when the user only reads file contents or when the file contents have already been cached in memory, and no disk access would have been necessary at all.
Since relatime is enabled, last_access timestamps may not be updated accurately when files are read, leading to potential unintended deletions of recently accessed files.
Question
How can I properly handle this situation to ensure only truly unused files are deleted, without accidentally removing files that have been read but not modified?
The text was updated successfully, but these errors were encountered:
According to Redhat 6 (probably valid for higher versions):
relatime maintains atime data, but not for each time that a file is accessed. With this option enabled, atime data is written to the disk only if the file has been modified since the atime data was last updated (mtime), or if the file was last accessed more than a certain amount of time ago (by default, one day).
Description
I am deploying a Robinhood instance to scan the filesystem and purge unused "old files." We define old files as those that have not been read or written in the last 90 days.
Since BeeGFS is not Lustre (thank you, Captain Obvious 👑), Robinhood needs to scan the entire filesystem to update its information instead of parsing filesystem logs.
Here are my current rules:
However, BeeGFS is mounted with the
relatime
flag, as shown below:Problem
According to the [BeeGFS documentation](https://doc.beegfs.io/latest/advanced_topics/storage_tuning.html#mount-options):
Since
relatime
is enabled,last_access
timestamps may not be updated accurately when files are read, leading to potential unintended deletions of recently accessed files.Question
How can I properly handle this situation to ensure only truly unused files are deleted, without accidentally removing files that have been read but not modified?
The text was updated successfully, but these errors were encountered: