I would like to read a lot of small files from S3 at high speed. I previously used goofys, whose --cheap option dramatically sped up loading small files (see kahing/goofys@941ac6a) by skipping the parallel check of whether a URI names a file or a prefix. In my case I use this for PyTorch data loading, where all my requests are valid files, so this parallel check is unnecessary.
Could we have a similar option for mountpoint-s3?
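For context, here is a minimal Python sketch of the double round trip such an option would avoid. This uses boto3 to stand in for the filesystem's lookup path; the `cheap` flag and helper names are illustrative, not actual goofys or Mountpoint code:

```python
# Hypothetical sketch of the file-vs-prefix lookup that --cheap skips.
# Names ("cheap", lookup) are illustrative, not Mountpoint's API.
from concurrent.futures import ThreadPoolExecutor

import boto3
from botocore.exceptions import ClientError

s3 = boto3.client("s3")


def is_file(bucket: str, key: str) -> bool:
    """One HeadObject round trip: does an object exist at this exact key?"""
    try:
        s3.head_object(Bucket=bucket, Key=key)
        return True
    except ClientError:
        return False


def is_prefix(bucket: str, key: str) -> bool:
    """One ListObjectsV2 round trip: do any keys live under `key/`?"""
    resp = s3.list_objects_v2(Bucket=bucket, Prefix=key + "/", MaxKeys=1)
    return resp["KeyCount"] > 0


def lookup(bucket: str, key: str, cheap: bool = False):
    if cheap:
        # Cheap mode: assume the path names a file; one HEAD suffices.
        return "file" if is_file(bucket, key) else None
    # Default mode: check both interpretations in parallel, since either
    # (or both) may exist in S3's flat keyspace.
    with ThreadPoolExecutor(max_workers=2) as pool:
        file_fut = pool.submit(is_file, bucket, key)
        prefix_fut = pool.submit(is_prefix, bucket, key)
        if prefix_fut.result():
            return "directory"
        return "file" if file_fut.result() else None
```

For a training workload issuing millions of lookups, dropping the ListObjectsV2 call halves the request count on the lookup path, which is where the speedup in the goofys commit comes from.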
Thanks for the feature request. This might not fit with Mountpoint's semantics today, because we decided that files should be shadowed by directories with the same name (see the detailed semantics). We are considering what can be done in this area, but we do not have anything to share yet.