We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Running
python ./downloader.py -data_root ./tmp -number_of_classes 1000 -images_per_class 500
This in classes with 507 images, 505 images, 507 images... etc. What's the reason for this slight more than 500 images?
Does this have something to do with the concurrent downloading, and the fact that some URLs no longer work?
The text was updated successfully, but these errors were encountered:
Also I read https://mf1024.github.io/2019/06/09/how-to-scrape-the-imagenet/ which is great research on this issue. Imagenet is not really a reliable source of data.
However nowhere does it indicate why there slightly more images per class than requested.
Sorry, something went wrong.
Does anybody know ? thanks.
@huntkao @CMCDragonkai It's the multiprocessing workers. It's fucking with the code. Set it to 1 and it'll download only as many as you specify
No branches or pull requests
Running
This in classes with 507 images, 505 images, 507 images... etc. What's the reason for this slight more than 500 images?
Does this have something to do with the concurrent downloading, and the fact that some URLs no longer work?
The text was updated successfully, but these errors were encountered: