Survey for datasets.
- Gebru, Timnit, et al. "Datasheets for datasets." arXiv preprint arXiv:1803.09010 (2018).
- Holland, Sarah, et al. "The dataset nutrition label: A framework to drive higher data quality standards." arXiv preprint arXiv:1805.03677 (2018).
- Bender, Emily M., and Batya Friedman. "Data statements for natural language processing: Toward mitigating system bias and enabling better science." Transactions of the Association for Computational Linguistics 6 (2018): 587-604.
- Hutchinson, Ben, et al. "Towards accountability for machine learning datasets: Practices from software engineering and infrastructure." Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency. 2021.
- Changpinyo, Soravit, et al. "Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To Recognize Long-Tail Visual Concepts." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR2021).
- Haurum, Joakim Bruslund, and Thomas B. Moeslund. "Sewer-ML: A Multi-Label Sewer Defect Classification Dataset and Benchmark." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR2021).
- Van Horn, Grant, et al. "Benchmarking Representation Learning for Natural World Image Collections." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR2021).
- Hu, Yuan-Ting, et al. "SAIL-VOS 3D: A Synthetic Dataset and Baselines for Object Detection and 3D Mesh Reconstruction from Video Data." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR2021).
- Liang, Jie, et al. "PPR10K: A Large-Scale Portrait Photo Retouching Dataset with Human-Region Mask and Group-Level Consistency." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR2021).