Repo: texture-vs-shape
We reproduced the oral ICLR paper ``ImageNet-trained CNNs are biased towards texture; increasing shape bias improves accuracy and robustness''. We implemented the most important parts of the original paper and briefly discuss experiments on a subset of the ImageNet dataset (only 16 classes). We found that a ResNet-50 trained on Stylized ImageNet is more accurate and robust than the same network trained only on ImageNet, and we verified that shape-based representations are more robust than texture-based ones. All code except the style transfer was written by the two authors.
A subset of ImageNet (only 16 classes), referred to as IN-16, and its stylized version, referred to as SIN-16, are used as raw inputs for training.
The figure below shows an example of a stylized image. In a stylized image, the original texture is replaced by another random texture; only the shape is preserved. The datasets can be downloaded from imagenet-16 and stylizedimagenet-16.
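Once downloaded, the 16-class datasets can be loaded with standard `torchvision` tools. Below is a minimal sketch; the directory names (`imagenet-16/train`, `stylized-imagenet-16/train`) are assumptions about how the archives unpack, not guaranteed paths.

```python
# Minimal loading sketch. Directory names are assumptions about how the
# downloaded archives unpack; adjust them to your local layout.
import torch
from torchvision import datasets, transforms

preprocess = transforms.Compose([
    transforms.Resize(256),
    transforms.CenterCrop(224),
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.485, 0.456, 0.406],   # ImageNet statistics
                         std=[0.229, 0.224, 0.225]),
])

in16 = datasets.ImageFolder("imagenet-16/train", transform=preprocess)
sin16 = datasets.ImageFolder("stylized-imagenet-16/train", transform=preprocess)

in16_loader = torch.utils.data.DataLoader(in16, batch_size=64,
                                          shuffle=True, num_workers=4)
```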
Code for style transfer using AdaIN can be found here (stylize-datasets).
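For reference, the core AdaIN operation is small enough to sketch directly: content features keep their spatial layout (shape) while adopting the channel-wise mean and standard deviation (texture statistics) of the style features. This is only the central operation, not the full VGG encoder/decoder pipeline used by stylize-datasets.

```python
import torch

def adain(content_feat: torch.Tensor, style_feat: torch.Tensor,
          eps: float = 1e-5) -> torch.Tensor:
    """AdaIN core: (N, C, H, W) content features are re-normalized to
    match the per-channel mean/std of the style features."""
    c_mean = content_feat.mean(dim=(2, 3), keepdim=True)
    c_std = content_feat.std(dim=(2, 3), keepdim=True)
    s_mean = style_feat.mean(dim=(2, 3), keepdim=True)
    s_std = style_feat.std(dim=(2, 3), keepdim=True)
    normalized = (content_feat - c_mean) / (c_std + eps)
    return normalized * s_std + s_mean
```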
The authors argue that ImageNet-trained CNNs are biased towards texture, since models trained on IN achieve low accuracy on SIN, while models trained on SIN achieve similar or better performance on IN.
The table below (taken from the original paper) shows detailed results.
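A minimal sketch of the kind of evaluation loop behind these numbers (our illustration, not the exact script from the paper): compute top-1 accuracy of a trained model on an IN-16 or SIN-16 loader.

```python
import torch

@torch.no_grad()
def top1_accuracy(model: torch.nn.Module,
                  loader: torch.utils.data.DataLoader,
                  device: str = "cuda") -> float:
    """Fraction of samples whose highest-scoring class matches the label."""
    model = model.eval().to(device)
    correct, total = 0, 0
    for images, labels in loader:
        images, labels = images.to(device), labels.to(device)
        preds = model(images).argmax(dim=1)
        correct += (preds == labels).sum().item()
        total += labels.size(0)
    return correct / total
```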
The table below shows the performance of a ResNet-50 when trained on various datasets. The model trained on the SIN+IN dataset and fine-tuned on IN achieves significantly better performance (in both top-1 and top-5 accuracy) than the other models.
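A simplified sketch of the fine-tuning stage follows; the checkpoint path and hyperparameters are illustrative assumptions, not the exact settings used in the paper or in our runs.

```python
import torch
from torchvision import models

model = models.resnet50(num_classes=16)
# Hypothetical checkpoint from SIN-16 + IN-16 pre-training:
# model.load_state_dict(torch.load("resnet50_sin_plus_in.pth"))

optimizer = torch.optim.SGD(model.parameters(), lr=1e-3, momentum=0.9)
criterion = torch.nn.CrossEntropyLoss()

model.train()
for images, labels in in16_loader:  # IN-16 loader from the sketch above
    optimizer.zero_grad()
    loss = criterion(model(images), labels)
    loss.backward()
    optimizer.step()
```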
The figure below shows the performance of ResNet-50 networks when evaluated on distorted images.
Models trained on SIN tend to be more robust than those trained on IN.
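As one concrete example of how such a robustness check can be set up (the noise type and magnitude here are illustrative assumptions, not the paper's exact distortion protocol), a distortion can be inserted as an extra transform before normalization and the accuracy helper above reused:

```python
import torch

class AddUniformNoise:
    """Adds uniform pixel noise to a tensor image in [0, 1]; apply it
    after ToTensor() and before Normalize()."""
    def __init__(self, width: float = 0.1):
        self.width = width  # illustrative magnitude, not the paper's setting

    def __call__(self, img: torch.Tensor) -> torch.Tensor:
        noise = (torch.rand_like(img) - 0.5) * 2 * self.width
        return (img + noise).clamp(0.0, 1.0)

# Build a distorted evaluation loader with this transform, then compare
# top1_accuracy(in_trained_model, ...) vs. top1_accuracy(sin_trained_model, ...).
```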
Stylizing a whole dataset can be seen as a data augmentation method: it expands the dataset and pushes the network towards shape-based representations. Deep CNNs trained on the stylized dataset tend to be more accurate and robust.