- CVPR17 Deep Learning Math Tutorial
- JHU 2018 Deep Learning Math Tutorial
- NYU MathsDL-spring19
- Understanding the Neural Tangent Kernel (the NTK itself is sketched after this list)
- Neural Tangent Kernel (NTK) Made Practical (Hu, 2020)
- Theory of Deep Learning Seminars @ Northwestern
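
For quick reference (my own paraphrase, not taken verbatim from any link above): the NTK of a network f(x; θ) is the Gram matrix of its parameter gradients, and the Jacot (2018) result is that in the infinite-width limit this kernel stays fixed during training, so gradient-flow training on squared loss reduces to kernel regression.

```latex
% Neural Tangent Kernel of a network f(x; \theta):
\Theta(x, x') = \nabla_\theta f(x;\theta)^{\top} \, \nabla_\theta f(x';\theta)
% In the infinite-width limit (Jacot, 2018), \Theta stays constant during
% training, and gradient flow on squared loss evolves the function like
% kernel regression on the training points (x_i, y_i):
\frac{\partial f(x;\theta_t)}{\partial t}
  = -\sum_{i=1}^{n} \Theta(x, x_i)\,\bigl(f(x_i;\theta_t) - y_i\bigr)
```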
Representation Learning
Generalization
- The Deep Bootstrap: Good Online Learners are Good Offline Generalizers
- Understanding the Failure Modes of Out-of-Distribution Generalization
- Learning and Generalization in Overparameterized Neural Networks, Going Beyond Two Layers (Allen-Zhu, 2019)
Optimization
- Adversarial Examples Are Not Bugs, They Are Features
- Neural Tangent Kernel: Convergence and Generalization in Neural Networks (Jacot, 2018)
- (Li, 2018)
- Investigating Learning in Deep Neural Networks using Layer-Wise Weight Change (Agrawal, 2020)
- Deeper layers change faster than shallower layers
- Does this have any ramifications for transfer-learning practice? (Freeze the initial layers and retrain the classifier; see the sketch after this list.)
- Greg Yang says here around 26:00 that later layers have larger gradients than earlier layers, which would explain this.
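
A minimal sketch (mine, not from the Agrawal paper; the toy model and layer split are made up) of how to check the layer-wise claim: run one backward pass and print per-layer gradient norms, then the freeze-and-retrain recipe from the question above.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

model = nn.Sequential(
    nn.Linear(32, 64), nn.ReLU(),   # "shallow" layers (close to the input)
    nn.Linear(64, 64), nn.ReLU(),
    nn.Linear(64, 10),              # "deep" layer (close to the output)
)

x, y = torch.randn(128, 32), torch.randint(0, 10, (128,))
loss = nn.functional.cross_entropy(model(x), y)
loss.backward()

# Per-layer gradient norms: if the claim holds, later layers show larger norms.
for name, p in model.named_parameters():
    print(f"{name:12s} grad norm = {p.grad.norm():.4f}")

# The transfer-learning recipe from the question above: freeze everything but
# the final classifier layer, then optimize only that layer.
for p in model[:-1].parameters():
    p.requires_grad = False
optimizer = torch.optim.SGD(model[-1].parameters(), lr=1e-2)
```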
Gradient Descent
- Gradient Descent Provably Optimizes Over-parameterized Neural Networks (Du, 2019)
- Gradient Descent Finds Global Minima of Deep Neural Networks (Du, 2019)
- Gradient Starvation: A Learning Proclivity in Neural Networks (Pezeshki, 2020)
Network Design
HyperNetworks
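
A minimal sketch of the hypernetwork idea (Ha, 2016): one small network generates the weights of another. The shapes, class name, and embedding size here are illustrative, not the paper's setup.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class HyperLinear(nn.Module):
    """A linear layer whose weight matrix is emitted by a small hypernetwork."""
    def __init__(self, in_features, out_features, z_dim=8):
        super().__init__()
        self.in_features, self.out_features = in_features, out_features
        self.z = nn.Parameter(torch.randn(z_dim))                   # learned layer embedding
        self.hyper = nn.Linear(z_dim, in_features * out_features)   # weight generator

    def forward(self, x):
        # Generate the target layer's weights from the embedding, then apply them.
        w = self.hyper(self.z).view(self.out_features, self.in_features)
        return F.linear(x, w)

layer = HyperLinear(16, 4)
out = layer(torch.randn(2, 16))
print(out.shape)  # torch.Size([2, 4])
```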