The Global Average Pooling (GAP) Layer.
In a way, it is an aggressive form of dimensionality reduction. It converts a w x h x d tensor into a 1 x 1 x d tensor (d = number of filters) by averaging each w x h feature map down to a single value.
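The reduction described above can be sketched in a few lines of NumPy. This is only an illustration, not code from the paper: the function name `global_average_pool`, the channels-last layout (h, w, d), and the toy input are assumptions made for the example.

```python
import numpy as np


def global_average_pool(x):
    """Average each feature map over its spatial dimensions.

    x is assumed to have shape (h, w, d), channels last.
    The result has shape (1, 1, d): one value per filter.
    """
    return x.mean(axis=(0, 1), keepdims=True)


# Toy input (hypothetical): 2 x 3 spatial grid with 4 filters.
feature_maps = np.arange(2 * 3 * 4, dtype=np.float64).reshape(2, 3, 4)
pooled = global_average_pool(feature_maps)

print(feature_maps.shape)  # (2, 3, 4)
print(pooled.shape)        # (1, 1, 4)
```

Each output value is simply the mean of one feature map, so the layer itself has no trainable parameters.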
It was introduced in the Network-in-Network paper. It was used to eliminate the parameters of the fully connected (FC) layer, which in turn reduces the total number of parameters and helps avoid overfitting. It was also used to help learn complex functions through averaging. A good explanation can be found here: Coursera, blog, different types of pooling, decent intro
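To get a rough feel for the parameter savings, the sketch below counts the weights of a dense classifier placed after a flattened feature map versus after GAP. The sizes (7 x 7 x 512 feature maps, 1000 classes) and the helper `dense_params` are hypothetical, chosen only for illustration. In the Network-in-Network setup, the last convolution already produces one feature map per class, so GAP feeds the softmax directly and the classifier adds no parameters at all.

```python
def dense_params(n_in, n_out):
    """Number of weights plus biases in a fully connected layer."""
    return n_in * n_out + n_out


# Hypothetical sizes, for illustration only.
h, w, d, n_classes = 7, 7, 512, 1000

# FC layer applied to the flattened h x w x d tensor.
flatten_fc = dense_params(h * w * d, n_classes)

# FC layer applied to the 1 x 1 x d output of GAP.
gap_fc = dense_params(d, n_classes)

print(flatten_fc)  # 25089000
print(gap_fc)      # 513000
```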
