- Competition link: Zero to GANs - Human Protein Classification
- My kaggle kernel (might differ): Transfer learning using EfficientNet models
- Latest notebook: Transfer learning using EfficientNet models
In this kaggle in-class competition, we will develop models capable of classifying mixed patterns of proteins in microscope images. Images visualizing proteins in cells are commonly used for biomedical research, and these cells could hold the key for the next breakthrough in medicine. However, thanks to advances in high-throughput microscopy, these images are generated at a far greater pace than what can be manually evaluated. Therefore, the need is greater than ever for automating biomedical image analysis to accelerate the understanding of human cells and disease.
This is a multilabel image classification problem, where each image can belong to several classes. The class labels are as follows:
Mitochondria
,Nuclear bodies
',Nucleoli
,Golgi apparatus
,Nucleoplasm
,Nucleoli fibrillar center
,Cytosol
,Plasma membrane
,Centrosome
,Nuclear speckles
- This competition was solved using the
PyTorch
deep learning framework. - During the course of the competition, we performed various steps such as:
- Data preparation
- Building custom Dataset class
- Performing data augmentations
- Creating our own train and test dataloaders
- A wrapper class for dataloaders to move data and model to the GPU memory
- Building model Base class and our Protein Model class
- Some helper methods to bind together all our functions together and start the training proces
- Generating the prediction process.