GitHub - WangHewei16/CassavaNet-An-EfficientNet-Vision-Transforer-for-Real-Time-Leaf-Disease-Identification: [Kaggle2020 Competition Top1.59% Silver Medal🥈] Develop an EfficientNet-based DL network consisting MBConv modules and a ViT backbone to classify the disease types of cassava leaf.

CassavaNet: A Purificatory EfficientNet-Vision Transformer Network for Real-Time Leaf Disease Identification

1. Background of the problem to be solved

Cassava, Africa's second-largest provider of carbohydrates, is a critical food security crop grown by smallholder farmers due to its ability to withstand harsh conditions. This starchy root is grown on at least 80% of Sub-Saharan African household farms, but viral diseases are a major source of low yields. It may be possible to identify common diseases and treat them using data science.

Existing disease detection methods necessitate farmers enlisting the assistance of government-funded agricultural experts to visually inspect and diagnose the plants. This suffers from being labor-intensive, scarce, and expensive. As an added challenge, effective solutions for farmers must perform well under important constraints, as African farmers might only have access to low-bandwidth mobile-quality cameras.

The dataset for this competition consists of 21,367 labeled images collected during a regular survey in Uganda. The majority of the images were crowdsourced from farmers who took photos of their gardens and annotated by experts at the National Crops Resources Research Institute (NaCRRI) in collaboration with Makerere University's AI lab in Kampala. This is in a format that most closely resembles what farmers would need to diagnose in the field. Our task is to categorize each cassava image into one of four disease categories or one healthy leaf category. To assist farmers in quickly identifying diseased plants, potentially saving their crops before irreversible damage occurs.

2. Pipeline

This problem is a single-label image classification problem with large differences in the amount of data from various categories and high data noise. Designed pipeline is shown in the figure below, Use resize, crop, flip, normalize and other preprocessing methods, and then input into two backbones: Vit and EfficientNet. Adapt nn.CrossEntropyLoss() as loss function, using the LabelSmoothing anti-noise technique, and choosing a different learning rate strategy for these two backbones. Lastly, do a simple ensemble such as tst_preds = 0.452*tst_preds_vit + 0.548*test_preds_eff.

3. BackBone

3.1 EfficientNet

The figure below shows the architecture of EfficientNet. [Paper Link]

3.2 Vision Transformer (Vit)

The figure below shows the architecture of EfficientNet. Converting images to sequence into Transformer. [Paper Link]

4. Learning rate strategy

Use Cosine Annealing strategy for EfficientNet backbone and adapt ReduceLROnPlateau strategy for Vit backbone.

5. K-Fold cross validation skill

Implement K-Fold Cross Validation for each model to improve respective and ensemble effect.

Name		Name	Last commit message	Last commit date
Latest commit History 38 Commits
FMix		FMix
VisionTransformer-Pytorch		VisionTransformer-Pytorch
convert		convert
docs		docs
images		images
loss		loss
optim		optim
results		results
tests		tests
types		types
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
Pytorch Efficientnet Baseline Light.ipynb		Pytorch Efficientnet Baseline Light.ipynb
Pytorch ViT Baseline.ipynb		Pytorch ViT Baseline.ipynb
README.md		README.md
avg_checkpoints.py		avg_checkpoints.py
clean_checkpoint.py		clean_checkpoint.py
distributed_train.sh		distributed_train.sh
hubconf.py		hubconf.py
inference.py		inference.py
mkdocs.yml		mkdocs.yml
requirements-docs.txt		requirements-docs.txt
requirements-sotabench.txt		requirements-sotabench.txt
requirements.txt		requirements.txt
setup.py		setup.py
sotabench.py		sotabench.py
sotabench_setup.sh		sotabench_setup.sh
train.py		train.py
validate.py		validate.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CassavaNet: A Purificatory EfficientNet-Vision Transformer Network for Real-Time Leaf Disease Identification

1. Background of the problem to be solved

2. Pipeline

3. BackBone

3.1 EfficientNet

3.2 Vision Transformer (Vit)

4. Learning rate strategy

5. K-Fold cross validation skill

About

Releases

Packages

Languages

License

WangHewei16/CassavaNet-An-EfficientNet-Vision-Transforer-for-Real-Time-Leaf-Disease-Identification

Folders and files

Latest commit

History

Repository files navigation

CassavaNet: A Purificatory EfficientNet-Vision Transformer Network for Real-Time Leaf Disease Identification

1. Background of the problem to be solved

2. Pipeline

3. BackBone

3.1 EfficientNet

3.2 Vision Transformer (Vit)

4. Learning rate strategy

5. K-Fold cross validation skill

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages