
initial commit of the branch SSL_comparison #205

Merged
merged 3 commits into main from SSL_comparison on Dec 13, 2024

Conversation

JulietteMoreau
Contributor

For my SSL benchmark review of next Thursday!

Contributor

@Clarax99 Clarax99 left a comment


The content is structured and clear; some of the chosen phrasing is not ideal but still understandable.
I left you some remarks with minor grammar corrections and rephrasing suggestions; up to you whether to take them into account. Sorry for the late review!


# Introduction

In medical domain there is often a lack of annotated data for deep learning model training while there is a lot of unlabeled data in data health records. Semi-supervised and self-supervised learning are methods developed to take advantage of these unlabeled images to improve classification accuracy. The first one train classifiers jointly with two loss terms, while the second is a two-stages approach, the first to learn deep representation and the second to fine-tune the classifier. However, the two methods are rarely compared. The first question they try to answer is: **Which recent semi- or self-supervised methods are likely to be most effective?** \

  • In the medical domain

  • "while there is a lot of unlabeled data in data health records" → Awkward phrasing. Suggested: "while a large amount of unlabeled data exists in health records."

  • The first one trains classifiers jointly

  • the second is a two-stage approach

  • General comment: maybe you could gain readability with bullet points for Semi and SSL, such as:

  • Semi : blablabla introductive content

  • SSL: introductive content with two-stage approach

  1. Pretraining
  2. Finetuning

Suggested rephrasing from ChatGPT (a second review for the price of one, hehe):
In the medical domain, annotated data for deep learning is often scarce, while a large amount of unlabeled data exists in health records. Semi-supervised and self-supervised learning methods have been developed to leverage this unlabeled data and improve classification accuracy. The former trains classifiers jointly with two loss terms, while the latter follows a two-stage approach: first, learning deep representations, and second, fine-tuning the classifier. However, these two methods are rarely compared.

But those methods are very sensitive to hyperparameters, and benchmarks often don't consider this (either no hyperparameters tuning or tuning with a huge labeled validation set, bigger than the training set, which is not realistic). In this paper, they try to take that in account by answering the question: **Given limited available labeled data and limited compute, is hyperparameter tuning worthwhile?**

"benchmarks often don't consider this": not very clear IMO, it could be more explicit. Maybe you could add another sentence instead of the note in parentheses.


$$ v^*, w^* \leftarrow \underset{v,w}{\operatorname{argmin}} \sum_{(x,y) \in L} \lambda^L\, l^L(y, g_w(f_v(x))) + \sum_{x \in U} \lambda^U\, l^U(x, f_v, g_w) $$

where $$l^L$$ and $$l^U$$ are the loss functions linked to the labeled and unlabeled datasets and $$\lambda^L$$ and $$\lambda_{U}$$ the associated weights. In addition, $$f_v(\cdot)$$ denotes a neural network backbone with parameters $$v$$ that produces an embedding of the input images and $$g_w(\cdot)$$ denotes a final linear softmax classification layer with parameters $$w$$.
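As an illustration of this objective, here is a minimal NumPy sketch (not the paper's code): $$l^L$$ is multiclass cross entropy, and $$l^U$$ is taken here to be prediction entropy, one possible choice of unlabeled loss; the weight values `lam_l` and `lam_u` are arbitrary.

```python
import numpy as np

def softmax(z):
    e = np.exp(z - z.max(axis=1, keepdims=True))
    return e / e.sum(axis=1, keepdims=True)

def combined_loss(logits_l, y_l, logits_u, lam_l=1.0, lam_u=0.5):
    """Weighted sum of a labeled and an unlabeled loss term.

    l^L: multiclass cross entropy on labeled samples.
    l^U: prediction entropy on unlabeled samples (one possible choice).
    """
    p_l = softmax(logits_l)
    ce = -np.log(p_l[np.arange(len(y_l)), y_l] + 1e-12).mean()
    p_u = softmax(logits_u)
    ent = -(p_u * np.log(p_u + 1e-12)).sum(axis=1).mean()
    return lam_l * ce + lam_u * ent

rng = np.random.default_rng(0)
# toy logits g_w(f_v(x)): 2 labeled samples, 3 unlabeled, 3 classes
logits_l = np.array([[2.0, 0.1, 0.1], [0.1, 2.0, 0.1]])
y_l = np.array([0, 1])
logits_u = rng.normal(size=(3, 3))
loss = combined_loss(logits_l, y_l, logits_u)
```

Setting `lam_u=0` recovers purely supervised training, matching the table of weight settings below.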

$$\lambda_{U}$$ --> $$\lambda^{U}$$ for uniformity

| Method | $$\lambda^L$$ | $$\lambda^U$$ |
|-----------------|---------------|---------------|
| Supervised | 1 | 0 |
| Semi-supervised | >0 | >0 |
| Self-supervised | 0 then 1 | 1 then 0 |

"Self-supervised | 0 then 1 | 1 then 0" → Could be clearer. Suggested: "Self-supervised | Pretraining (0) → Fine-tuning (1) | Pretraining (1) → Fine-tuning (0)."
I agree with ChatGPT on this point, but I can understand it doesn't fit nicely, so up to you.


* Sup: to denote a classifier trained with classical multiclass cross entropy loss.
* MixUp: also multiclass cross entropy, but with mixup data augmentation (creating new training samples by mixing two samples from the original dataset).
* SupCon: using a supervised contrastive learning loss (pull together samples belonging to the same class in the embedded space while pushing apart from samples of other classes).
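The mixup idea can be sketched in a few lines (an illustrative NumPy sketch, not the implementation benchmarked in the paper): two samples and their one-hot labels are blended with a coefficient drawn from a Beta distribution, whose concentration `alpha` is the usual mixup hyperparameter.

```python
import numpy as np

def mixup(x1, y1, x2, y2, alpha=0.2, rng=None):
    """Blend two samples and their one-hot labels with lambda ~ Beta(alpha, alpha)."""
    rng = rng if rng is not None else np.random.default_rng(0)
    lam = rng.beta(alpha, alpha)
    return lam * x1 + (1 - lam) * x2, lam * y1 + (1 - lam) * y2

x1, y1 = np.ones((4, 4)), np.array([1.0, 0.0])   # toy "image" of class 0
x2, y2 = np.zeros((4, 4)), np.array([0.0, 1.0])  # toy "image" of class 1
x_mix, y_mix = mixup(x1, y1, x2, y2)             # a new, blended training sample
```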

I am a bit confused by this subsection. I imagine this is both the methods and their notation for the compared supervised methods; maybe it is lacking an introductory sentence (which you did add in the next subsections).


* SimCLR: contrastive learning self-supervised method based on data augmentation with a learning nonlinear transformation between the representation and contrastive loss to improve the learned representations.
* MOCO v2: for momentum contrastive, which uses a dynamic dictionary with a queue and a moving-averaged encoder that facilitates contrastive unsupervised learning.
* SwAV: contrastive learning that does not compare features directly but comparing different augmentation of the same images.
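The contrastive objective these methods share can be illustrated with a minimal NT-Xent (normalized temperature-scaled cross entropy) sketch in NumPy; this is a simplified stand-in for intuition, not the exact SimCLR, MoCo, or SwAV implementation.

```python
import numpy as np

def nt_xent(z1, z2, tau=0.5):
    """NT-Xent loss on two batches of augmented-view embeddings.

    z1[i] and z2[i] are embeddings of two augmentations of the same image
    (positives); all other pairs in the batch act as negatives.
    """
    z = np.concatenate([z1, z2], axis=0)
    z = z / np.linalg.norm(z, axis=1, keepdims=True)  # cosine similarities
    sim = z @ z.T / tau
    np.fill_diagonal(sim, -np.inf)                    # exclude self-pairs
    n = len(z1)
    pos = np.concatenate([np.arange(n, 2 * n), np.arange(n)])  # positive indices
    logp = sim - np.log(np.exp(sim).sum(axis=1, keepdims=True))
    return -logp[np.arange(2 * n), pos].mean()

rng = np.random.default_rng(0)
z1 = rng.normal(size=(8, 16))
aligned = nt_xent(z1, z1 + 0.01 * rng.normal(size=(8, 16)))  # near-identical views
random_views = nt_xent(z1, rng.normal(size=(8, 16)))         # unrelated "views"
```

As expected, embeddings whose two views agree yield a much lower loss than unrelated ones.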

"...but comparing different augmentation of the same images." → Correct: "...but compares different augmentations of the same image."

For the low resolution images ResNet-18 is used, WideResNet-28-2 is used for TMED-2 and ResNet-18 or 50 are tested for AIROGS dataset. \
A maximum of 200 epochs is performed and training is stopped if the balanced accuracy plateaus for 20 consecutive epochs.

Hyperparameters (learning rate, weight decay and unlabeled loss weight, in addition to the parameters specific for each method) optimization is made for each method and dataset. To fit with real life constraints, they fixed a certain number of hours with a NVIDIA A100 GPU for each hyperparameter optimization (25h for PathMNIST, 50h for TissueMNIST and 100h for TMED-2 and AIROGS). Within this fixed budget, a random search of hyperparameters is done, tacking the best with validation set.
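The budgeted random search they describe can be sketched as follows. This is a simplified, assumed version: `sample_config` and the dummy `evaluate` are hypothetical stand-ins for training a model and scoring it on the validation set, and the budget is in seconds rather than GPU-hours for illustration.

```python
import random
import time

def random_search(sample_config, evaluate, budget_seconds):
    """Random hyperparameter search under a wall-clock budget.

    Keeps the configuration with the best validation score seen
    before the budget runs out.
    """
    best_score, best_cfg = float("-inf"), None
    deadline = time.monotonic() + budget_seconds
    while time.monotonic() < deadline:
        cfg = sample_config()
        score = evaluate(cfg)  # stand-in for validation balanced accuracy
        if score > best_score:
            best_score, best_cfg = score, cfg
    return best_cfg, best_score

def sample_config():
    # hypothetical search space: learning rate, weight decay, unlabeled weight
    return {
        "lr": 10 ** random.uniform(-4, -1),
        "weight_decay": 10 ** random.uniform(-6, -3),
        "lambda_u": random.uniform(0.0, 1.0),
    }

def evaluate(cfg):
    # dummy objective standing in for "train, then score on the validation set"
    return 1.0 - abs(cfg["lr"] - 0.01) - abs(cfg["lambda_u"] - 0.5)

best_cfg, best_score = random_search(sample_config, evaluate, budget_seconds=0.1)
```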

taking the best using the validation set.


Five separate trials were performed and mean balanced accuracy is calculated each 30 or 60 minutes on validation and test sets.

Balanced accuracy on test set over time is represented for each dataset and method. As there is a rough monotonic improvement in test performance over time despite using a realistic-size validation set, they conclude that checkpoint selection and hyperparameters optimization can be effective with a realistically-sized validation set.
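Balanced accuracy, the selection metric used throughout, is the mean of per-class recalls, which makes it robust to class imbalance; a minimal sketch on a toy imbalanced example:

```python
import numpy as np

def balanced_accuracy(y_true, y_pred):
    """Mean of per-class recalls."""
    classes = np.unique(y_true)
    recalls = [(y_pred[y_true == c] == c).mean() for c in classes]
    return float(np.mean(recalls))

# imbalanced toy labels: 8 samples of class 0, 2 of class 1
y_true = np.array([0] * 8 + [1] * 2)
y_pred = np.array([0] * 8 + [0, 1])  # misses one minority-class sample
# plain accuracy is 0.9, but balanced accuracy is (1.0 + 0.5) / 2 = 0.75
```

Missing a single minority-class sample costs far more in balanced accuracy than in plain accuracy, which is why it is the sensible checkpoint-selection metric for these imbalanced medical datasets.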

"As there is a rough monotonic improvement in test performance..." → Suggested: " As a roughly monotonic improvement in test performance is observed..."


![](/collections/images/SSL_comparison/accuracy_evolution.jpg)

All the methods are compared and none of them clearly stands out from the other depending on the dataset. So they measure the balanced accuracy relative gain over the best supervised method for each dataset. MixMatch represent the best overall choice.

MixMatch represents the best overall choice.


# Discussion

There is a real gain in performance of performing SSL, but it depends on the dataset (less gain on TMED-2). However, they insist on the importance of the selection of the hyperparameters and proper evaluation protocol depending on specific needs. Especially MixMatch requires cautious hyperparameters tuning when applied to a new dataset.

  • "There is a real gain in performance of performing SSL" → Suggested: "SSL provides a tangible performance gain..."
  • "less gain on TMED-2" → Unclear. Suggested: "a smaller performance improvement is observed for TMED-2."
  • "MixMatch requires cautious hyperparameters tuning" → Suggested: "MixMatch requires careful hyperparameter tuning..."

Member

@nathanpainchaud nathanpainchaud left a comment


A few things that caught my eye during the review. Otherwise very clear review, good job 🙂

Resolved review comments on collections/_posts/2024-12-02-SSL_comparison.md
@nathanpainchaud nathanpainchaud merged commit 2cf16d5 into main Dec 13, 2024
2 checks passed
@nathanpainchaud nathanpainchaud deleted the SSL_comparison branch December 13, 2024 14:59