FixMatch & SSL Exploration

Final project of the KTH DD2412 - Deep Learning, Advanced course.

The scope of this project is to replicate FixMatch, a method developed by Google researchers in order to perform semi-supervised learning.

The tasks we performed are the followings:

Replication of the method using PyTorch and training on CIFAR-10 and SVHN datasets.
Unbalance the supervised classes number of samples when training.
Varying the labeled-unlabeled data ratio.
Employing the merge of different datasets of the same nature as unlabeled data, SVHN and MNIST in this case.

The main outcomes obtained in each task can be seen below. In order to get a deeper insight of our findings, go to FixMatch & SLL Exploration.


Accuracy CIFAR-10 40 labels.	Accuracy CIFAR-10 250 labels.


Accuracy CIFAR-10 4000 labels.	Accuracy SVHN 250 labels.

Dataset	Scaling Factor	Test Accuracy Class 3	Test Accuracy Class 5
SVHN	1	92.44%	95.18%
SVHN	0.5	91.29%	96.35%
SVHN	1.5	95.62%	97.06%


Confusion matrix SVHN class 3 not scaled.	Confusion matrix SVHN class 3 downscaled 50%.	Confusion matrix SVHN class 3 upscaled 50%

Dataset	Percentage of Total Unlabeled Data	Test Accuracy
CIFAR-10	25%	49.00%
CIFAR-10	50%	51.23%
CIFAR-10	75%	57.93%
CIFAR-10	100%	64.44%


Accuracy CIFAR-10 25% unlabeled data.	Accuracy CIFAR-10 50% unlabeled data.


Accuracy CIFAR-10 75% unlabeled data.	Accuracy CIFAR-10 100% unlabeled data.

Labeled Data Dataset	Unlabeled Data Datasets	Test Dataset	Test Accuracy
SVHN	SVHN	SVHN	95.29%
SVHN	SVHN	MNIST	73.01%
SVHN	SVHN and MNIST	SVHN	96.51%
SVHN	SVHN and MNIST	MNIST	86.04%


Confusion matrix SVHN merged unlabeled data.	Confusion matrix MNIST merged unlabeled data.

Provide feedback