Detection of Dual Active Galactic Nuclei using Self-Supervised Active Learning Guided Detection System
(SSALD)
Official Repository for Project460.
The main purpose of this project
is to explore the use of self-supervised and active learning based algorithms to classify Dual-Active Galactic Nuclei from using a minuscle dataset.
The number of labeled samples are less than 100 for this project. The project has access to a extemely large unlabelled dataset SDSS(208 Million).
The number of samples in SDSS Dataset is extremely small. Labelling new samples is extremely difficult in our case due to the fact that DAGNs are rare in nature. Hence we use the following algorithm to train our model. The below pipeline is a mixture Active Learning and Self-Supervised Learning.
flowchart TD
A0[Entire SDSS Dataset]-- Self-Supervision ---B0[Feature-Extractor Backbone];
B0 -->C0[Data Pool: Sampled + Original];
C0-- Fine tune the Backbone --->D0[KNN-clustering Algorithm]
D0-- Find best </br> cluster size ---->D0
D0 -->E0[Sampler]
E0-- Oracle --->E0
E0-- Sample=old_samples + new_samples -->C0
- We thank Google TPU Research Cloud (TRC) program for supporting us with the computing resources(TPU)
- We thank Shubhankar Mishra Lab for supporting us with the GPU resources