Understanding Dimensional Collapse in Contrastive Self-supervised Learning

This repo contains the code used in paper Understanding Dimensional Collapse in Contrastive Self-supervised Learning.

@article{Jing2021UnderstandingDC,
  title={Understanding Dimensional Collapse in Contrastive Self-supervised Learning},
  author={Li Jing and Pascal Vincent and Yann LeCun and Yuandong Tian},
  journal={arXiv preprint arXiv:2110.09348},
  year={2021}
}

Part 1: Visualize SimCLR's embedding Spectrum

We viaulize the embedding space spectrum of a pretrained SimCLR model.

The spectrum is generated by spectrum.py.

How to use:

python spectrum.py --data <path-to-imagenet-data> --checkpoint <path-to-checkpoint> --projector

Part 2: Toy Tasks

We show that 2 reasons (strong augmentaiton and implicit regularization) cause dimensional collapse in contrastive learning via toy tasks. Please see toy_tasks.

Part 3: DirectCLR

DirectCLR is a simple contrastive learning model for visual representation learning. It does not require a trainable projector as SimCLR. It is able to prevent dimensional collapse and outperform SimCLR with a linear projector.

For training / evaluation detail, please see diretclr.

License

This project is under the CC-BY-NC 4.0 license. See LICENSE for details.

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
directclr		directclr
figures		figures
toy_task		toy_task
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
spectrum.py		spectrum.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Understanding Dimensional Collapse in Contrastive Self-supervised Learning

Part 1: Visualize SimCLR's embedding Spectrum

Part 2: Toy Tasks

Part 3: DirectCLR

License

About

Releases

Packages

Contributors 2

Languages

License

facebookresearch/directclr

Folders and files

Latest commit

History

Repository files navigation

Understanding Dimensional Collapse in Contrastive Self-supervised Learning

Part 1: Visualize SimCLR's embedding Spectrum

Part 2: Toy Tasks

Part 3: DirectCLR

License

About

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages