GitHub - Hzzone/torch_clustering: A pure PyTorch implementation of kmeans and GMM with distributed clustering.

torch_clustering

This repo contains a pure PyTorch implementation of the following:

K-means with k-means++ initialization;
Gaussian Mixture Model (GMM);
Support for Euclidean and cosine distance;
Support for both CPU and GPU tensors, and distributed clustering.
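
To make the first and third items concrete, here is a minimal sketch of k-means with k-means++ seeding under either distance. It is not the code shipped in this repo: the function names (`pairwise_distance`, `kmeans_plus_plus`, `kmeans`) and their arguments are hypothetical and chosen only for illustration.

```python
import torch


def pairwise_distance(x, centers, metric="euclidean"):
    """Return an (n, k) matrix of distances between rows of x and centers."""
    if metric == "cosine":
        x = torch.nn.functional.normalize(x, dim=1)
        centers = torch.nn.functional.normalize(centers, dim=1)
        return 1.0 - x @ centers.t()
    # Squared Euclidean distance.
    return torch.cdist(x, centers).pow(2)


def kmeans_plus_plus(x, n_clusters, metric="euclidean"):
    """Pick n_clusters seed rows of x with k-means++ sampling."""
    first = torch.randint(x.shape[0], (1,)).item()
    centers = [x[first]]
    closest = pairwise_distance(x, x[first:first + 1], metric).squeeze(1)
    for _ in range(1, n_clusters):
        # Sample the next seed with probability proportional to each
        # point's distance to its nearest already-chosen seed.
        probs = closest.clamp_min(0)
        if probs.sum() <= 0:  # all points coincide with a seed
            probs = torch.ones_like(probs)
        idx = torch.multinomial(probs, 1).item()
        centers.append(x[idx])
        new_dist = pairwise_distance(x, x[idx:idx + 1], metric).squeeze(1)
        closest = torch.minimum(closest, new_dist)
    return torch.stack(centers)


def kmeans(x, n_clusters, metric="euclidean", max_iter=300, tol=1e-4):
    """Lloyd iterations starting from k-means++ seeds."""
    centers = kmeans_plus_plus(x, n_clusters, metric)
    for _ in range(max_iter):
        labels = pairwise_distance(x, centers, metric).argmin(dim=1)
        new_centers = centers.clone()
        for k in range(n_clusters):
            members = x[labels == k]
            if members.shape[0] > 0:  # keep the old center if a cluster is empty
                new_centers[k] = members.mean(dim=0)
        shift = (new_centers - centers).pow(2).sum()
        centers = new_centers
        if shift <= tol:
            break
    labels = pairwise_distance(x, centers, metric).argmin(dim=1)
    return labels, centers


if __name__ == "__main__":
    torch.manual_seed(0)
    features = torch.randn(1000, 16)
    labels, centers = kmeans(features, n_clusters=10, metric="cosine")
    print(labels.shape, centers.shape)
```

Because every step is a tensor operation, the same sketch runs on a GPU when `x` lives there. The cosine variant normalizes rows before comparing them, so the center norms do not affect the assignment.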

In addition, I provide a Faiss wrapper that can be used with this code without any changes!

If you find this code helpful for your work, please consider citing my paper or starring this repo!

Install

git clone --depth 1 https://github.com/Hzzone/torch_clustering
cd torch_clustering && pip install -e .

Example

There are two example files:

distribute_kmeans_example.py demonstrates how to use distributed clustering;
example_and_benchmark.ipynb contains usage examples and the benchmark.
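
The distributed script itself is not reproduced here. As a rough illustration of the idea, one common way to distribute k-means is to let each worker hold a shard of the data, compute per-cluster sums and counts locally, and combine them with an all-reduce. The sketch below assumes that scheme; `update_centers` is a hypothetical helper, not a function from this repo, and the repo's actual approach may differ.

```python
import torch
import torch.distributed as dist


def update_centers(x_local, labels_local, centers):
    """One center update where x_local is this worker's shard of the data."""
    n_clusters, dim = centers.shape
    sums = torch.zeros(n_clusters, dim, dtype=x_local.dtype, device=x_local.device)
    counts = torch.zeros(n_clusters, dtype=x_local.dtype, device=x_local.device)

    # Accumulate the local per-cluster feature sums and member counts.
    sums.index_add_(0, labels_local, x_local)
    counts.index_add_(0, labels_local, torch.ones_like(labels_local, dtype=x_local.dtype))

    # Combine the statistics of all workers; skipped in a single-process run.
    if dist.is_available() and dist.is_initialized():
        dist.all_reduce(sums, op=dist.ReduceOp.SUM)
        dist.all_reduce(counts, op=dist.ReduceOp.SUM)

    # Clusters that received no points anywhere keep their previous center.
    new_centers = centers.clone()
    nonempty = counts > 0
    new_centers[nonempty] = sums[nonempty] / counts[nonempty].unsqueeze(1)
    return new_centers
```

Only the `(n_clusters, dim)` sums and the `n_clusters` counts cross the network, so the communication cost per iteration does not depend on the number of samples.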

Snippet:

from torch_clustering import PyTorchKMeans, FaissKMeans, PyTorchGaussianMixture, evaluate_clustering
# `features` is the tensor of samples to cluster; `kwargs` holds any further constructor options.
clustering_model = PyTorchKMeans(init='k-means++', max_iter=300, tol=1e-4, **kwargs)
pseudo_labels = clustering_model.fit_predict(features)

Benchmark

Tested on Google Colab (Tesla T4).

CIFAR-10	NMI	ACC	Speed
Faiss kmeans (nredo 5 maxiter 50)	0.8551	0.9236	4.59 ± 0.05
torch_clustering kmeans (nredo 10 maxiter 300)	0.8552	0.9235	0.73 ± 0.09
torch_clustering GMM (nredo 10x10 maxiter 300)	0.8559	0.9238	11.67 ± 0.33
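
For readers unfamiliar with the two quality columns, the following sketch shows how NMI and clustering accuracy (ACC) are commonly defined. It does not show how `evaluate_clustering` is implemented: the function names are hypothetical, the NMI uses arithmetic-mean normalization as an assumption, and the ACC matching is brute force, which is only practical for a handful of clusters (the Hungarian algorithm is the usual choice in practice).

```python
import math
from collections import Counter
from itertools import permutations


def entropy(labels):
    """Shannon entropy (in nats) of a label sequence."""
    n = len(labels)
    return -sum(c / n * math.log(c / n) for c in Counter(labels).values())


def nmi(y_true, y_pred):
    """Mutual information divided by the mean of the two label entropies."""
    n = len(y_true)
    joint = Counter(zip(y_true, y_pred))
    true_counts = Counter(y_true)
    pred_counts = Counter(y_pred)
    mi = 0.0
    for (t, p), c in joint.items():
        mi += c / n * math.log(c * n / (true_counts[t] * pred_counts[p]))
    denom = (entropy(y_true) + entropy(y_pred)) / 2
    return mi / denom if denom > 0 else 1.0


def clustering_accuracy(y_true, y_pred):
    """Accuracy under the best one-to-one mapping of clusters to classes."""
    classes = sorted(set(y_true))
    clusters = sorted(set(y_pred))
    joint = Counter(zip(y_true, y_pred))
    size = max(len(classes), len(clusters))
    # Pad the shorter side with None so the mapping stays one-to-one.
    classes = classes + [None] * (size - len(classes))
    clusters = clusters + [None] * (size - len(clusters))
    best = 0
    for perm in permutations(classes):
        matched = sum(joint[(t, p)] for t, p in zip(perm, clusters))
        best = max(best, matched)
    return best / len(y_true)


if __name__ == "__main__":
    truth = [0, 0, 1, 1, 2, 2]
    pred = [2, 2, 0, 0, 1, 1]  # same partition, different cluster ids
    print(nmi(truth, pred), clustering_accuracy(truth, pred))
```

Both metrics ignore the arbitrary numbering of clusters, which is why a relabeled but otherwise identical partition scores as a perfect match.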

On ImageNet, torch_clustering is expected to perform much better than Faiss.

Citation

@article{huang2022learning,
  title={Learning Representation for Clustering via Prototype Scattering and Positive Sampling},
  author={Zhizhong Huang and Jie Chen and Junping Zhang and Hongming Shan},
  journal={IEEE Transactions on Pattern Analysis and Machine Intelligence},
  year={2022},
}
