Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Show both heads all data or use auxillary overclustering? #102

Open
vivektreddy opened this issue Jan 25, 2021 · 0 comments
Open

Show both heads all data or use auxillary overclustering? #102

vivektreddy opened this issue Jan 25, 2021 · 0 comments

Comments

@vivektreddy
Copy link

vivektreddy commented Jan 25, 2021

In the case of STL10, it says that using overclustering with an extra 100000 images helps improve clustering of those initial 5000 images significantly.
What would have been the trade off if you had ran the algorithm with all 105000 images and showed each head the same images. Is there a better clustering achieved when you use auxillary head performing overclustering.
If I have 200,000 images for fully unsupervised clustering, would it be better to show both head A and head B all 200,000 images? Or would it better to shown one head 20,000 and the other head 200,000 (with more clusters)?
Furthermore, if I show both heads the same images, I might as well use one head right?

Thank you.

@vivektreddy vivektreddy changed the title How much performance is lost by showing both heads all data and not using auxillary overclustering? Is performance lost by showing both heads all data and not using auxillary overclustering? Jan 25, 2021
@vivektreddy vivektreddy changed the title Is performance lost by showing both heads all data and not using auxillary overclustering? Show both heads all data or use auxillary overclustering? Jan 25, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant