Finding the optimal number of hidden factors #4

Open
RobertoNegro opened this issue Nov 12, 2019 · 1 comment

RobertoNegro commented Nov 12, 2019

Hello, my team and I are trying to understand how to choose the optimal number of hidden factors. From what we've found, the goal is to maximize the TC (Total Correlation). But after some trials with different settings, the value of the tc property always increases as we increase the number of hidden factors.
We also have doubts about the TCs property, since we're not sure what it means: in our runs, the median of the TCs decreases rapidly as the number of hidden factors increases, and we're not sure how to interpret that.

So, basically, the main problem is: how can we choose the optimal number of hidden factors?

Thank you,
Roberto

[Plot: TC value (y) vs. number of hidden factors (x)]

[Plot: median of the TCs (y) vs. number of hidden factors (x)]

@gregversteeg
Owner

Hi Roberto, sorry to take so long to respond. You should look in the code for a method called "pick_n_hidden". It basically tries different numbers of factors and picks an optimal value.

Your experiment is absolutely correct though: the lower bound on TC only keeps going up! However, if you enforce that each variable has only one latent factor as a parent, then this doesn't happen (accessible as corex.moments["TC_no_overlap"]). In that case, adding factors causes the TC (without overlaps) to go up and then plateau and possibly go back down.
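
For anyone who wants to run this kind of sweep by hand, here is a minimal sketch of the "go up, then plateau" selection rule. It is not the library's pick_n_hidden; the helper name `pick_by_plateau`, the `tol` parameter, and the toy scores are all made up for illustration. In practice the scoring function would presumably fit a model with n hidden factors and return its `moments["TC_no_overlap"]` value, but how the model is constructed and fit is an assumption here, so it is left as a callable you supply.

```python
from typing import Callable, Dict, Iterable, Tuple


def pick_by_plateau(
    candidates: Iterable[int],
    score_fn: Callable[[int], float],
    tol: float = 0.01,
) -> Tuple[int, Dict[int, float]]:
    """Pick the smallest candidate whose score is within `tol` (relative) of the best.

    `score_fn(n)` is a hypothetical callable that returns a score for n hidden
    factors, e.g. the TC_no_overlap value of a model fitted with n factors.
    """
    scores = {n: float(score_fn(n)) for n in candidates}
    if not scores:
        raise ValueError("candidates must not be empty")
    best = max(scores.values())
    # Anything this close to the best score counts as "on the plateau".
    threshold = best - tol * abs(best)
    best_n = min(n for n, s in scores.items() if s >= threshold)
    return best_n, scores


# Toy scores standing in for TC_no_overlap: rise, plateau, then dip.
# These numbers are invented for the example, not measured results.
toy_scores = {1: 1.0, 2: 1.8, 3: 2.4, 4: 2.41, 5: 2.3}

best_n, scores = pick_by_plateau(sorted(toy_scores), toy_scores.get)
print(best_n)  # 3: the first n within 1% of the maximum score
```

Preferring the smallest n on the plateau rather than the strict argmax is a design choice: it avoids adding factors for a negligible gain. Set `tol=0` to recover the plain argmax.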
