Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question about using for prediction #10

Open
santideleon opened this issue Jan 12, 2022 · 1 comment
Open

Question about using for prediction #10

santideleon opened this issue Jan 12, 2022 · 1 comment

Comments

@santideleon
Copy link

Hello Alfredo,

I work in Antonio’s lab and am using the HI-VAE for a project in Pablo’s graduate deep learning course. I am trying this out with heterogenous medical data (35 features) with true missing values to predict a Multilabel of has disease (0 or 1) and type of care unit (ordinal 0,1,2,). I have about 80% of the dataset with labels and 20% without labels (I am holding onto these labels and using them to test the output of the HI-VAE), while both still contain true missing value. I have the model working with randomly generated masks that never coincide with the true missing values and then the csv of true missing values. Do you have any tips or suggestions for improving accuracy? Also I am unsure if the final result I am getting is assuming the least amount missing as it should for the final prediction.

  • Santiago
@santideleon
Copy link
Author

I should also comment that for the model I have just labeled the data_types. In no way did I label what was the target, but from my understanding you don’t need to and there is no way to. I also did not use data_types_real, since I was not sure what it was for.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant