Question about using for prediction #10

santideleon · 2022-01-12T04:02:05Z

Hello Alfredo,

I work in Antonio’s lab and am using the HI-VAE for a project in Pablo’s graduate deep learning course. I am trying this out with heterogenous medical data (35 features) with true missing values to predict a Multilabel of has disease (0 or 1) and type of care unit (ordinal 0,1,2,). I have about 80% of the dataset with labels and 20% without labels (I am holding onto these labels and using them to test the output of the HI-VAE), while both still contain true missing value. I have the model working with randomly generated masks that never coincide with the true missing values and then the csv of true missing values. Do you have any tips or suggestions for improving accuracy? Also I am unsure if the final result I am getting is assuming the least amount missing as it should for the final prediction.

Santiago

santideleon · 2022-01-12T04:12:18Z

I should also comment that for the model I have just labeled the data_types. In no way did I label what was the target, but from my understanding you don’t need to and there is no way to. I also did not use data_types_real, since I was not sure what it was for.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question about using for prediction #10

Question about using for prediction #10

santideleon commented Jan 12, 2022

santideleon commented Jan 12, 2022

Question about using for prediction #10

Question about using for prediction #10

Comments

santideleon commented Jan 12, 2022

santideleon commented Jan 12, 2022