You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I am working with Hail based Variant Spark.
Missing genotype calls in my dataset passed to random_forest_model as x cause a NullPointerExeption when trying to call fit_trees on the model.
For now I'll filter out rows with call rates !=1 with Hail, but it would be great if Variant Spark could handle this.
I believe in general case the imputation is better performed by specialised genomics tools.
However to basic support for imputation will be added in the following manner:
an extra parameter imputation_type will be added to methods.random_forest_model with allowed values:
None : no imputation - an error will be reported when input VCF contains uncalled genotypes
"mode" : perform a basic imputation by replacing missing values with the mode (most frequent value) of the non missing values. In case of multi-modal distribution the smallest mode is used.
I am working with Hail based Variant Spark.
Missing genotype calls in my dataset passed to random_forest_model as x cause a NullPointerExeption when trying to call fit_trees on the model.
For now I'll filter out rows with call rates !=1 with Hail, but it would be great if Variant Spark could handle this.
Code:
Error message:
fit_trees_nullpointer_exception.txt
The text was updated successfully, but these errors were encountered: