-
Notifications
You must be signed in to change notification settings - Fork 188
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Spark XGB converter accesses missing param improperly #697
Comments
It may be related to the fact onnxruntime does not support sparse features. What do you suggest as a fix? |
I saw your comments about sparsity, so I understand the motivation but it seems like the implementation has a bug. Do you recall any tests that were able to pass this check? I'm not particularly familiar with pyspark, but it seems we cannot operate directly on the |
After researching, the fix would be accessing via |
Should public access be open for feature branches? Having trouble pushing mine |
Description
#373 introduced this constraint on the converter
Even when I initialize
SparkXGBClassifier(missing=np.nan)
, this check still failsUpon inspecting,
xgb_node.missing
is typepyspark.ml.param.Param
so it makes sense that numpy can't apply the function. @xadupre can you provide some context on this change? I couldn't find much in the PR or linked issues but it seems like this is missing something likeParam.value
to access the data before passing to numpyExample
Reproducible script using the following library versions
The text was updated successfully, but these errors were encountered: