[ML][Inference] Fixing pre-processor value handling and size estimate #49270

benwtrent · 2019-11-18T21:12:23Z

There is currently a bug in how the pre-processors handle numeric values that are treated as categorical. This is a valid use case: encoding semi-ordinal numbers such as 10, 50, and 5000 categorically can improve model performance compared with treating them as plain numerics.
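
To illustrate the kind of handling described above, here is a minimal sketch of a one-hot style pre-processor that accepts a numeric field value as a category. The class name `CategoricalOneHotSketch`, its fields, and the choice to normalize values with `toString()` are assumptions made for illustration; this is not the code changed in this PR.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch, not the Elasticsearch implementation.
final class CategoricalOneHotSketch {

    // Maps a category (as a string key) to the name of the output column it switches on.
    private final Map<String, String> hotMap;
    private final String field;

    CategoricalOneHotSketch(String field, Map<String, String> hotMap) {
        this.field = field;
        this.hotMap = new HashMap<>(hotMap);
    }

    // Adds one 0/1 column per known category to the document.
    void process(Map<String, Object> fields) {
        Object value = fields.get(field);
        if (value == null) {
            return; // missing field: leave the document untouched
        }
        // Normalize to a string so the Integer 50 and the String "50" hit the same key.
        String key = value.toString();
        for (Map.Entry<String, String> entry : hotMap.entrySet()) {
            fields.put(entry.getValue(), entry.getKey().equals(key) ? 1 : 0);
        }
    }
}
```

One caveat of this approach: a `Double` such as `50.0` stringifies to `"50.0"`, so it would not match a category stored as `"50"`. A real implementation would need to decide how to treat that case.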

Additionally, when estimating the size of the pre-processor HashMaps, Double entries were sized using the default value of 256 when a simple shallowSizeOf should have been used. Setting defSize to 0 in the sizeOfMap call corrects this.
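
The effect of the default size can be seen in a toy estimator. The sketch below assumes the fallback behaves as the description implies: a positive `defSize` is used as-is for otherwise unrecognized objects, and zero falls back to a shallow per-object size. `MapSizeSketch`, its methods, and the byte figures are hypothetical stand-ins, not the estimator used by Elasticsearch.

```java
import java.util.Map;

// Hypothetical toy estimator; the byte figures are illustrative assumptions only.
final class MapSizeSketch {

    // Stand-in for a shallow object size; real values depend on the JVM.
    static final long ASSUMED_SHALLOW_BYTES = 24;

    // Sizes one key or value: strings get a crude estimate, everything else uses the fallback.
    static long sizeOfValue(Object o, long defSize) {
        if (o == null) {
            return 0;
        }
        if (o instanceof String) {
            return 2L * ((String) o).length(); // crude: 2 bytes per char
        }
        // A positive default wins; otherwise fall back to the shallow size.
        return defSize > 0 ? defSize : ASSUMED_SHALLOW_BYTES;
    }

    // Sums the estimates for every key and value in the map.
    static long sizeOfMap(Map<?, ?> map, long defSize) {
        long total = 0;
        for (Map.Entry<?, ?> entry : map.entrySet()) {
            total += sizeOfValue(entry.getKey(), defSize) + sizeOfValue(entry.getValue(), defSize);
        }
        return total;
    }
}
```

Under these assumptions, a map with two `Double` values is charged 512 for the values when `defSize` is 256, but only 48 when `defSize` is 0, which is the kind of overestimate the change removes.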

elasticmachine · 2019-11-18T21:12:25Z

Pinging @elastic/ml-core (:ml)

przemekwitek

LGTM


[ML][Inference] Fixing pre-processor value handling and size estimate

f4b7b8f

benwtrent added >non-issue :ml Machine learning v8.0.0 v7.6.0 labels Nov 18, 2019

fixing npe

95bce24

przemekwitek approved these changes Nov 22, 2019


benwtrent merged commit 9360dc9 into elastic:master Nov 22, 2019

benwtrent deleted the feature/ml-inference-pre-processor-bug-fixes branch November 22, 2019 12:31

benwtrent added a commit to benwtrent/elasticsearch that referenced this pull request Nov 22, 2019

[ML][Inference] Fixing pre-processor value handling and size estimate (…

724b061

…elastic#49270) * [ML][Inference] Fixing pre-processor value handling and size estimate * fixing npe

benwtrent mentioned this pull request Nov 22, 2019

[7.x] [ML][Inference] Fixing pre-processor value handling and size estimate (#49270) #49489

Merged

benwtrent added a commit that referenced this pull request Nov 22, 2019

[ML][Inference] Fixing pre-processor value handling and size estimate (…

276b6c6

…#49270) (#49489) * [ML][Inference] Fixing pre-processor value handling and size estimate * fixing npe

jakelandis added v8.0.0-alpha1 and removed v8.0.0 labels Jul 26, 2021
