Implement categorical prediction for CPU and GPU predict leaf. #7001
Conversation
Codecov Report
|          | master | #7001 | +/- |
| -------- | ------ | ----- | --- |
| Coverage | 81.71% | 81.72% |    |
| Files    | 13     | 13    |     |
| Lines    | 3916   | 3917  | +1  |
| Hits     | 3200   | 3201  | +1  |
| Misses   | 716    | 716   |     |
Continue to review full report at Codecov.
* Implement categorical prediction for CPU prediction.
* Implement categorical prediction for GPU predict leaf.
* Refactor the prediction functions to have a unified get next.
Force-pushed from 9701e60 to 4f444c2.
- int tid = model.trees[j]->GetLeafIndex(feats);
+ auto const& tree = *model.trees[j];
+ auto const& cats = tree.GetCategoriesMatrix();
+ bst_node_t tid = GetLeafIndex<true, true>(tree, feats, cats);
Should we also add a check for `has_categorical` here? Or are we purposefully removing it because `GetLeaf` is not performance critical?
Also, it's possible that `feats` has no missing values.
Yes, I'm assuming it's not critical. Otherwise we will need a lot more specializations.
This is the previous default, so I'm not slowing it down here. If optimization is needed (i.e. this becomes the computational bottleneck of some algorithms/models), we can come back to it in a different PR that focuses on optimization.
Related: #6503.
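
For context, a minimal sketch of the kind of compile-time dispatch being discussed. This is not the actual XGBoost implementation; the `Node`, `CatsMatrix`, and `FVec` types below are hypothetical placeholders that only illustrate how `GetLeafIndex<has_categorical, has_missing>` could skip categorical and missing-value handling when the caller knows neither is present, which is why adding more flags multiplies the number of specializations.

```cpp
// Hypothetical sketch: templated leaf lookup with compile-time flags for
// categorical splits and missing-value handling. All types here are
// simplified stand-ins, not the real tree model structures.
#include <cstddef>
#include <cstdint>
#include <bitset>
#include <vector>

using bst_node_t = std::int32_t;

struct Node {
  int split_index;           // feature used at this node
  float split_cond;          // threshold for numerical splits
  bst_node_t left, right;
  bst_node_t default_child;  // where missing values are routed
  bool is_leaf;
  bool is_categorical;       // true if this node splits on category membership
};

struct CatsMatrix {
  // one bitset per node: categories that go left (illustration only)
  std::vector<std::bitset<64>> node_cats;
};

struct FVec {
  std::vector<float> values;
  std::vector<bool> missing;
  bool IsMissing(int i) const { return missing[i]; }
  float GetFvalue(int i) const { return values[i]; }
};

template <bool has_categorical, bool has_missing>
bst_node_t GetLeafIndex(std::vector<Node> const& tree, FVec const& feats,
                        CatsMatrix const& cats) {
  bst_node_t nidx = 0;
  while (!tree[nidx].is_leaf) {
    Node const& n = tree[nidx];
    if (has_missing && feats.IsMissing(n.split_index)) {
      nidx = n.default_child;  // follow the default direction for missing values
      continue;
    }
    float fvalue = feats.GetFvalue(n.split_index);
    bool go_left;
    if (has_categorical && n.is_categorical) {
      // categorical split: test category membership instead of a threshold
      go_left = cats.node_cats[nidx].test(static_cast<std::size_t>(fvalue));
    } else {
      go_left = fvalue < n.split_cond;
    }
    nidx = go_left ? n.left : n.right;
  }
  return nidx;
}
```

With both flags hard-wired to `true` (as in `GetLeafIndex<true, true>` above), the checks stay in the code path but the leaf lookup remains a single instantiation, which matches the "not performance critical, avoid extra specializations" reasoning in this thread.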