Use new predict function for R. #6819

trivialfis · 2021-03-31T20:56:10Z

This is an early PR so that I can get suggestions from R experts. The primary change in the R interface is the use of iterationrange and the deprecation of ntreelimit.

TODOs:

- Remove the use of ntreelimit in the internal code base.
- Document its deprecation.
- Figure out how to utilize the shape returned by the new predict function.
- Add new tests for both iterationrange and strict_shape.
- Handle 1-based indexing for best_iteration.

codecov-commenter · 2021-06-07T06:43:06Z

Codecov Report

Merging #6819 (48d5105) into master (7beb2f7) will decrease coverage by 0.00%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##           master    #6819      +/-   ##
==========================================
- Coverage   81.72%   81.71%   -0.01%     
==========================================
  Files          13       13              
  Lines        3917     3916       -1     
==========================================
- Hits         3201     3200       -1     
  Misses        716      716

| Impacted Files | Coverage Δ | |
|---|---|---|
| python-package/xgboost/core.py | `82.83% <100.00%> (-0.02%)` | ⬇️ |

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data

hcho3 · 2021-06-08T23:09:45Z

R-package/R/xgb.Booster.R

+#' @param iterationrange Specifies which layer of trees are used in prediction.  For example, if a
+#'        random forest is trained with 100 rounds.  Specifying `iteration_range=(0,
+#'        20)`, then only the forests built during [0, 20) (half open set) rounds are
+#'        used in this prediction.  It's 0 based index (unlike R vector).
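
The half-open, 0-based semantics described in the docstring above can be sketched as follows. This is only an illustration in Python, not XGBoost code: `select_rounds` is a hypothetical helper, and its handling of `(0, 0)` as "all rounds" follows the convention raised in the review comment below.

```python
def select_rounds(n_rounds, iteration_range=(0, 0)):
    """Return the 0-based boosting rounds selected by a half-open range.

    Hypothetical helper for illustration only; not an XGBoost function.
    """
    begin, end = iteration_range
    if (begin, end) == (0, 0):
        # Assumed convention from the review: (0, 0) selects every round.
        return list(range(n_rounds))
    if not 0 <= begin < end <= n_rounds:
        raise ValueError(f"invalid range [{begin}, {end}) for {n_rounds} rounds")
    # Half-open: `begin` is included, `end` is excluded.
    return list(range(begin, end))


rounds = select_rounds(100, (0, 20))
print(len(rounds), rounds[0], rounds[-1])  # 20 0 19
```

With 100 rounds trained, `(0, 20)` covers rounds 0 through 19, so exactly 20 rounds contribute to the prediction.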


Should we use 0-based index here?

@jameslamb What's the convention for R packages that wrap native code? Do users expect to use 1-based indexing?

Also, please add a note that iteration_range=(0,0) indicates the use of all trees.

jameslamb · Jun 9, 2021

In general, I'm in favor of parameters like this being 1-based and having the documentation clearly indicate that. I think that's more friendly for R users.

But, to be fair, in {lightgbm} we have not been very consistent about this. Keyword arguments in the package's interface expect 1-based values, but {lightgbm} won't look inside parameters passed through a list params and subtract 1 from any parameters that are indices.

So I think that there is not really a "right" answer to this, and that it's more important to:

- be consistent (as much as possible)
- over-communicate in the documentation (always say whether it is 1-based or 0-based)

trivialfis · Jun 9, 2021

I will try to use a 1-based index.

trivialfis · Jun 9, 2021

Changed to use a 1-based index. Thanks for the suggestions!
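
One way to picture the switch to 1-based indexing is a wrapper that shifts the user-facing range down by one before handing it to a 0-based core. The sketch below uses Python purely for illustration and is not the code merged in this PR. `to_zero_based` is a hypothetical helper, and it assumes the 1-based range stays half-open, so that (1, 21) would map to (0, 20) and (1, 1) to the (0, 0) "all trees" value mentioned earlier in the thread.

```python
def to_zero_based(iterationrange):
    """Shift a 1-based half-open range to its 0-based equivalent.

    Hypothetical helper; assumes the 1-based range is also half-open.
    """
    begin, end = iterationrange
    if begin < 1 or end < begin:
        raise ValueError(f"invalid 1-based range: ({begin}, {end})")
    # Both ends move down by one; the half-open shape is unchanged.
    return (begin - 1, end - 1)


print(to_zero_based((1, 21)))  # (0, 20)
print(to_zero_based((1, 1)))   # (0, 0)
```

Keeping the conversion in a single place makes it easier to state in the documentation which side of the boundary is 1-based and which is 0-based.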

R-package/tests/testthat/test_basic.R

Kodiologist · 2021-07-30T14:31:19Z

I recently found a segfault in version 1.4.1.1 of the R package when providing ntreelimit greater than the number of trees in the model, but I'm guessing the bug no longer exists in XGBoost master, due to this PR.

trivialfis · 2021-07-30T16:09:32Z

@Kodiologist Did you test it? I think I have error tests in Python with model slicing, but not 100 percent sure about R error handling.

Kodiologist · 2021-07-30T16:39:35Z

No, not with anything newer than 1.4.1.1.
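
The failure mode described in this exchange, a tree limit larger than the model, is the kind of input a wrapper can reject before it reaches native code. The following is a minimal sketch in Python and not the actual XGBoost fix: `check_tree_limit` is a hypothetical helper, and treating 0 as "use all trees" is an assumption made for the example.

```python
def check_tree_limit(ntree_limit, n_trees):
    """Validate a tree limit against the number of trees in a model.

    Hypothetical helper; 0 is assumed to mean "use all trees".
    """
    if ntree_limit < 0:
        raise ValueError("tree limit must be non-negative")
    if ntree_limit > n_trees:
        # Fail with a clear error instead of reading past the model's trees.
        raise ValueError(
            f"tree limit {ntree_limit} exceeds the {n_trees} trees in the model"
        )
    return n_trees if ntree_limit == 0 else ntree_limit


print(check_tree_limit(10, 100))  # 10
print(check_tree_limit(0, 100))   # 100
```

A test for this path would assert that an oversized limit raises an error rather than crashing the session.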

trivialfis marked this pull request as draft March 31, 2021 20:56

trivialfis added the status: WIP label Mar 31, 2021

trivialfis added 2 commits June 7, 2021 13:29

Use new predict function for R.

d06786e

Callback, vec.

baf9187

trivialfis force-pushed the R-predict branch from 6d386fd to baf9187 June 7, 2021 05:29

Unused variable.

15dbd02

trivialfis added 18 commits June 7, 2021 16:33

Evil unbox.

71a55f9

Rename.

ceb7bc8

Fix strict_shape with softmax.

6671dc5

Fix softprob.

a58d875

Fix predict leaf.

8a28fab

Fix shap.

fee7a2e

Fix callback.

779862c

Lint.

e4b791f

Fix index.

35a43be

Small cleanup.

0329506

Note.

cfc8654

Remove some use of ntreelimit.

f4a2e5f

Quick tests.

c5e7383

Lint.

d20212c

lintr.

e5faa66

Change name.

48414aa

Add tests.

eb1ee9a

Lint.

b385eac

trivialfis changed the title ~~[WIP] Use new predict function for R.~~ Use new predict function for R. Jun 7, 2021

trivialfis marked this pull request as ready for review June 7, 2021 16:32

trivialfis removed the status: WIP label Jun 7, 2021

lintr.

48d5105

hcho3 requested changes Jun 8, 2021

trivialfis added 7 commits June 9, 2021 22:37

Handle it in predict.

e98ce00

Remove print.

dff4908

Comment.

7fc0c8c

Update documents.

67c05d9

Fix.

64b82fb

lintr.

5805bf4

Fix doc.

7805ea7

trivialfis requested a review from hcho3 June 9, 2021 17:23

Remove RF example.

2029e4e

trivialfis mentioned this pull request Jun 10, 2021

[JVM-Packages] Remove synchronized in java predict function. #7027

Closed

hcho3 approved these changes Jun 10, 2021

trivialfis merged commit b56614e into dmlc:master Jun 11, 2021

trivialfis deleted the R-predict branch June 11, 2021 05:03

trivialfis mentioned this pull request Jul 24, 2021

[R] Fix nthread in DMatrix constructor. #7127

Merged

jameslamb mentioned this pull request Jan 16, 2023

[R] could XGBoosterPredict_R be removed? #8687

Closed

hcho3 Jun 8, 2021

hcho3 Jun 8, 2021

jameslamb Jun 9, 2021

trivialfis Jun 9, 2021

trivialfis Jun 9, 2021
