Generalize prediction cache. #8783

trivialfis · 2023-02-10T08:41:59Z

Generalize prediction cache for other uses.

* Extract most of the functionality into `DMatrixCache`.
* Limit the size of the prediction cache to 32 items. This should be plenty. When the number is exceeded, it's usually because the user forgets to free the input DMatrix during inference, and by limiting the size of the cache, we are actually improving performance for those cases.
* Move the API entry struct to an independent file to reduce dependency on the `predictor.h` file.
* Add a test.
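To make the size limit concrete, here is a minimal sketch of a bounded cache keyed by the input data object. It is not the code in `include/xgboost/cache.h`: `Data`, `BoundedCache`, `CacheItem`, and `DemoForgottenInputs` are hypothetical names, and oldest-first eviction is an assumption made for illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <deque>
#include <memory>
#include <unordered_map>
#include <vector>

// Hypothetical stand-in for the input data object; not the real xgboost::DMatrix.
struct Data {
  int id;
};

// Hypothetical bounded cache keyed by the address of the data object.
// Entries for freed inputs are dropped; when the limit is reached, the oldest
// entry is evicted (assumed policy; max_size is expected to be at least 1).
template <typename Value>
class BoundedCache {
 public:
  explicit BoundedCache(std::size_t max_size) : max_size_{max_size} {}

  // Return the cached entry for `key`, creating an empty one if it is missing.
  std::shared_ptr<Value> CacheItem(std::shared_ptr<Data> const& key) {
    DropExpired();
    auto it = container_.find(key.get());
    if (it != container_.end()) {
      return it->second.value;
    }
    // Make room by evicting the oldest entries first.
    while (!queue_.empty() && container_.size() >= max_size_) {
      container_.erase(queue_.front());
      queue_.pop_front();
    }
    auto value = std::make_shared<Value>();
    container_.emplace(key.get(), Item{key, value});
    queue_.push_back(key.get());
    return value;
  }

  std::size_t Size() const { return container_.size(); }

 private:
  struct Item {
    std::weak_ptr<Data> ref;       // does not keep the input alive
    std::shared_ptr<Value> value;  // the cached payload
  };

  // Remove entries whose input has already been freed by the caller.
  void DropExpired() {
    std::deque<Data const*> alive;
    for (auto ptr : queue_) {
      auto it = container_.find(ptr);
      if (it->second.ref.expired()) {
        container_.erase(it);
      } else {
        alive.push_back(ptr);
      }
    }
    queue_.swap(alive);
  }

  std::size_t max_size_;
  std::unordered_map<Data const*, Item> container_;
  std::deque<Data const*> queue_;  // insertion order, oldest at the front
};

// Simulates a caller that never frees its inputs: 100 live inputs, limit of 32.
inline std::size_t DemoForgottenInputs() {
  BoundedCache<int> cache{32};
  std::vector<std::shared_ptr<Data>> leaked;
  for (int i = 0; i < 100; ++i) {
    leaked.push_back(std::make_shared<Data>(Data{i}));
    *cache.CacheItem(leaked.back()) = i;
  }
  return cache.Size();  // stays at the limit instead of growing to 100
}
```

In this sketch the cache holds only a weak reference to each input, so freeing the input is enough to release its entry on the next lookup; the cap only matters when inputs are kept alive.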

I'm working on learning-to-rank related changes. One hurdle I ran into is how to cache sorted indexes inside `Metric`. Currently, the cox metric adds a member function to meta info, while AUC has an ad-hoc cache. This PR proposes that we reuse the prediction cache in these places.
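As a rough illustration of that use case, the sketch below memoizes a sort order per data object so a metric does not have to re-sort on every evaluation. All names here (`Dataset`, `SortedIndexCache`, `SortedIdx`, `DemoSortedIdxReuse`) are hypothetical, sorting labels in descending order is an assumption, and this toy version is unbounded, unlike the cache proposed in this PR.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <memory>
#include <numeric>
#include <unordered_map>
#include <utility>
#include <vector>

// Hypothetical stand-in for the data a metric is evaluated on.
struct Dataset {
  std::vector<float> labels;
};

// Hypothetical per-dataset cache of sorted indexes.
class SortedIndexCache {
 public:
  // Return indexes that order `data->labels` from largest to smallest,
  // reusing the stored result when the same live object is seen again.
  std::vector<std::size_t> const& SortedIdx(std::shared_ptr<Dataset> const& data) {
    auto it = cache_.find(data.get());
    if (it != cache_.end() && !it->second.ref.expired()) {
      ++hits_;
      return it->second.idx;
    }
    std::vector<std::size_t> idx(data->labels.size());
    std::iota(idx.begin(), idx.end(), std::size_t{0});
    std::stable_sort(idx.begin(), idx.end(), [&](std::size_t l, std::size_t r) {
      return data->labels[l] > data->labels[r];
    });
    // Insert a new entry, or overwrite a stale one whose address was reused.
    auto& entry = cache_[data.get()];
    entry.ref = data;
    entry.idx = std::move(idx);
    return entry.idx;
  }

  std::size_t Hits() const { return hits_; }

 private:
  struct Entry {
    std::weak_ptr<Dataset> ref;    // detects freed inputs and address reuse
    std::vector<std::size_t> idx;  // cached sort order
  };
  std::unordered_map<Dataset const*, Entry> cache_;
  std::size_t hits_{0};
};

// Evaluating twice on the same dataset sorts once and reuses the result once.
inline bool DemoSortedIdxReuse() {
  auto data = std::make_shared<Dataset>();
  data->labels = {0.0f, 2.0f, 1.0f};
  SortedIndexCache cache;
  cache.SortedIdx(data);
  cache.SortedIdx(data);
  return cache.Hits() == 1;
}
```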

trivialfis · 2023-02-12T17:46:43Z

@hcho3 Thank you for the review, all comments are addressed.

trivialfis added 3 commits February 10, 2023 16:32

Generalize prediction cache.

5b66ef0

* Extract most of the functionality into `DMatrixCache`.
* Move API entry to independent file to reduce dependency on `predictor.h` file.
* Add test.

rename.

e7eecf8

fix test.

238e7e3

tidy.

trivialfis force-pushed the dmatrix-cache branch from 954a0b3 to 238e7e3 February 10, 2023 09:45

hcho3 requested changes Feb 12, 2023

include/xgboost/cache.h (outdated, resolved)

include/xgboost/cache.h (outdated, resolved)

include/xgboost/predictor.h (outdated, resolved)

tests/cpp/test_cache.cc (resolved)

trivialfis and others added 2 commits February 13, 2023 01:34

Apply suggestions from code review

b61bf11

Co-authored-by: Philip Hyunsu Cho

Rename.

493d84d

trivialfis requested a review from hcho3 February 12, 2023 17:46

hcho3 approved these changes Feb 12, 2023

trivialfis merged commit d11a004 into dmlc:master Feb 13, 2023

trivialfis deleted the dmatrix-cache branch February 13, 2023 04:36

trivialfis mentioned this pull request Feb 13, 2023

Pass DMatrix into metric for optional caching. #8790

Merged

ShellLM mentioned this pull request Aug 11, 2024

Xgboost 2.0.0 · dmlc/xgboost irthomasthomas/undecidability#878

Open

1 task
