
[AutoTVM] New rank-binary loss_type for the new xgboost >= 2.0.0 behaviour #14468

Merged
merged 1 commit into from Apr 11, 2023
Conversation

@cbalint13 (Contributor) commented Apr 3, 2023

This PR fixes for the new xgboost >= 2.0.0 behaviour, which requires binarized labels.

This addresses only autotune (the autoscheduler is not affected).

Note:
Unsure about the overall impact on the TVM tuner; we could introduce a more sophisticated way of measuring AP, such as the PASCAL evenly spaced one, but the advantages are unclear and would require extensive comparative tests.


The error caught during the TVM autotune process:

File "/usr/lib64/python3.11/site-packages/tvm/autotvm/tuner/xgboost_cost_model.py"
, line 538, in after_iteration
    bst_eval = model.eval_set(self.evals, epoch, feval)
               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/usr/lib64/python3.11/site-packages/xgboost/core.py", line 1995, in eval_set
    _check_call(
File "/usr/lib64/python3.11/site-packages/xgboost/core.py", line 270, in _check_call
    raise XGBoostError(py_str(_LIB.XGBGetLastError()))
xgboost.core.XGBoostError: [15:59:59] /builddir/build/BUILD/xgboost/src/common/ranking_utils.h:378: 
Check failed: is_binary: MAP can only be used with binary labels.
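
For reference, a minimal standalone sketch (not part of this PR) that reproduces the check; it forces eval_metric to "map" explicitly, matching what the traceback shows failing:

import numpy as np
import xgboost as xgb

X = np.random.rand(64, 8)
y = np.random.rand(64)                 # continuous scores, as AutoTVM produces
dtrain = xgb.DMatrix(X, label=y)
dtrain.set_group([64])                 # a single query group over all rows

xgb.train(
    {"objective": "rank:pairwise", "eval_metric": "map"},
    dtrain,
    num_boost_round=2,
    evals=[(dtrain, "train")],         # raises XGBoostError on xgboost >= 2.0.0
)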

Cc @Sunny-Island , @zxybazh , @junrushao , @vinx13 , please help with the review.

Thanks,
~Cristian.


Update:

  • The autoscheduler (ansor) is not affected at all.
  • The autotune reg (reg:linear) loss_type is fine.
  • Only autotune with rank (rank:pairwise) loss_type is affected.

@tvm-bot (Collaborator) commented Apr 3, 2023

Thanks for contributing to TVM! Please refer to the contributing guidelines https://tvm.apache.org/docs/contribute/ for useful information and tips. Please request code reviews from Reviewers by @-ing them in a comment.

Generated by tvm-bot

@cbalint13 changed the title from "Binarize labels for the new xgboost >= 1.7.5 behaviour" to "[AutoScheduler][AutoTVM] Binarize labels for the new xgboost >= 1.7.5 behaviour" on Apr 3, 2023
@tqchen (Member) commented Apr 4, 2023

Given that we are building a cost model, I am not sure binarization is the best approach here. Can you dump out the labels and check the currently assigned behavior?

Likely we might want to move away from the MAP metric and use another metric instead, either a regression metric or pairwise ranking.

@cbalint13 (Contributor, Author) commented Apr 4, 2023

@tqchen

Given that we are building a cost model, I am not sure binarization is the best approach here.

Can you dump out the labels and check the currently assigned behavior?

  • Sure, attached is a small script + dmatrix dump: tvm-xgboost-dmatrix.zip, with results.txt.
  • This was captured from a real TVM autotuning process targeting a rk3399 opencl device.

Likely we might want to move away from the MAP metric and use another metric instead, either a regression metric or pairwise ranking.

  • Apparently this proposal works and tuning finds good kernels, but the real impact is hard to measure (on my side).

Another quick idea for now is to make the binarization conditional on the xgboost >= 2.0.0 version, keeping the old behaviour otherwise.

@cbalint13 (Contributor, Author) commented:

@tqchen , in addition to my response to the request in the previous message:

Likely we might want to move away from the MAP metric and use another metric instead, either a regression metric or pairwise ranking.

  • Apparently this proposal works and tuning finds good kernels, but the real impact is hard to measure (on my side).

Another quick idea for now is to make the binarization conditional on the xgboost >= 1.7.5 version, keeping the old behaviour otherwise.

  • If binarization is too steep, a simple (but recursive) alternative would be the PASCAL trick of averaging/splitting into 11 evenly spaced levels (a rough sketch follows the figure below):

[figure: PASCAL VOC 11-point interpolated average precision]
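
A hedged sketch of that idea, assuming we quantize the continuous labels into 11 evenly spaced levels as PASCAL VOC's 11-point interpolated AP does (the function name is hypothetical, not TVM code):

import numpy as np

def quantize_labels(y, levels=11):
    # Map continuous labels onto `levels` evenly spaced bins (0 .. levels-1).
    y = np.asarray(y, dtype=np.float64)
    span = y.max() - y.min()
    if span == 0:
        return np.zeros_like(y)           # degenerate case: all labels equal
    normalized = (y - y.min()) / span     # rescale into [0, 1]
    return np.round(normalized * (levels - 1))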

@tqchen (Member) commented Apr 4, 2023

I think in this case we should change the ranking loss to a regression loss and use logistic regression, so the values can still be used. Binarization causes too much information loss.

@cbalint13 changed the title from "[AutoScheduler][AutoTVM] Binarize labels for the new xgboost >= 1.7.5 behaviour" to "[AutoTVM] Binarize labels for the new xgboost >= 2.0.0 behaviour" on Apr 4, 2023
@cbalint13 (Contributor, Author) commented Apr 4, 2023

@tqchen ,

I think in this case we should change the ranking loss to a regression loss and use logistic regression, so the values can still be used. Binarization causes too much information loss.

Updates:

  1. The autoscheduler (ansor) is not affected at all.
  2. The autotune reg (reg:linear) loss_type is not affected.
  3. Only the autotune rank (rank:pairwise) loss_type is affected.
  4. Only xgboost >= 2.0.0-dev is affected (as it presents itself via the py API).

I updated this PR's code to do the binarization only if:

  • xgboost >= 2, and
  • the objective is rank:pairwise.

I updated the code, the title, and the first comment (struck out any erroneous info).
I could imagine leaving this in autotune as preparation for what will be xgboost >= 2.0.0; a hedged sketch of such a version-conditional check follows.
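
(A sketch under stated assumptions; the helper names are illustrative, not the PR's actual identifiers.)

import numpy as np
import xgboost as xgb

def needs_binarized_labels(loss_type: str) -> bool:
    # Only xgboost >= 2 with the pairwise ranking objective trips the
    # "MAP can only be used with binary labels" check.
    xgb_major = int(xgb.__version__.split(".")[0])
    return loss_type == "rank" and xgb_major >= 2

def binarize(labels, threshold):
    # Work on a copy so TVM's continuous labels stay intact.
    labels = np.array(labels, copy=True)
    return (labels > threshold).astype(np.float32)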

@tqchen (Member) commented Apr 4, 2023

I see. I think we should report an error if binarization is needed, since the original intention was continuous prediction.

I know it might still work OK, but that was not the intention of the cost predictor.

It would be good to revisit the default choice; I think if ranking is not possible, reg:logistic would usually be another good choice.

@cbalint13 changed the title from "[AutoTVM] Binarize labels for the new xgboost >= 2.0.0 behaviour" to "[AutoTVM] New rank-binary loss_type labels for the new xgboost >= 2.0.0 behaviour" on Apr 4, 2023
@cbalint13 (Contributor, Author) commented:

@tqchen ,

  • Introduced a new rank-binary loss_type along with the current rank and reg.

I see. I think we should report an error if binarization is needed, since the original intention was continuous prediction.

  • If loss_type is rank and xgboost >= 2.0.0, an error is reported with an explicit suggestion to use rank-binary instead.

I know it might still work OK, but that was not the intention of the cost predictor.

  • All remains as it was; it is the user's choice (informed by TVM) to switch over to rank-binary or to downgrade xgboost.

It would be good to revisit the default choice; I think if ranking is not possible, reg:logistic would usually be another good choice.

  • I won't touch this; everything described above offers the user an informed alternative.

Let me know if this still needs changes or more polishing; I will stop here for now.
In the future I will keep track of xgboost's upcoming custom gain function; we may restore the continuous rank behaviour. A rough sketch of the error-reporting logic described above follows.
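
(Again a hedged sketch; identifiers are illustrative, not the PR's actual code.)

import xgboost as xgb

def resolve_objective(loss_type: str) -> str:
    # Refuse plain "rank" on xgboost >= 2.0.0, pointing at "rank-binary".
    xgb_major = int(xgb.__version__.split(".")[0])
    if loss_type == "rank" and xgb_major >= 2:
        raise ValueError(
            "loss_type 'rank' is incompatible with xgboost >= 2.0.0; "
            "use 'rank-binary' instead, or downgrade xgboost."
        )
    return {
        "reg": "reg:linear",             # regression loss
        "rank": "rank:pairwise",         # continuous labels (xgboost < 2 only)
        "rank-binary": "rank:pairwise",  # labels binarized at eval time
    }[loss_type]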

@cbalint13 changed the title from "[AutoTVM] New rank-binary loss_type labels for the new xgboost >= 2.0.0 behaviour" to "[AutoTVM] New rank-binary loss_type for the new xgboost >= 2.0.0 behaviour" on Apr 5, 2023
@tqchen (Member) commented Apr 5, 2023

Thanks @cbalint13! We still need to make sure the default loss_type now updates to the regression loss, mainly because rank-binary does not fit into the customized loss models.

@cbalint13 (Contributor, Author) commented:

@tqchen ,

Thanks @cbalint13! We still need to make sure the default loss_type now updates to the regression loss, mainly because rank-binary does not fit into the customized loss models.

  • Changed all class-init occurrences to "reg".
  • Lowered the test-case RMSE threshold for the new "reg" default.

There are still tutorials / applications that use an explicit "rank"; would you like me to change all of them?

@tqchen (Member) commented Apr 5, 2023

@cbalint13 Yes, let us update and change them all to reg. cc @junrushao to double-check the cases in MetaSchedule.

It might be useful to use reg:logistic, if the output is scaled into [0, 1] (a small sketch of such scaling follows).
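
For illustration only, a minimal sketch of the scaling this suggestion assumes: reg:logistic expects targets in [0, 1], so raw throughputs would be normalized by the best observed value (function name hypothetical):

import numpy as np

def scale_for_logistic(throughputs):
    # Scale raw throughputs so the targets land in [0, 1] for reg:logistic.
    t = np.asarray(throughputs, dtype=np.float64)
    best = t.max()
    return t / best if best > 0 else t   # all values now in [0, 1]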

@cbalint13 (Contributor, Author) commented Apr 5, 2023

@tqchen ,

@cbalint13 Yes, let us update and change them all to reg.

  • Updated all tutorials, apps and tests from any explicit "rank" to "reg".

cc @junrushao to double-check the cases in MetaSchedule.

  • A double check is welcome. As quick info, none of my metaschedule tests failed; otherwise xgboost would fail like:
xgboost.core.XGBoostError: [15:59:59] /builddir/build/BUILD/xgboost/src/common/ranking_utils.h:378: 
Check failed: is_binary: MAP can only be used with binary labels.
  • Used xgboost 20230403, git hash 15e073ca.

It might be useful to use reg:logistic, if the output is scaled into [0, 1].

My thoughts on the newly introduced loss_type="rank-binary":

  • It is an alternative to the original rank; both have ("objective": "rank:pairwise").
  • It binarizes the labels (within a cloned copy) only for the xgboost eval step (making it xgboost >= 2 compatible).
  • The dynamics inside TVM's further steps still remain fully continuous (see the sketch at the end of this comment).

I would leave it as described; if you like, we can create another loss_type="reg-binary" with reg:logistic.
In the reg:logistic case the name "rank-binary" would lose its "rank"-ness, so it would be just another "reg" class type.
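
A hedged sketch of that eval-only binarization: the PR binarizes within a cloned copy, while this sketch swaps the labels around eval_set and restores them afterwards, which has the same effect. The cut point and helper name are illustrative, not the PR's actual code:

import numpy as np
import xgboost as xgb

def eval_with_binary_labels(bst: xgb.Booster, dmat: xgb.DMatrix, epoch: int, feval=None):
    original = dmat.get_label()           # continuous labels stay with TVM
    cut = np.median(original)             # assumed cut point, illustrative
    dmat.set_label((original > cut).astype(np.float32))
    try:
        return bst.eval_set([(dmat, "tr")], epoch, feval)
    finally:
        dmat.set_label(original)          # restore the continuous labels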

@cbalint13 (Contributor, Author) commented:

@tqchen , @junrushao

In continuation of the previous comment, I also attach some test results here.

A comparative test confirms that rank-binary (binarized only at the eval step) behaves consistently with the original rank:

loss_type="rank-binary" (xgboost-2.0.0-dev 20230403 git hash 15e073ca)
[Task  1/54] (conv2d_nchw_spatial_pack.mali) {17.75 GFLOPS / #4912 records} SKIP
[Task  2/54] (conv2d_nchw_spatial_pack.mali) {40.74 GFLOPS / #1040 records} SKIP
[Task  3/54] (conv2d_nchw_spatial_pack.mali) {19.63 GFLOPS / #2032 records} SKIP

loss_type="rank" (xgboost-1.7.5 20230328 git hash 21d95f3d)
[Task  1/54] (conv2d_nchw_spatial_pack.mali) {11.71 GFLOPS / #1680 records} SKIP
[Task  2/54] (conv2d_nchw_spatial_pack.mali) {26.17 GFLOPS / #1024 records} SKIP
[Task  3/54] (conv2d_nchw_spatial_pack.mali) {13.15 GFLOPS / #1040 records} SKIP

Note:

  • In the rank-binary case there were more steps (it ran a bit longer; see the number of records), hence slightly better results.
  • The tuned network was the first three layers of yolov8s using float16, tuned for half a day on a rk3399 board (nanopc-t4).

@zxybazh (Member) commented Apr 7, 2023

@tvm-bot rerun

@tqchen tqchen merged commit 515583c into apache:main Apr 11, 2023