[QUESTION] GPU memory efficiency #6327
Comments
On dask, you can try the `DaskDeviceQuantileDMatrix`. Preferably with a nightly build.
Feel free to close if …
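For context, a minimal sketch of what using `DaskDeviceQuantileDMatrix` looks like; `X` and `y` are assumed dask_cudf collections already on the GPUs and are not from the thread:

```python
# Sketch: training with DaskDeviceQuantileDMatrix, which builds the
# quantile sketch directly from data already on the GPUs instead of
# materializing a full intermediate copy like DaskDMatrix does.
from dask.distributed import Client
from dask_cuda import LocalCUDACluster
import xgboost as xgb

cluster = LocalCUDACluster()          # one dask worker per visible GPU
client = Client(cluster)

# X, y: dask_cudf DataFrame / Series partitioned across the GPUs (assumed)
dtrain = xgb.dask.DaskDeviceQuantileDMatrix(client, X, y)

output = xgb.dask.train(
    client,
    {"tree_method": "gpu_hist"},      # this DMatrix type requires gpu_hist
    dtrain,
    num_boost_round=100,
)
booster = output["booster"]
```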
Thanks, will try. Is there some specific thing the Spark contributors did to get the 5X memory improvement that dask has not yet done?
No. I implemented DDQDM (`DaskDeviceQuantileDMatrix`) based on a quantile sketching algorithm recently. The post you linked is old.
Sorry, I just mean: what is the 5X GPU memory improvement they are referring to?
Also, is the same option possible with the scikit-learn API? And it might be a good idea to allow the scikit-learn API to accept a DMatrix as X, if that's not already possible.
I think it meant comparing converting a GPU dataframe to an XGBoost DMatrix directly against their old approach to saving memory.
Right now, no.
Thanks for the suggestion, that's a possible option. Or maybe we can dispatch based on the tree method and use `DeviceQuantileDMatrix`.
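As an illustration of the "GPU dataframe to DMatrix directly" path mentioned above, here is a hedged sketch; the file name and column names are made up:

```python
# Sketch: a cuDF DataFrame can be handed to xgboost.DMatrix directly,
# so the data never has to round-trip through host memory.
import cudf
import xgboost as xgb

df = cudf.read_csv("train.csv")      # data loaded on the GPU (hypothetical file)
X = df.drop(columns=["label"])
y = df["label"]

dtrain = xgb.DMatrix(X, label=y)     # built from device memory
booster = xgb.train({"tree_method": "gpu_hist"}, dtrain, num_boost_round=100)
```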
Yes, if there were another parameter in the scikit-learn API constructor, or an XGBoost parameter, to choose that option, that would work. It would be aligned with how one has to choose gpu_hist vs. hist, the default of gpu_predictor instead of cpu_predictor (AFAIK with rapids/cudf you can't switch to cpu_predictor), gpu_id = 0 as the default, etc. So either as a parameter or as the default sounds reasonable.
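For reference, a sketch of the two paths being discussed: the scikit-learn wrapper exposes GPU behaviour through constructor parameters such as `tree_method`, `predictor`, and `gpu_id`, while the memory-saving `DeviceQuantileDMatrix` is, at this point in the thread, only reachable through the native API. `X_gpu` and `y_gpu` are assumed cuDF or cupy inputs:

```python
import xgboost as xgb

# scikit-learn API: GPU behaviour is chosen via constructor parameters.
model = xgb.XGBRegressor(
    tree_method="gpu_hist",
    predictor="gpu_predictor",
    gpu_id=0,
)
model.fit(X_gpu, y_gpu)

# Native API: the memory-efficient quantile DMatrix is built directly
# from device data, which the sklearn wrapper does not expose here.
dtrain = xgb.DeviceQuantileDMatrix(X_gpu, y_gpu)
booster = xgb.train({"tree_method": "gpu_hist"}, dtrain, num_boost_round=100)
```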
When is the 1.3.0 release planned? I couldn't find the plan, only old roadmaps. The notes on releases say the plan for when to release is made once the prior release is out, so I suppose there is a plan for 1.3.0? It seems to have good fixes and features for dask.
@pseudotensor Here is the roadmap for 1.3.0: #6031. We will make the release once all the blocking issues are addressed.
Closing as the integer overflow issue is resolved and now the …
https://news.developer.nvidia.com/gpu-accelerated-spark-xgboost/
mentions a roughly 5X improvement in GPU memory usage.
Is this something only in xgboost4j? Or is it also in dmlc xgboost?
I'm asking because, in playing around with multi-GPU using dask, the memory use is quite high: 37M rows by 20 features runs out of GPU memory on 2 11GB GPUs. If there really were 5X to gain, that would be incredible. I don't see any such significant changes in GPU memory usage since the first GPU implementations by @RAMitchell. @teju85
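A hedged sketch of the kind of multi-GPU dask setup described above, using the plain `DaskDMatrix` (the path whose memory use prompted the question); the dataset path and column names are illustrative:

```python
# Sketch: two-GPU dask training with a regular DaskDMatrix. Each worker
# builds a full DMatrix from its partitions at training time, which keeps
# an extra copy of the data in GPU memory.
from dask.distributed import Client
from dask_cuda import LocalCUDACluster
import dask_cudf
import xgboost as xgb

cluster = LocalCUDACluster(n_workers=2)   # e.g. 2 x 11GB GPUs
client = Client(cluster)

df = dask_cudf.read_csv("data-*.csv")     # hypothetical ~37M rows x 20 features
X = df.drop(columns=["target"])
y = df["target"]

dtrain = xgb.dask.DaskDMatrix(client, X, y)
output = xgb.dask.train(
    client,
    {"tree_method": "gpu_hist"},
    dtrain,
    num_boost_round=100,
)
```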