That and the other memory controls would help, but for my own issue I'm happy to go with the blunt-force approach of getting torch to free up some of its excessive allocations.
Bumping slightly, as this is posing more of a problem. In the short term I'll probably resort to using https://crates.io/crates/nvml-wrapper to detect issues before they happen, and otherwise try to find some time to look into how empty_cache could be implemented.
In the tch and torch-sys crates there doesn't appear to be an equivalent of https://pytorch.org/docs/stable/generated/torch.cuda.empty_cache.html#torch-cuda-empty-cache, or of the torch._C.cuda_emptyCache function it calls. I'll have a deeper look into this and try to PR it, but any guidance would be appreciated, as this is a fairly important feature when sharing GPUs with other jobs.
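On the implementation side, libtorch exposes the cache-freeing call as `c10::cuda::CUDACachingAllocator::emptyCache()`, so a binding would presumably follow the existing torch-sys pattern of a small extern "C" shim. A sketch, assuming that pattern (the wrapper name `atc_cuda_empty_cache` is invented here, not an existing torch-sys symbol):

```cpp
// Hypothetical C shim over the libtorch CUDA caching allocator, in the style
// of the existing torch-sys wrappers. Requires libtorch headers to build.
#include <c10/cuda/CUDACachingAllocator.h>

extern "C" void atc_cuda_empty_cache() {
    // Releases cached, currently-unused blocks back to the driver, mirroring
    // what torch.cuda.empty_cache() does on the Python side.
    c10::cuda::CUDACachingAllocator::emptyCache();
}
```

The Rust side would then declare the symbol in an `extern "C"` block and expose a safe wrapper from tch.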