
try to fix Zero3 Memory Leak following @tohtana idea #363

Closed · wants to merge 9 commits

Conversation

@dumpmemory (Contributor) commented Apr 24, 2023

Here I am following @tohtana's modification from microsoft/DeepSpeed#3002 to fix #161. It worked with DeepSpeed 0.9.1 and torch 2.0. Thanks to @tohtana for the help.
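For context, this is roughly the call pattern the discussion is about: a LoRA-style Linear whose forward routes the (possibly transposed) base weight through F.linear on every call, which is where the ZeRO-3 memory growth from #161 was reported to show up. This is a minimal, self-contained sketch for illustration only; the class and helper names are assumptions for the example, not the actual peft code or the diff in this PR.

```python
# Illustrative sketch of the F.linear call pattern discussed in this PR
# (names like LoraLinearSketch are hypothetical, not peft identifiers).
import torch
import torch.nn as nn
import torch.nn.functional as F


def transpose(weight: torch.Tensor, fan_in_fan_out: bool) -> torch.Tensor:
    # Transpose the base weight when it is stored fan-in/fan-out
    # (e.g. GPT-2's Conv1D layers), mirroring the usual LoRA handling.
    return weight.T if fan_in_fan_out else weight


class LoraLinearSketch(nn.Linear):
    def __init__(self, in_features, out_features, r=8, fan_in_fan_out=False):
        super().__init__(in_features, out_features)
        self.fan_in_fan_out = fan_in_fan_out
        self.lora_A = nn.Linear(in_features, r, bias=False)
        self.lora_B = nn.Linear(r, out_features, bias=False)

    def forward(self, x):
        # The call site under discussion: every forward passes the base
        # weight through F.linear. Under DeepSpeed ZeRO-3 this is where the
        # gathered parameters were reportedly not being released.
        result = F.linear(x, transpose(self.weight, self.fan_in_fan_out), bias=self.bias)
        result = result + self.lora_B(self.lora_A(x))
        return result
```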

@HuggingFaceDocBuilderDev commented Apr 24, 2023

The documentation is not available anymore as the PR was closed or merged.

@pacman100 (Contributor) left a comment


Hello @dumpmemory, great work getting this issue solved on the DeepSpeed side and raising the fix here. Could you apply the fix to all the places in lora and adalora where F.linear is used? That would solve the issue in all places.

@aashay96 commented
When will this be deployed?

@dumpmemory (Contributor, Author) commented

> Hello @dumpmemory, great work getting this issue solved on the DeepSpeed side and raising the fix here. Could you apply the fix to all the places in lora and adalora where F.linear is used? That would solve the issue in all places.

Cool, I will. Thanks.

@dumpmemory (Contributor, Author) commented

@pacman100, please help me check it. I have replaced F.linear in all the places.
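For anyone who wants to verify the fix locally, one rough way to watch for the growth reported in #161 is to log allocated CUDA memory at a fixed step interval during a LoRA + ZeRO-3 run and check that it plateaus instead of climbing. This is an illustrative helper, not part of this PR; the `engine` and `dataloader` names in the usage comment are hypothetical.

```python
# Illustrative memory-growth check for a LoRA + ZeRO-3 training run.
import torch


def log_gpu_memory(step: int, interval: int = 50) -> None:
    """Print allocated/reserved CUDA memory every `interval` steps."""
    if not torch.cuda.is_available() or step % interval != 0:
        return
    allocated = torch.cuda.memory_allocated() / 2**20
    reserved = torch.cuda.memory_reserved() / 2**20
    print(f"step {step}: allocated={allocated:.1f} MiB, reserved={reserved:.1f} MiB")


# Inside a training loop (hypothetical DeepSpeed engine and dataloader):
# for step, batch in enumerate(dataloader):
#     loss = engine(**batch).loss
#     engine.backward(loss)
#     engine.step()
#     log_gpu_memory(step)
```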

@pacman100 (Contributor) left a comment


Thank you @dumpmemory for iterating, LGTM! 🤗

Could you run `make style` and `make quality` to fix the quality issues?

@dumpmemory (Contributor, Author) commented
> Thank you @dumpmemory for iterating, LGTM! 🤗
>
> Could you run `make style` and `make quality` to fix the quality issues?

Yes, I will. I will also test the new commits on the DeepSpeed side. Thanks again.

@pacman100 (Contributor) commented

Hello @dumpmemory, there are still some code quality issues. Please resolve them to move ahead with the PR.

@pacman100 (Contributor) commented

Hello, is this PR still required, given that the DeepSpeed team has fixed it in their codebase?

@dumpmemory (Contributor, Author) commented

This PR is no longer required.

@dumpmemory closed this May 10, 2023
Development

Successfully merging this pull request may close these issues.

GPT2 Training GPU Memory Increase with LoRA and Zero 3
4 participants