Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

使用GPU训练时出现error in modulated_deformable_im2col_cuda: out of memory #62

Open
creater-zq opened this issue Sep 8, 2022 · 2 comments

Comments

@creater-zq
Copy link

creater-zq commented Sep 8, 2022

您好,您的代码解决了我遇到的一些bugs。
但是在训练时我遇到一些错误。

  • error in modulated_deformable_im2col_cuda: out of memory
    我查看如下了一些信息,如下:
  • bug1
  • nvidia-smi结果
    我设置的batch size为2,但我设置batch size=4的时候,cuda会直接中断退出。
    谢谢您!

跑着跑着,退出了。

  File "/DCNv2/dcn_v2.py",  in forward
    return dcn_v2_conv(input, offset, mask, self.weight, self.bias, self.stride, self.padding, self.dilation, self.deformable_groups,)
  File "DCNv2/dcn_v2.py",  in forward
    output = _backend.dcn_v2_forward(
RuntimeError: CUDA error: an illegal memory access was encountered
CUDA kernel errors might be asynchronously reported at some other API call,so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1.
已放弃 (核心已转储)
@SilyRab
Copy link

SilyRab commented Jan 17, 2024

我也是这样,后来解决了吗?我在1卡上跑,但是dcn有一部分东西在0卡上,然后数据或者网络参数不在一张卡上,就报和你一样的错误了

@creater-zq
Copy link
Author

我也是这样,后来解决了吗?我在1卡上跑,但是dcn有一部分东西在0卡上,然后数据或者网络参数不在一张卡上,就报和你一样的错误了

我后来修改了一些代码,跑通了,不过我发现他这个不适合我的数据集,就没搞了,是好久之前了

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants