Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Large systems fail in CUDA Kernels for fixable reasons #440

Open
GregorySchwing opened this issue May 16, 2022 · 0 comments
Open

Large systems fail in CUDA Kernels for fixable reasons #440

GregorySchwing opened this issue May 16, 2022 · 0 comments

Comments

@GregorySchwing
Copy link
Collaborator

Describe the bug
Systems for which the number of ewald images > 65535, the max number of blocks, fails for invalid grid configuration.
GPUassert: invalid configuration argument /home/udpk5hr/GOMC/src/GPU/CalculateEwaldCUDAKernel.cu 100

Furthermore, the reduction strategy allocated a array of size threadsPerBlock * numberOfBlocks. This can exceed memory capabilities of the GPU easily.

To Reproduce
Run an 400x400x400 ang box of water using Ewald GPU.

Expected behavior
GOMC should be able to use all the memory of a single GPU without failing at certain system sizes.

Additional context
This can be fixed by defining a 3D grid of blocks. Also, the reduction can be collapsed to use less memory.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant