Reduce redundant GPU allocations #2393
Conversation
This is nice! The previously high memory consumption prevented the execution of my large network application with multi-compartment neurons (see here for the code of the single-compartment case; the code for the multi-compartment case is not public yet) on common GPUs. The present PR reduces the GPU memory consumption for that application from >> 8 GB to ~100 MB. Also, the full suite of tests for my network application passes (for both the single-compartment and the multi-compartment case).
You can also try #2394, which takes the concept one step further.
Spack is failing due to pybind11-stubgen not being updated. I made the corresponding PR here.
The conditional initialization and allocation of the different ion state arrays is not very easy to follow. There must be a better way to do this, but that's maybe for another PR.
Co-authored-by: boeschf <[email protected]>
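For what it's worth, one conceivable direction, sketched with a hypothetical helper rather than code from this PR (`alloc_if` and the condition names are made up, and host-side `std::vector` stands in for the GPU array type): routing every conditional array through one function keeps each existence condition next to its allocation.

```c++
#include <cstddef>
#include <vector>

// Hypothetical helper, not Arbor API: allocate an array only when the
// predicate holds, so every conditional member documents its own condition.
template <typename T>
std::vector<T> alloc_if(bool cond, std::size_t n, T fill = T{}) {
    return cond ? std::vector<T>(n, fill) : std::vector<T>{};
}

// Usage sketch, with conditions as in the PR description:
//   auto Xd       = alloc_if<double>(uses_diffusion, n_cv);
//   auto reset_Xi = alloc_if<double>(writes_Xi, n_cv);
//   auto reset_Xo = alloc_if<double>(writes_Xo, n_cv);
```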
Can we add a Spack variant that would disable the pybind11 stubgen? This would be consistent with the CMake side and would allow the Spack CI to still pass.
Thanks, looks good to me.
Introduction
Reasoning: If concentrations are never changed, we do not reset them and thus do not need to store the values.
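As a rough illustration of that reasoning (a host-side sketch; the struct and member names are mine and `std::vector` stands in for Arbor's actual GPU storage types):

```c++
#include <cstddef>
#include <optional>
#include <vector>

// Illustrative only: the live concentration array always exists, but the
// reset/init copies are materialized only if some mechanism writes to it.
struct ion_state_sketch {
    std::vector<double> Xi;                       // internal concentration, per CV
    std::optional<std::vector<double>> reset_Xi;  // absent if Xi is never written
    std::optional<std::vector<double>> init_Xi;   // absent if Xi is never written

    ion_state_sketch(std::size_t n_cv, bool xi_written): Xi(n_cv) {
        if (xi_written) {
            reset_Xi.emplace(n_cv);
            init_Xi.emplace(n_cv);
        }
    }

    void reset() {
        // If nothing ever writes Xi, there is nothing to undo: skip the
        // copy and save the storage for the two extra arrays.
        if (reset_Xi) Xi = *reset_Xi;
    }
};
```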
Solution

Elide the redundant arrays from the GPU solver. This saves per CV:

- 1 x 8B for `cv_area`, unconditionally
- 1 x 8B for `Xd`, for each ion if no diffusion is in use (majority of cases)
- 2 x 8B for `Xi` (`reset` and `init`), for each ion if not written (reasonably often)
- 2 x 8B for `Xo` (`reset` and `init`), for each ion if not written (majority of cases)
- 1 x 8B for the `eX` reset buffer, for each ion if not read (majority)
- 1 x 8B for `eX`, for each ion if not read (rarely)

In my standard benchmark, `busyring` with complex cells, this saves about 18% of the total GPU allocation for the cell data (`shared_state`).
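For a sense of scale, a back-of-the-envelope tally of the list above; the three-ion configuration and the flags are assumptions for illustration, not numbers from the benchmark.

```c++
#include <cstdio>

// Assumed configuration: 3 ions, no diffusion, Xi/Xo never written,
// eX reset buffer elided but eX itself kept. Doubles are 8B each.
int main() {
    constexpr long long ions = 3, B = 8;
    long long saved = 0;
    saved += 1 * B;        // cv_area, unconditionally
    saved += ions * 1 * B; // Xd: no diffusion in use
    saved += ions * 2 * B; // Xi reset + init: not written
    saved += ions * 2 * B; // Xo reset + init: not written
    saved += ions * 1 * B; // eX reset buffer: not read
    std::printf("%lld B saved per CV\n", saved); // prints: 152 B saved per CV
}
```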
This has become a mixed bag, fixing a few additional things that came up during testing this: