cuSOLVER provides a routine (syevjBatched) to perform batched diagonalization of Hermitian matrices:
https://docs.nvidia.com/cuda/cusolver/index.html#cusolverdn-t-syevj
The performance benefit likely depends strongly on matrix size and batch count; see this benchmark thread: https://discourse.julialang.org/t/eigenvalues-for-lots-of-small-matrices-gpu-batched-vs-cpu-eigen/50792
We could consider using this for accelerating LSWT. Note that for many LSWT calculations, especially in dipole mode, the diagonalization subroutine itself may not be the dominant cost. To make this beneficial, we would probably need to move a lot of the calculation onto the GPU (e.g., the matrix-builds for each q).
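To make the pattern concrete, here is a minimal CPU-side sketch of the batched workflow described above: build the q-dependent Hermitian matrix for every q at once as a stacked array, then diagonalize the whole stack in a single call. The model Hamiltonian and function names are purely illustrative (not Sunny's API); NumPy's stacked `eigh` is used as a stand-in for cuSOLVER's syevjBatched.

```python
import numpy as np

def build_hamiltonians(qs, J=1.0):
    """Toy 2x2 Hermitian matrix for each q (hypothetical model, for
    illustration only). Builds the whole batch with vectorized ops,
    which is the part that would need to live on the GPU."""
    n = len(qs)
    H = np.zeros((n, 2, 2), dtype=complex)
    H[:, 0, 0] = 2 * J
    H[:, 1, 1] = 2 * J
    H[:, 0, 1] = J * np.exp(1j * qs)
    H[:, 1, 0] = np.conj(H[:, 0, 1])  # enforce Hermiticity
    return H

qs = np.linspace(0, np.pi, 128)
H = build_hamiltonians(qs)           # shape (128, 2, 2): one matrix per q
evals, evecs = np.linalg.eigh(H)     # one batched call over the whole stack
```

On a GPU, the same stacked-array shape works with e.g. `cupy.linalg.eigh`, which dispatches small-matrix batches to cuSOLVER's Jacobi-based batched solver; the key point either way is that both the matrix build and the diagonalization are expressed as one operation over the batch, avoiding a per-q round trip.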