Skip to content

rocSOLVER 3.16.0 for ROCm 5.0.0

Compare
Choose a tag to compare
@lawruble13 lawruble13 released this 09 Feb 20:34
39ccf7a

Added

  • Symmetric matrix factorizations:
    • LASYF
    • SYTF2, SYTRF (with batched and strided_batched versions)
  • Added rocsolver_get_version_string_size to help with version string queries
  • Added rocblas_layer_mode_ex and the ability to print kernel calls in the trace and profile logs
  • Expanded batched and strided_batched sample programs.

Optimized

  • Improved general performance of LU factorization
  • Increased parallelism of specialized kernels when compiling from source, reducing build times on multi-core systems.

Changed

  • The rocsolver-test client now prints the rocSOLVER version used to run the tests,
    rather than the version used to build them
  • The rocsolver-bench client now prints the rocSOLVER version used in the benchmark

Fixed

  • Added missing stdint.h include to rocsolver.h