Skip to content

hipBLASLt 0.3.0 for ROCm 5.7.1

Compare
Choose a tag to compare
@rocm-ci rocm-ci released this 13 Oct 18:57

Added

  • Add getAllAlgos extension APIs
  • TensileLite support new epilogues: gradient gelu, gradient D, gradient A/B, aux
  • Add sample package including three sample apps
  • Add new C++ GEMM class in hipblaslt extension

Changed

  • refactor GroupGemm APIs as C++ class in hipblaslt extension
  • change scaleD vector enum as HIPBLASLT_MATMUL_DESC_D_SCALE_VECTOR_POINTER

Fixed

  • Enable norm check validation for CI

Optimizations

  • GSU kernel optimization: wider memory, PGR N
  • update logic yaml to improve some FP16 NN sizes
  • GroupGemm support GSU kernel
  • Add grouped gemm tuning for aldebaran