Skip to content

Releases: ROCm/hipBLASLt

hipBLASLt 0.8.0 for ROCm 6.2.2

27 Sep 16:01
44efd61
Compare
Choose a tag to compare

hipBLASLt code for ROCm 6.2.2 did not change. The library was rebuilt for the updated ROCm 6.2.2 stack.

hipBLASLt 0.8.0 for ROCm 6.2.1

20 Sep 19:57
44efd61
Compare
Choose a tag to compare

hipBLASLt code for ROCm 6.2.1 did not change. The library was rebuilt for the updated ROCm 6.2.1 stack.

hipBLASLt 0.8.0 for ROCm 6.2.0

02 Aug 16:15
Compare
Choose a tag to compare

hipBLASLt code for ROCm 6.2.0 did not change. The library was rebuilt for the updated ROCm 6.2.0 stack.

hipBLASLt 0.7.0 for ROCm 6.1.2

04 Jun 16:53
77e0580
Compare
Choose a tag to compare

hipBLASLt code for ROCm 6.1.2 did not change. The library was rebuilt for the updated ROCm 6.1.2 stack.

hipBLASLt 0.7.0 for ROCm 6.1.1

08 May 17:59
77e0580
Compare
Choose a tag to compare

hipBLASLt code for ROCm 6.1.1 did not change. The library was rebuilt for the updated ROCm 6.1.1 stack.

hipBLASLt 0.7.0 for ROCm 6.1.0

16 Apr 19:08
1549b02
Compare
Choose a tag to compare

Additions

  • Added hipblasltExtSoftmax extension API
  • Added hipblasltExtLayerNorm extension API
  • Added hipblasltExtAMax extension API
  • Added GemmTuning extension parameter to set split-k by user
  • Support for mix precision datatype: fp16/fp8 in with fp16 out

hipBLASLt 0.6.0 for ROCm 6.0.2

31 Jan 20:12
592518e
Compare
Choose a tag to compare

Added

  • Add UserArguments for GroupedGemm
  • Support datatype: fp16 in with fp32 out
  • Add samples
  • Support datatype: Int8 in Int32 out
  • Support platform gfx94x
  • Support fp8/bf8 datatype (only for gfx94x platform)
  • Support Scalar A,B,C,D for fp8/bf8 datatype

Changed

  • Replace hipblasDatatype_t with hipDataType
  • Replace hipblasLtComputeType_t with hipblasComputeType_t
  • Deprecate HIPBLASLT_MATMUL_DESC_D_SCALE_VECTOR_POINTER

hipBLASLt 0.6.0 for ROCm 6.0.0

15 Dec 18:30
592518e
Compare
Choose a tag to compare

Added

  • Add UserArguments for GroupedGemm
  • Support datatype: fp16 in with fp32 out
  • Add samples
  • Support datatype: Int8 in Int32 out
  • Support platform gfx94x
  • Support fp8/bf8 datatype (only for gfx94x platform)
  • Support Scalar A,B,C,D for fp8/bf8 datatype

Changed

  • Replace hipblasDatatype_t with hipDataType
  • Replace hipblasLtComputeType_t with hipblasComputeType_t
  • Deprecate HIPBLASLT_MATMUL_DESC_D_SCALE_VECTOR_POINTER

hipBLASLt 0.3.0 for ROCm 5.7.1

13 Oct 18:57
Compare
Choose a tag to compare

Added

  • Add getAllAlgos extension APIs
  • TensileLite support new epilogues: gradient gelu, gradient D, gradient A/B, aux
  • Add sample package including three sample apps
  • Add new C++ GEMM class in hipblaslt extension

Changed

  • refactor GroupGemm APIs as C++ class in hipblaslt extension
  • change scaleD vector enum as HIPBLASLT_MATMUL_DESC_D_SCALE_VECTOR_POINTER

Fixed

  • Enable norm check validation for CI

Optimizations

  • GSU kernel optimization: wider memory, PGR N
  • update logic yaml to improve some FP16 NN sizes
  • GroupGemm support GSU kernel
  • Add grouped gemm tuning for aldebaran

hipBLASLt 0.3.0 for ROCm 5.7.0

15 Sep 17:29
Compare
Choose a tag to compare

Added

  • Add getAllAlgos extension APIs
  • TensileLite support new epilogues: gradient gelu, gradient D, gradient A/B, aux
  • Add sample package including three sample apps
  • Add new C++ GEMM class in hipblaslt extension

Changed

  • refactor GroupGemm APIs as C++ class in hipblaslt extension
  • change scaleD vector enum as HIPBLASLT_MATMUL_DESC_D_SCALE_VECTOR_POINTER

Fixed

  • Enable norm check validation for CI

Optimizations

  • GSU kernel optimization: wider memory, PGR N
  • update logic yaml to improve some FP16 NN sizes
  • GroupGemm support GSU kernel
  • Add grouped gemm tuning for aldebaran