Skip to content

rocBLAS 14.1.0 for ROCm1.8.2

Compare
Choose a tag to compare
@amcamd amcamd released this 29 Jun 15:33
· 3893 commits to master since this release

Changelist:

  • partition gemm m and n dimension to avoid offset exceeding 32 bit
  • fix set_get_matrix memory leak
  • TRSM improved performance and make asynch
  • Use hip_device target for ROCm1.8.2
  • Improve gemm-strided-batched testing