Skip to content

hipBLASLt 0.1.0 for ROCm 5.5.0

Compare
Choose a tag to compare
@rocm-ci rocm-ci released this 01 May 21:03

Added

  • Enable hipBLASLt APIs
  • Support gfx90a
  • Support problem type: fp32, fp16, bf16
  • Support activation: relu, gelu
  • Support bias vector
  • Support Scale D vector
  • Integreate with tensilelite kernel generator
  • Add Gtest: hipblaslt-test
  • Add full function tool: hipblaslt-bench
  • Add sample app: example_hipblaslt_preference

Optimizations

  • Gridbase solution search algorithm for untuned size
  • Tune 10k sizes for each problem type