Skip to content

Conversation

@yzhou103
Copy link
Contributor

@yzhou103 yzhou103 commented Jul 8, 2025

  1. parallel at all kernels
  2. add log information of tuned shape
  3. fix typo in GemmTuner

@valarLip valarLip self-assigned this Jul 8, 2025
   1. parallel at all kernels
   2. add log information of tuned shape
   3. fix typo in GemmTuner
@valarLip valarLip merged commit caf1e19 into main Jul 12, 2025
13 checks passed
@valarLip valarLip deleted the tuning_parallel branch July 12, 2025 06:16
cagrikymk pushed a commit that referenced this pull request Jul 30, 2025
* enable parallel tuning on CK kernels

   1. parallel at all kernels
   2. add log information of tuned shape
   3. fix typo in GemmTuner

* enable parallel tuning on CK kernels

   1. parallel at all kernels
   2. add log information of tuned shape
   3. fix typo in GemmTuner

* add log info for tuned shape

* add log info for tuned shape

* fix lint error

* fix error in gemm_a8w8_blockscale and update gemm_op_a4w4 interface

* update gemm_op_a4w4/a8w8 interface

* fix lint error in gemm_op_a4w4/a8w8wq

---------

Co-authored-by: Ying.Zhou2 <340077269@qq.com>
Co-authored-by: Ying.Zhou2 <YingZhou2@amd.com>
Co-authored-by: valarLip <103567126+valarLip@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants