cutlass integration + segment_matmul implementation #51
Conversation
Codecov Report
@@ Coverage Diff @@
## master #51 +/- ##
==========================================
- Coverage 94.42% 90.68% -3.74%
==========================================
Files 12 13 +1
Lines 233 247 +14
==========================================
+ Hits 220 224 +4
- Misses 13 23 +10
Continue to review full report at Codecov.
@pyg-team/nvidia-team This PR is now ready to review.
This is really a great example showing cutlass integration. Nice job @rusty1s! Do you have an example where pyg_lib.segment.grouped_matmul is actually getting called in a training script?
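For illustration, a minimal sketch of what such a training-script call could look like. The entry point pyg_lib.segment.grouped_matmul, its list-of-tensors signature, and the toy training loop are assumptions based on this thread rather than a confirmed API; the backward pass it relies on is still to be added, as discussed later in the conversation.

```python
# Hypothetical sketch: the import path and the
# (List[Tensor], List[Tensor]) -> List[Tensor] signature of grouped_matmul
# are assumptions based on this thread, not a confirmed pyg_lib API.
import torch
from pyg_lib.segment import grouped_matmul  # assumed entry point

num_types, in_dim, out_dim = 3, 16, 32

# One learnable weight matrix per node/edge type:
weights = [torch.randn(in_dim, out_dim, device='cuda', requires_grad=True)
           for _ in range(num_types)]
optimizer = torch.optim.Adam(weights, lr=0.01)

# Node features grouped by type (group sizes may differ):
xs = [torch.randn(n, in_dim, device='cuda') for n in (100, 200, 50)]

for step in range(10):
    optimizer.zero_grad()
    # One fused CUTLASS grouped GEMM instead of `num_types` small matmuls:
    outs = grouped_matmul(xs, weights)
    loss = sum(out.pow(2).mean() for out in outs)
    loss.backward()  # relies on the backward pass discussed below
    optimizer.step()
```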
LGTM!
pyg_lib/csrc/segment/matmul.cpp (Outdated)
@@ -0,0 +1,41 @@
#include "matmul.h"

#include <ATen/core/dispatch/Dispatcher.h>
ditto.
Thanks @teju85. I will work on the backward implementation and PyG integration next. I can share an example by then!
Haicheng from NVIDIA CUTLASS here. LGTM. Thank you. BTW, we are improving group GEMM now.
@hwu36 Thanks! Please ping me if you make any improvements :)
@jackkosaian just fixed the occupancy calculation in NVIDIA/cutlass#532. This number is used to calculate the number of threadblocks to launch for group GEMM. I know you hard-coded this number for now, so you are not affected. @jackkosaian is going to further improve group GEMM in the summer.