Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Matmul benchmarking: case without tile quantization: #1980

Merged
merged 5 commits into from
Sep 30, 2022

Conversation

shmsong
Copy link

@shmsong shmsong commented Sep 13, 2022

This is the benchmarking PR in this series, tracking the resulting performance from this stack of PRs.

Most recent run on A100:

initial_sweep_0912ext

@shmsong
Copy link
Author

shmsong commented Sep 22, 2022

most recent check on DGX A100 after the recent round of moving commits around and cleaning up:
3stage0922
4stage0922

@zasdfgbnm zasdfgbnm mentioned this pull request Sep 29, 2022
@@ -20,6 +20,7 @@ if(USE_CUDA)
softmax_backward.cpp
Copy link
Collaborator

@zasdfgbnm zasdfgbnm Sep 29, 2022

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Note to myself: I have split this file out and merged separately in #2007
This file is no longer needed here anymore.

@@ -0,0 +1,356 @@
#include <torch/csrc/jit/codegen/cuda/arith.h>
Copy link
Collaborator

@zasdfgbnm zasdfgbnm Sep 29, 2022

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Note to myself: I have split this file out and merged separately in #2007
This file is no longer needed here anymore.

@zasdfgbnm
Copy link
Collaborator

After rebasing, this PR is just a trivial PR adding a test, I will merge this now to the bottom of the stack

@zasdfgbnm zasdfgbnm merged commit 7c77b39 into cache_op_interface Sep 30, 2022
@zasdfgbnm zasdfgbnm deleted the matmul_benchmarking1 branch September 30, 2022 07:43
zasdfgbnm added a commit that referenced this pull request Feb 9, 2023
… options (#1978)

* pipe through cpasyncCG

* Matmul benchmarking: case without tile quantization: (#1980)

* add matmul benchmark

* more benchmark and test extension

* fixes

Co-authored-by: Xiang Gao <qasdfgtyuiop@gmail.com>

* fix

---------

Co-authored-by: Xiang Gao <qasdfgtyuiop@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants