Performance difference with simple_mode enabled? #5

Open
LeiWang1999 opened this issue Aug 12, 2022 · 0 comments

Hi all, could you kindly explain the difference between auto-tensorize and auto-tensorize-v4? From my amos-gemm benchmarking, the performance of the two strategies is quite similar:

| M | K | N | amos-1000-step-fp16-simple (ms) | amos-1000-step-fp16 (ms) |
|---|---|---|---|---|
| 2 | 2 | 2 | Failed to run | Failed to run |
| 4 | 4 | 4 | Failed to run | Failed to run |
| 8 | 8 | 8 | Failed to run | Failed to run |
| 16 | 16 | 16 | 0.004545906 | 0.003936828 |
| 32 | 32 | 32 | 0.004610093 | 0.004310548 |
| 64 | 64 | 64 | 0.004638971 | 0.004614832 |
| 128 | 128 | 128 | 0.005128772 | 0.005059945 |
| 256 | 256 | 256 | 0.006975747 | 0.007367229 |
| 512 | 512 | 512 | 0.018055338 | 0.016287096 |
| 1024 | 1024 | 1024 | 0.066839093 | 0.071785023 |
| 2048 | 2048 | 2048 | 0.382059749 | 0.336489417 |
| 4096 | 4096 | 4096 | 2.00519422 | 2.252330443 |
| 8192 | 8192 | 8192 | 21.62599663 | 18.10944683 |
| 16384 | 16384 | 16384 | 111.4660256 | 132.6751751 |
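
To put a number on "quite similar", here is a minimal sketch that computes the signed relative gap between the two timing columns for each size that ran. The timings are copied from the table above. The names `TIMINGS_MS`, `relative_gap`, `simple` and `default` are my own shorthand for illustration and are not part of the AMOS API.

```python
# Timings in milliseconds, copied from the benchmark table above.
# Key: M = K = N. Value: (fp16-simple column, fp16 column).
# Sizes 2, 4 and 8 are omitted because they failed to run.
TIMINGS_MS = {
    16: (0.004545906, 0.003936828),
    32: (0.004610093, 0.004310548),
    64: (0.004638971, 0.004614832),
    128: (0.005128772, 0.005059945),
    256: (0.006975747, 0.007367229),
    512: (0.018055338, 0.016287096),
    1024: (0.066839093, 0.071785023),
    2048: (0.382059749, 0.336489417),
    4096: (2.00519422, 2.252330443),
    8192: (21.62599663, 18.10944683),
    16384: (111.4660256, 132.6751751),
}


def relative_gap(simple_ms: float, default_ms: float) -> float:
    """Signed gap of the simple column relative to the other column.

    Positive means the simple column is slower, negative means it is faster.
    """
    return (simple_ms - default_ms) / default_ms


gaps = {size: relative_gap(s, d) for size, (s, d) in TIMINGS_MS.items()}

for size, gap in gaps.items():
    faster = "simple" if gap < 0 else "default"
    print(f"M=K=N={size:>5}: gap {gap:+.1%} ({faster} column is faster)")
```

On these numbers the sign of the gap flips from one size to the next, so neither column is consistently faster.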