Help! Want a toy example to run matmul with q40 weight by cuda kernel #219

Eutenacity · 2024-09-11T15:19:38Z

Sorry, I am not familiar with the library. I want to run a matmul between a tensor created in PyTorch and a q40 weight read from a GGUF file.
I can read the weight from the GGUF file and convert it to a PyTorch tensor.
But I don't know how to run the matmul between the PyTorch tensor and the q40 weight with a CUDA kernel.
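
To make the question concrete, here is a minimal CPU reference for what such a kernel would have to compute. It is a sketch, and it assumes "q40" means GGML's Q4_0 layout: blocks of 32 weights stored as one fp16 scale followed by 16 bytes of packed 4-bit values, where the low nibbles hold elements 0..15, the high nibbles hold elements 16..31, and each value is `(q - 8) * scale`. The function names `dequantize_q4_0` and `matvec_q4_0` are hypothetical and are not this library's API. A reference like this is mainly useful for checking the output of a real CUDA kernel.

```python
import struct

QK4_0 = 32                     # weights per Q4_0 block (assumed GGML layout)
BLOCK_BYTES = 2 + QK4_0 // 2   # fp16 scale + 16 bytes of packed nibbles


def dequantize_q4_0(raw: bytes) -> list:
    """Expand raw Q4_0 bytes into a flat list of Python floats."""
    if len(raw) % BLOCK_BYTES != 0:
        raise ValueError("buffer length is not a multiple of the block size")
    out = []
    for off in range(0, len(raw), BLOCK_BYTES):
        # Little-endian fp16 scale at the start of the block.
        (d,) = struct.unpack_from("<e", raw, off)
        qs = raw[off + 2 : off + BLOCK_BYTES]
        # Low nibbles are elements 0..15, high nibbles are elements 16..31.
        lo = [((b & 0x0F) - 8) * d for b in qs]
        hi = [((b >> 4) - 8) * d for b in qs]
        out.extend(lo + hi)
    return out


def matvec_q4_0(x: list, raw: bytes, rows: int, cols: int) -> list:
    """Compute y = W @ x, with W (rows x cols) stored row-major as Q4_0."""
    w = dequantize_q4_0(raw)
    if len(w) != rows * cols or len(x) != cols:
        raise ValueError("shape mismatch between x, W and (rows, cols)")
    return [
        sum(w[r * cols + c] * x[c] for c in range(cols))
        for r in range(rows)
    ]
```

The dequantized list can be wrapped with `torch.tensor(w).view(rows, cols)` and multiplied on the GPU as a baseline. A dedicated CUDA kernel would normally fuse the dequantization into the matmul instead of materializing the full-precision weight.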

Eutenacity added the "question" label (Further information is requested) on Sep 11, 2024
