HPC dev, web3 dev, CUDA dev.
Use C++/CUDA/Python programming, for:
- CPU-side optimization (Instruction Set/.asm).
- CUDA low-level operators dev (general cuda-core kernels, cuBLAS, wmma, PTX).
- LLM inference system, especially hpc operator support.
Currently High Performance Computing interning in the Paddle R&D team at Baidu.
If there is any issue or project I can help you with, email me: yazhu00776@outlook.com.