Intro

I program in C++, CUDA, and Python, focusing on:

- CUDA operator development (high-performance computing)
- LLM inference systems

I am currently interning on the Paddle R&D team at Baidu.

Contact

E-mail: d31409163@163.com