feat: Support bfloat16 and ensure valid precision and activation functions consistent everywhere #1463
test_cuda.yml
on: pull_request
Test Python and C++ on CUDA
0s
Pass testing on CUDA
3s