AcceleratedKernels v0.2.1
Merged pull requests:
- Add Buildkite CI for CUDA (#9) (@jpsamaroo)
- added foreach + tests. Started updating indices within kernels to use… local types without int64 promotions - about 25% faster in sort for example. Set default block_size to 256 (#11) (@anicusan)
Closed issues:
- Support for a
:serial
scheduler (#7)