Releases: JuliaGPU/AcceleratedKernels.jl
Releases · JuliaGPU/AcceleratedKernels.jl
v0.2.1
AcceleratedKernels v0.2.1
Merged pull requests:
- Add Buildkite CI for CUDA (#9) (@jpsamaroo)
- added foreach + tests. Started updating indices within kernels to use… local types without int64 promotions - about 25% faster in sort for example. Set default block_size to 256 (#11) (@anicusan)
Closed issues:
- Support for a
:serial
scheduler (#7)
v0.2.0
AcceleratedKernels v0.2.0
- N-dimensional
reduce
andmapreduce
map
- docs + and tests for each of the above.
- In-place functions now also return the modified argument as in Base.
Merged pull requests: