Refactor package into one part dealing with LLVM and one part that builds a Vec on top of that #63

exception of some of the indexing implemented by tkf) while keeping the API intact. The reason for this is that I felt that the code could gain a lot of clarity by clearly separating the parts that deals with LLVM/`llvmcall` and then build a `Vec` on top of that. The number of lines of code has also been reduced from ~1600 to 1000. The code is structured as follows: - `LLVM_Intrinsics.jl` is pretty much a direct mapping of Julia Vectors (`NTuple{N, VecElement{T}}`) to the operators and intrinsics defined in https://llvm.org/docs/LangRef.html. It contains almost no higher level logic. - `simdvec.jl` contains the `Vec` (wrapping the tuple of `VecElement`s) with definitions defined on it that maps to the intrinsics defined in `LLVM.jl`. In some cases this is pretty automatic but in some cases requires some logic (like in the bitshifts partly to avoid undefined behavior or in the different conversions). - `arrayops.jl` is the stuff that deals with Julia `Array` like `vload`, `vstore`, `vgather`. Things that have gotten added to the API: - The `count_ones, count_zeros, leading_ones, leading_zeros, trailing_ones, trailing_zeros` family of functions. - Type conversions and different types of reinterprets from scalar to vectors and back and between vectors of different size: ```jl julia> v = Vec((Int32(2), Int32(4))) <2 x Int32>[2, 4] julia> reinterpret(Int64, v) 17179869186 julia> reinterpret(Vec{4, Int16}, v) <4 x Int16>[2, 0, 4, 0] julia> reinterpret(Vec{2, Int32}, 4) <2 x Int32>[4, 0] julia> convert(Vec{2, Float32}, v) <2 x Float32>[2.0, 4.0] ``` - Uses the LLVM vector reduction intrinsics (https://llvm.org/docs/LangRef.html#experimental-vector-reduction-intrinsics) instead of a hand rolled reducer. Things that has been removed from the API: - Removed the `Val` arguments from many functions (`setindex`, `>>` etc). Julia's constant propagation + LLVM's optimization are enough for these not to be needed. Things are specialized on the constant just as well as if using `Val`. - Removed the `Val{}` arguments and just use `Val()` consistently everywhere. - Removed `exp10`. This used to just call `10^v` but the reason you would use `exp10` is that there is a more efficient implementation for it than the naive one. I feel that providing `exp10` gives the false impression that it provides a benefit over the naive version Co-Authored-By: Valentin Churavy <vchuravy@users.noreply.github.com>

fixup: fix supported element types

…tead

the error we get is "LLVM ERROR: Symbols not found: { __mulodi4 }" which seems like it would require compiler-rt support"

Commits on Mar 5, 2020

add an extra fastmath test

KristofferC committed Mar 5, 2020

Configuration menu

View commit details

Copy full SHA for 05949bc

Browse repository at this point

Copy the full SHA

05949bc View commit details

Browse the repository at this point in the history

Commits on Mar 22, 2020

fix some boundschecks

Kristoffer Carlsson authored Mar 22, 2020

Configuration menu

View commit details

Copy full SHA for 37b2340

Browse repository at this point

Copy the full SHA

37b2340 View commit details

Browse the repository at this point in the history

Commits on Mar 23, 2020

add docs for fastmath

KristofferC committed Mar 23, 2020

Configuration menu

View commit details

Copy full SHA for e8f5815

Browse repository at this point

Copy the full SHA

e8f5815 View commit details

Browse the repository at this point in the history

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor package into one part dealing with LLVM and one part that builds a Vec on top of that #63

Refactor package into one part dealing with LLVM and one part that builds a Vec on top of that #63

Commits on Feb 21, 2020

Commits on Feb 22, 2020

Commits on Feb 23, 2020

Commits on Mar 4, 2020

Commits on Mar 5, 2020

Commits on Mar 22, 2020

Commits on Mar 23, 2020

Commits on Mar 31, 2020