
Implement core::ops #10


Merged · 16 commits · Oct 2, 2020

Conversation

calebzulawski
Member

Closes #5.

Note that none of this is tested--I'm not actually sure where to start with testing. @workingjubilee any recommendations from packed-simd?

Also note that I didn't implement bitwise ops for floats. We can discuss it, but one solution mirroring f32 and f64 would be to_bits and from_bits fns that use u32 and u64 vectors.
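A minimal sketch of that approach (the array-backed F32x4 type here is a hypothetical stand-in, not the crate's actual type; only f32::to_bits/from_bits are real std APIs):

```rust
use std::ops::BitAnd;

// Hypothetical stand-in for a 4-lane f32 vector; the real crate's type
// and its to_bits/from_bits signatures may differ.
#[derive(Clone, Copy, Debug, PartialEq)]
struct F32x4([f32; 4]);

impl F32x4 {
    // View the lanes as raw u32 bit patterns, mirroring f32::to_bits.
    fn to_bits(self) -> [u32; 4] {
        self.0.map(f32::to_bits)
    }
    // Reinterpret u32 bit patterns as f32 lanes, mirroring f32::from_bits.
    fn from_bits(bits: [u32; 4]) -> Self {
        F32x4(bits.map(f32::from_bits))
    }
}

impl BitAnd for F32x4 {
    type Output = Self;
    fn bitand(self, rhs: Self) -> Self {
        let (a, b) = (self.to_bits(), rhs.to_bits());
        Self::from_bits([a[0] & b[0], a[1] & b[1], a[2] & b[2], a[3] & b[3]])
    }
}

fn main() {
    // Clearing the sign bit with AND implements abs, a classic use case.
    let x = F32x4([1.0, -1.0, 2.0, -0.5]);
    let mask = F32x4::from_bits([0x7fff_ffff; 4]);
    assert_eq!(x & mask, F32x4([1.0, 1.0, 2.0, 0.5]));
}
```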

@Lokathor
Contributor

Lokathor commented Sep 28, 2020

I think that this is effectively blocked on having CI basics in place to run the tests. Maybe we can have that be a subject at today's meeting.

The tests themselves can be simple math tests in a tests/ folder like any normal integration test suite. Example inputs, expected output, assert that the actual output is equal to (or within a tolerance of) the expected output.

For the specific organization of this sort of "many small code bits" crate, I usually make one tests/ file per src/ file, and then each test name is generally one of impl_[trait]_for_[type], type_[method], or test_[property].
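A sketch of one such test in that shape (file and test names follow the convention above but are illustrative; the check runs over plain lanes rather than a real vector type):

```rust
// Would live as a #[test] fn in e.g. tests/ops.rs; shown here with a main
// so the sketch is runnable standalone.
fn impl_add_for_f32x4() {
    // Example inputs, expected output, tolerance-based comparison.
    let a = [1.0f32, 2.0, 3.0, 4.0];
    let b = [10.0f32, 20.0, 30.0, 40.0];
    let expected = [11.0f32, 22.0, 33.0, 44.0];
    for i in 0..4 {
        // Stands in for lane i of (a + b) on the vector type.
        let actual = a[i] + b[i];
        assert!((actual - expected[i]).abs() <= f32::EPSILON * expected[i]);
    }
}

fn main() {
    impl_add_for_f32x4();
}
```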

I'd be happy to just donate the entire test suite from wide to this crate. Other than possible renames here or there, I expect that stdsimd should be able to pass any test that wide does. It doesn't cover all the stuff that we'd want since it's focused on actual hardware ops, so there are no tests for integer simd division for example, but it's a good start probably.

@workingjubilee workingjubilee added the "I-nominated (We should discuss this at the next weekly meeting)" label Sep 28, 2020
@workingjubilee
Member

workingjubilee commented Sep 28, 2020

We will eventually want to build up to running most of the same set of example projects that packed_simd did, as well. They look a lot like real programs and will provide useful full-scale integration testing and benchmarks, essentially.

Lower-level tests will be useful for revealing analytical diagnostics, and there I am mostly indifferent as long as the tests are easily findable. packed_simd had macroed tests as part of its impls, and I often found it difficult to tell where the test code ended and the implementation code began.

otherwise: What Lokathor said. Copy-pasta from wide sounds delicious.

@Lokathor
Contributor

Lokathor commented Sep 28, 2020

Heavy macros in the tests sounds pretty gross.

However, a macro that can take the inputs+outputs for the widest value we want to test (say: f32x16), and then automatically generate the same test at lower lane counts (eg: f32x8, f32x4, f32x2) by just skipping the values at the end of the list, or even by running more than one assert cycle, well, that does sound extremely useful.
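A toy sketch of that macro shape (the macro name and the scalar-op closure are made up; a real version would construct actual vector types at each width instead of indexing arrays):

```rust
// Hypothetical macro: take inputs/outputs for the widest case (16 lanes)
// and re-run the same check at widths 16, 8, 4, and 2 by only consuming
// the front of each list.
macro_rules! test_at_widths {
    ($op:expr, $a:expr, $b:expr, $expected:expr) => {{
        let (a, b, e) = ($a, $b, $expected);
        for &lanes in &[16usize, 8, 4, 2] {
            for i in 0..lanes {
                assert_eq!($op(a[i], b[i]), e[i], "lane {} at width {}", i, lanes);
            }
        }
    }};
}

fn main() {
    // 16 lanes of data, checked at every narrower width too.
    let a: [f32; 16] = core::array::from_fn(|i| i as f32);
    let b = [1.0f32; 16];
    let e: [f32; 16] = core::array::from_fn(|i| i as f32 + 1.0);
    test_at_widths!(|x: f32, y: f32| x + y, a, b, e);
}
```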

@workingjubilee workingjubilee removed the "I-nominated (We should discuss this at the next weekly meeting)" label Sep 28, 2020
@workingjubilee
Member

This PR is waiting on #3 to go first.

@calebzulawski calebzulawski marked this pull request as draft September 28, 2020 22:34
@calebzulawski
Member Author

I'll take that opportunity to clean up a couple things.

@KodrAus
Contributor

KodrAus commented Sep 29, 2020

Alrighty, after a rebase this should pick up our initial CI

@calebzulawski calebzulawski marked this pull request as ready for review September 29, 2020 02:20
@Lokathor Lokathor left a comment
Contributor


I think most of the vec/scalar ops should be commutative when the op is.

that is, if you can do f32x4 + f32 -> f32x4, you should probably also be able to do f32 + f32x4 -> f32x4.

Obviously this would only apply to Op, not OpAssign variants.
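A sketch of the impl pair being suggested (again with a hypothetical array-backed F32x4 standing in for the real vector type):

```rust
use std::ops::Add;

// Hypothetical stand-in for the real 4-lane f32 vector type.
#[derive(Clone, Copy, Debug, PartialEq)]
struct F32x4([f32; 4]);

// f32x4 + f32: splat the scalar across lanes.
impl Add<f32> for F32x4 {
    type Output = F32x4;
    fn add(self, rhs: f32) -> F32x4 {
        F32x4(self.0.map(|x| x + rhs))
    }
}

// f32 + f32x4: the commutative counterpart, defined in terms of the first.
impl Add<F32x4> for f32 {
    type Output = F32x4;
    fn add(self, rhs: F32x4) -> F32x4 {
        rhs + self
    }
}

fn main() {
    let v = F32x4([1.0, 2.0, 3.0, 4.0]);
    assert_eq!(v + 1.0, F32x4([2.0, 3.0, 4.0, 5.0]));
    assert_eq!(1.0 + v, F32x4([2.0, 3.0, 4.0, 5.0])); // same result, swapped
}
```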

@Lokathor
Contributor

Lokathor commented Sep 29, 2020

All set to merge?

EDIT: oh, right, tests, ignore me XD

@calebzulawski
Member Author

I just pushed some floating point tests (which uncovered that Neg can't be implemented with 0 - val, since it doesn't work for -0). If we're okay with this test style I'll expand it to the other types as well.

@programmerjake
Member

-0.0 - val works for neg assuming LLVM's default rounding mode (round to nearest, ties to even).
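A quick scalar check of the signed-zero cases (plain scalar Rust, not the vector types) confirms why 0 - val fails and -0.0 - val doesn't:

```rust
// Negating +0.0 must produce -0.0, but under round-to-nearest
// 0.0 - 0.0 is +0.0, so Neg as 0.0 - val gets the sign wrong.
// With -0.0 as lhs: -0.0 - 0.0 == -0.0 and -0.0 - (-0.0) == +0.0,
// matching true negation for every input.
fn main() {
    assert!((0.0f32 - 0.0).is_sign_positive());     // wrong sign for -(0.0)
    assert!((-0.0f32 - 0.0).is_sign_negative());    // matches -(0.0)
    assert!((-0.0f32 - (-0.0)).is_sign_positive()); // matches -(-0.0)
    assert_eq!(-0.0f32 - 1.5, -1.5);                // nonzero values unaffected
    assert_eq!(-0.0f32 - (-1.5), 1.5);
}
```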

@workingjubilee
Member

I am not sure how I feel about the f32x4 + f32 example. I understand the intent behind including the operation but I am not sure if that is the best API for it, as my internal sense of what that would express actually is non-commutative:

f32x4 + f32 // -> f32x4, the rhs f32 added to each lane of lhs
f32 + f32x4 // -> f32, lhs added to the summation of rhs

And I feel the rhs vector is more ambiguous in general. And I must note, in general Rust is very conservative with extending Add impls even in "obvious" cases, e.g. compare i32 + i64. Of course you could have it return an i64, always, but

error[E0277]: cannot add `i32` to `i64`
 --> src/main.rs:2:18
  |
2 |     let n = 4i64 + -3i32;
  |                  ^ no implementation for `i64 + i32`
  |
  = help: the trait `std::ops::Add<i32>` is not implemented for `i64`

I don't believe packed_simd implemented Add for scalar + vector, but that might have been a technical limitation and not on purpose? In any case, I feel like having a fn to widen a scalar into an appropriate vector would be better than implementing coercion, and I believe with appropriate const generics madness it would be possible to have it handled by type inference(!) so as to make it as ergonomic as possible.
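One hedged sketch of the "widen a scalar" idea (the Simd type here is a made-up array-backed stand-in; the point is that with const generics the lane count can come from type inference rather than being spelled out):

```rust
// Hypothetical generic vector: T lanes, N of them.
#[derive(Clone, Copy, Debug, PartialEq)]
struct Simd<T, const N: usize>([T; N]);

impl<T: Copy, const N: usize> Simd<T, N> {
    // Widen a scalar into a vector by repeating it in every lane.
    fn splat(value: T) -> Self {
        Simd([value; N])
    }
}

fn main() {
    // N = 4 is inferred from the annotation here; in arithmetic like
    // `v + Simd::splat(1.5)` it would be inferred from the other operand.
    let v: Simd<f32, 4> = Simd::splat(1.5);
    assert_eq!(v, Simd([1.5; 4]));
}
```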

@calebzulawski
Member Author

Personally, I'm fine with removing the mixed scalar/vector ops. a + f32x4::splat(b) or simply a + b.into() isn't too bad in my opinion and might be less likely to be confusing.

@Lokathor
Contributor

Lokathor commented Oct 1, 2020

Does a + b.into() infer properly?

I'll say that auto-splatting ops impls are one of the things I was almost immediately asked about adding when I released wide-0.5.0 without them. Code can get noisy really quickly without auto-splatting. In particular, if you have a scalar function, it's easier to convert it to a vector function if the literals will automatically promote themselves.

Also, C++ libs generally do have this, and that's another language and all, but it does mean that people using SIMD are generally comfortable with it.

@calebzulawski
Member Author

Good point, I just tried it and using Into doesn't infer correctly. I'm a little less sold since that doesn't work. You could always implement a function/trait on scalars to splat them, like a + b.splat() but I'm not sure that's any better.
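For what it's worth, a splat trait on scalars can be made to infer its target type from the Add impl, unlike Into with a generic Rhs; a toy sketch (every name here is hypothetical):

```rust
use std::ops::Add;

#[derive(Clone, Copy, Debug, PartialEq)]
struct F32x4([f32; 4]);

impl Add for F32x4 {
    type Output = Self;
    fn add(self, rhs: Self) -> Self {
        let mut out = self.0;
        for i in 0..4 {
            out[i] += rhs.0[i];
        }
        F32x4(out)
    }
}

// Hypothetical scalar-side trait: `b.splat()` can infer its target
// because Add's Rhs fixes the required type.
trait Splat<V> {
    fn splat(self) -> V;
}

impl Splat<F32x4> for f32 {
    fn splat(self) -> F32x4 {
        F32x4([self; 4])
    }
}

fn main() {
    let a = F32x4([1.0, 2.0, 3.0, 4.0]);
    // Target type of splat() is inferred from `a +`.
    let sum = a + 1.0f32.splat();
    assert_eq!(sum, F32x4([2.0, 3.0, 4.0, 5.0]));
}
```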

@workingjubilee
Member

...is there a For/Into impl here for that?

C++ and C implement some pretty intense mathematical coercion by default and Rust deliberately chose another path. I would like at least to experiment with an alternative API here because I think there is a fiendish trick I can make work.

@Lokathor
Contributor

Lokathor commented Oct 1, 2020

What makes Rust's minimal coercion work out well is that you can still quite casually convert numbers by throwing as _ onto anything. Since that's not available with simd types, things get wordy fast.

Feel free to continue to try and think of good alternatives, but literals particularly are usually a pain point like this.

@calebzulawski
Member Author

> ...is there a For/Into impl here for that?
>
> C++ and C implement some pretty intense mathematical coercion by default and Rust deliberately chose another path. I would like at least to experiment with an alternative API here because I think there is a fiendish trick I can make work.

All of the vectors implement From<Scalar>. I guess it's getting caught up on the generic Rhs type.

I'm also not sure that it's really considered coercion--while the scalar gets splatted as far as the actual implementation goes, conceptually it can be thought of without any splat at all. I'm also not sure that behavior is unique to C++ libs; MATLAB and Wolfram automatically broadcast scalars for elementwise ops.

@calebzulawski
Member Author

Unrelated to the scalar-vector ops, are we ok with tests in this form? Should I expand them to the rest of the types?

@Lokathor
Contributor

Lokathor commented Oct 1, 2020

It's... quite full of macros, but seems to still make sense.

Also, our sample data probably needs a little more flair than just small positive whole numbers. As soon as we start mixing it up a bit, we'll probably need an ApproxEq as well as just BitEq.

@workingjubilee
Member

workingjubilee commented Oct 1, 2020

We will always want to test:

  • desired interactions with ty::MAX and ty::MIN
  • desired interactions with 1 and -1
  • desired interactions with 0 as ty

Because these are both exceedingly simple and also exceedingly revealing for some edge cases.
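These inputs are cheap to write down and immediately exercise overflow and sign boundaries; for example, on scalar i32 (wrapping_add here stands in for whatever overflow semantics the vector ops settle on):

```rust
// Scalar sketch of the edge values worth feeding every integer op.
fn main() {
    let edges = [i32::MAX, i32::MIN, 1, -1, 0];
    for &x in &edges {
        // Identity under addition should hold at every edge value.
        assert_eq!(x.wrapping_add(0), x);
    }
    // The revealing cases: overflow wraps around at the extremes.
    assert_eq!(i32::MAX.wrapping_add(1), i32::MIN);
    assert_eq!(i32::MIN.wrapping_sub(1), i32::MAX);
}
```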

@Lokathor
Contributor

Lokathor commented Oct 1, 2020

also for floats:

  • nan
  • very large values
  • very small values
  • infinity, neg infinity
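Each of those float specials pins down a distinct behavior; a few scalar facts the vector tests would want to mirror:

```rust
// Scalar reference behavior for the float edge inputs listed above.
fn main() {
    let nan = f32::NAN;
    assert!(nan != nan);                            // NaN is unequal to itself
    assert!((nan + 1.0).is_nan());                  // NaN propagates through ops
    assert_eq!(f32::INFINITY + 1.0, f32::INFINITY); // infinity absorbs finite adds
    assert!((f32::MAX * 2.0).is_infinite());        // very large: overflow to inf
    assert!((f32::MIN_POSITIVE / 2.0) > 0.0);       // very small: subnormal, not 0
}
```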

@workingjubilee
Member

I think for very large and small values we can use a crate like quickcheck to test our assumptions, and we can defer those until later. NaN and the infinities are also important, though we're going to have a hard time testing NaN due to LLVM's canonicalizing behavior with them.

@Lokathor
Contributor

Lokathor commented Oct 1, 2020

Well, theoretically all NaNs are equally "you'll get nonsense", so we shouldn't have to worry about specific NaN bit patterns. That would itself be non-portable, I think.

Also, it seems like we've got 16 lanes or so of test data space, and we can run the op a second time to get a second set of 16 cases, so there's no big harm in having some preset "weird input" cases. We can also have quickcheck of course.

@calebzulawski
Member Author

Ok, didn't implement any edge cases (and I definitely like the idea of something like quickcheck) but everything should work, nominally.

@workingjubilee
Member

I think we can TODO more tests and rebase-and-merge this on green, then, to unblock parallelized work on vector math and more tests.


Successfully merging this pull request may close these issues.

impl core::ops for core::simd
5 participants