Add test suite #186

Schultzer · 2019-06-21T00:24:20Z

Hi guys.

I created a test suite. when running this in dev it spotted overflow in cos, fma, sin, sincos, tan functions.

let me know what you guys think.

I have not run the formatter yet.

alexcrichton · 2019-06-21T01:05:47Z

Nice! I'm all for adding more tests, especially if they're just executed with simple cargo test commands :)

gnzlbg · 2019-06-21T11:58:24Z

Looks good to me in general.

I wonder whether we could auto-generate at least some of these tests, e.g., via a proc macro like:

#[libm_test(type = "trigonometric", include = "pi/2", ulp_max_error = "4.0")]
fn sind(x: f64) -> f64; 
#[libm_test(include(x = "2"))]
fn powf(x: f32, y: f32) -> f32;

Many of these tests appear to be going for corner cases inputs, e.g., +-INFINITY, +-0, sqrt(...), pi / n, 2^n, e^n, etc. and their immediate neighborhoods. Depending on the type of function, some other regions are also worth testing, and well for binary inputs we can compute the cartesian product, and for unary f32 functions we can actually test all inputs. We could generate tests like this, and then use ULPs to compare the results against, e.g., the systems libm, requiring 0.5 ulps by default, and allowing the user to pass in a higher number as part of the proc macro, to allow for slight deviations in, e.g., transcendental functions.

For the manual tests, it might be interesting to consider moving the test inputs and results to some sort of external file, that's loaded by the test suite, so that, e.g., in case of a modification that introduces differences, one might just need to do something like --bless to update the testsuite, as opposed to having to go through all of these by hand.

@Schultzer I think adding these manual tests is good for now, but do you think these approach might be worth exploring in the future?

Schultzer · 2019-06-21T16:05:53Z

@gnzlbg do you know of any projects they're doing something similar?
I've done this one time before with my blas impl. where I used JSON-FORTRAN to generate fortran result into a json file then having serde deserialize them. but I feel like including serde isn't really an option here.

I like your idea with the proc macro. I'm gonna look into that.

gnzlbg · 2019-06-21T16:10:40Z

do you know of any projects they're doing something similar?

Kind of. stdsimd auto-generates tests using a proc macro.

but I feel like including serde isn't really an option here.

Why do you think that? AFAICT these dependencies only need to be dev-dependencies.

Schultzer · 2019-06-21T16:19:37Z

My thinking was because one of the goals are to include this in core. But if there is no limit on dev tools then I’ll refactor this draft

alexcrichton · 2019-06-21T17:00:38Z

FWIW we only include the library source in libcore, not the test dependencies. It's fine to pull in whatever is needed here for testing this crate.

Schultzer · 2019-06-24T23:26:15Z

ping @alexcrichton and @gnzlbg, I've done the first part and moved the test out and implemented a proc-macro for the nearest function.
This was the easiest way I could find for making it more DRY.

I'm planning to implement one for each trigonometric, exponential, power, hyperbolic, error, gamma, and manipulation functions, etc.

Please let me know what you guys think about this approach.

alexcrichton · 2019-06-25T06:27:30Z

Hm I'm not really sure I fully understand this, why are tests generated via a macro rather than being written as before? Is there enough shared between each set of tests that it's worth having in a macro?

gnzlbg · 2019-06-25T06:51:38Z

@alexcrichton for each type of math functions you often want to test similar input domains. For example, for the trigonometric functions you often want to test all the floats close to 0, +-pi/4, pi/2, 3pi/4, etc. So while for each of the functions one might want to have extra tests, testing the classic bunch is something that one wants to do by default against all of them.

alexcrichton · 2019-06-25T07:39:35Z

While that's true the randomized testing we already do I think covers a good portion of that. I'm also not sure why it requires extraction into a full-blown procedural macro vs just writing test cases by hand.

gnzlbg · 2019-06-25T11:29:51Z

I'm also not sure why it requires extraction into a full-blown procedural macro vs just writing test cases by hand.

This doesn't require a proc macro, but I do think that hardcoding all inputs and outputs is a bad idea, except for super special cases.

With a good testing crate, one can write this code easily by hand, e.g.,

fn sind(x: f64)->f64;

#[cfg(test)]
mod tests {
  #[test] fn sind_zero() { libm_test::trigonometric(|x| sind(x), Around(0_f64), Ulp(4)); }
  #[test] fn sind_pi_half() { libm_test::trigonometric(|x| sind(x), Around(PI / 2.), Ulp(4)); }
  #[test] fn sind_infinity() { ... }
  #[test] fn sind_denormals() { ... }
...
  #[test] fn sind_special() { libm_test::assert_system(|x| sind(x), Exact(MySpecialVal), Ulp(2)) }
}

With the proc macro we can just generate all the boiler plate tests and the code can focus on the special ones instead, e.g.,

#[libm_test(type = "trigonometric", ulp = "4")] 
fn sind(x: f64)->f64;

#[test] fn sind_special() { libm_test::check_output(|x| sind(x), Input(MySpecialVal), Ulp(2)) }

crates/libm-test/Cargo.toml

src/math/floorf.rs

alexcrichton · 2019-06-26T09:28:04Z

I primarily just want to make sure that tests are as readable and easy to modify as possible, as I think that's the priority rather than ensuring we have deduplicated testing and such. The cost of a macro for something like this I feel is pretty high, especially if the macro has a lot of complexity.

Schultzer · 2019-07-01T21:09:05Z

This has been refactored, and includes the validation suite.

closes #150

gnzlbg

@alexcrichton this LGTM.

crates/libm-test/src/lib.rs

Schultzer · 2019-07-01T21:28:39Z

@alexcrichton let me know if you want me to squash the commits.

alexcrichton

Hm this looks like it deletes the existing test-generation harness? Why was that deleted?

crates/libm-test/Cargo.toml

crates/libm-test/src/validation.rs

Schultzer · 2019-07-02T01:42:43Z

It's been moved to libm-test crate, and refactored to include validation for x86_64, and cases for nan and infinities, as mention above we plan to include precision testing via mpfr, so keeping it all together seems like the right way.

burrbull · 2019-07-02T02:28:48Z

When you say mpfr, you mean rug?

alexcrichton · 2019-07-02T02:41:11Z

It's been moved to libm-test crate

Hm ok I see that but it seems to use sort of odd-looking macros now and has to be exhaustively tested? Previously it would automatically test new functions and they'd conform to a known set of signatures. I'd prefer to avoid new macro soup and duplication if possible.

Schultzer · 2019-07-02T03:40:41Z

Hm ok I see that but it seems to use sort of odd-looking macros now and has to be exhaustively tested?

The only part we are exhaustively testing is unary f32 functions on x86_64. The exhaustive tests did catch some bugs.

it would automatically test new functions

Thats one reason why I liked the proc-macro idea, then it would be explicit what we are testing, which it is also now.

I'd prefer to avoid new macro soup and duplication if possible

Which duplication are you referring too?

I would argue that it's easier to change the tests and understand whats get tested now, than before.

Schultzer · 2019-07-02T03:42:38Z

@burrbull It could be rug or another crate there is quite a few.

Schultzer · 2019-07-02T03:58:17Z

I plan to add some validation for f64 too, this could properly catch issues like #165

gnzlbg · 2019-07-02T06:38:41Z

Previously it would automatically test new functions and they'd conform to a known set of signatures.

Yes. That was the main advantage of the proc macro approach. Now new functions need to be manually added to the test suite. That's a one line change, but needs to be done manually.

Signed-off-by: Benjamin Schultzer <benjamin@schultzer.com>

Schultzer · 2019-07-02T18:32:40Z

@alexcrichton would you prefer to just write these tests out in each function?

This has to do with libcore itself being built with panic=unwind, but why not just link to std and have it provide these?

Does that mean that when we call 0f32.acos() it makes a call to extern "C" { pub fn acosf(x: f32) -> f32; } ?

burrbull · 2019-07-03T11:00:28Z

I made initial commit for support test sleef against rug::Float if someone is interested
burrbull/sleef-rs@289e46e

Results:

Max ULP = 0.703125 at -4.944768947135878e307, Average ULP = 0.25104109375	test f64::u10::test_sin ... ok

Max ULP = 1.53125 at -1.7301104562983707e308, Average ULP = 0.329521328125	test f64::u35::test_cos ... ok

I think something like this can be done for libm.

alexcrichton · 2019-07-03T16:23:39Z

Sorry I really do not have a ton of time to maintain this crate, and this PR alone has over 40 (!) comments already for what I was hoping would be very simple, adding some more tests. This is a very important crate to rust-lang/rust and arbitrary rewrites of infrastructure need to be carefully done rather than simply proposed off-the-cuff.

I see testing this crate as falling into a few major categories:

One is handwritten tests for each function. For example regression tests due to bugs that have been fixed with specific inputs/outputs.
Tests against a reference implementation (like musl). This is what the build script covers today. We need to guarantee that this is present for all functions that are defined in musl.
Eventually tests like fuzz tests or some sort of more random/exhaustive checking could be performed, or something like that.

I don't know why the reference tests are being rewritten in this PR. I don't know why you can't already just write normal unit tests for each function. I'm not really sure why there's a big rewrite happening here vs simply adding new tests in appropriate modules.

Schultzer force-pushed the add-test-suite branch 2 times, most recently from 1b0c539 to 5e9bae1 Compare June 21, 2019 00:52

Schultzer force-pushed the add-test-suite branch 3 times, most recently from e9e3c5f to 78c28e5 Compare June 24, 2019 23:18

gnzlbg reviewed Jun 25, 2019

View reviewed changes

crates/libm-test/Cargo.toml Outdated Show resolved Hide resolved

src/math/floorf.rs Outdated Show resolved Hide resolved

Schultzer force-pushed the add-test-suite branch 12 times, most recently from a352ede to b709039 Compare July 1, 2019 04:24

Schultzer force-pushed the add-test-suite branch 2 times, most recently from 7d2f781 to ecfec13 Compare July 1, 2019 21:02

Schultzer force-pushed the add-test-suite branch from ecfec13 to 84a516d Compare July 1, 2019 21:09

gnzlbg approved these changes Jul 1, 2019

View reviewed changes

crates/libm-test/src/lib.rs Show resolved Hide resolved

Schultzer force-pushed the add-test-suite branch from 84a516d to 33ab99d Compare July 1, 2019 21:19

alexcrichton reviewed Jul 2, 2019

View reviewed changes

crates/libm-test/Cargo.toml Show resolved Hide resolved

crates/libm-test/src/validation.rs Outdated Show resolved Hide resolved

Schultzer force-pushed the add-test-suite branch 4 times, most recently from 4415fd1 to bd02d4a Compare July 2, 2019 02:03

Schultzer force-pushed the add-test-suite branch 3 times, most recently from 15c9e0e to 2d242bc Compare July 2, 2019 04:32

Schultzer force-pushed the add-test-suite branch 2 times, most recently from 9ded3ac to fa89234 Compare July 2, 2019 17:42

Move musl tests to test-framework crate

a934bad

Signed-off-by: Benjamin Schultzer <benjamin@schultzer.com>

Schultzer force-pushed the add-test-suite branch from fa89234 to a934bad Compare July 2, 2019 18:23

gnzlbg mentioned this pull request Jul 5, 2019

Refactor testing framework into a libm-test #198

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add test suite #186

Add test suite #186

Schultzer commented Jun 21, 2019 •

edited

Loading

alexcrichton commented Jun 21, 2019

gnzlbg commented Jun 21, 2019 •

edited

Loading

Schultzer commented Jun 21, 2019

gnzlbg commented Jun 21, 2019

Schultzer commented Jun 21, 2019

alexcrichton commented Jun 21, 2019

Schultzer commented Jun 24, 2019

alexcrichton commented Jun 25, 2019

gnzlbg commented Jun 25, 2019 via email

alexcrichton commented Jun 25, 2019

gnzlbg commented Jun 25, 2019 •

edited

Loading

alexcrichton commented Jun 26, 2019

Schultzer commented Jul 1, 2019

gnzlbg left a comment

Schultzer commented Jul 1, 2019

alexcrichton left a comment

Schultzer commented Jul 2, 2019

burrbull commented Jul 2, 2019

alexcrichton commented Jul 2, 2019

Schultzer commented Jul 2, 2019 •

edited

Loading

Schultzer commented Jul 2, 2019

Schultzer commented Jul 2, 2019

gnzlbg commented Jul 2, 2019

Schultzer commented Jul 2, 2019 •

edited

Loading

burrbull commented Jul 3, 2019

alexcrichton commented Jul 3, 2019

Add test suite #186

Are you sure you want to change the base?

Add test suite #186

Conversation

Schultzer commented Jun 21, 2019 • edited Loading

alexcrichton commented Jun 21, 2019

gnzlbg commented Jun 21, 2019 • edited Loading

Schultzer commented Jun 21, 2019

gnzlbg commented Jun 21, 2019

Schultzer commented Jun 21, 2019

alexcrichton commented Jun 21, 2019

Schultzer commented Jun 24, 2019

alexcrichton commented Jun 25, 2019

gnzlbg commented Jun 25, 2019 via email

alexcrichton commented Jun 25, 2019

gnzlbg commented Jun 25, 2019 • edited Loading

alexcrichton commented Jun 26, 2019

Schultzer commented Jul 1, 2019

gnzlbg left a comment

Choose a reason for hiding this comment

Schultzer commented Jul 1, 2019

alexcrichton left a comment

Choose a reason for hiding this comment

Schultzer commented Jul 2, 2019

burrbull commented Jul 2, 2019

alexcrichton commented Jul 2, 2019

Schultzer commented Jul 2, 2019 • edited Loading

Schultzer commented Jul 2, 2019

Schultzer commented Jul 2, 2019

gnzlbg commented Jul 2, 2019

Schultzer commented Jul 2, 2019 • edited Loading

burrbull commented Jul 3, 2019

alexcrichton commented Jul 3, 2019

Schultzer commented Jun 21, 2019 •

edited

Loading

gnzlbg commented Jun 21, 2019 •

edited

Loading

gnzlbg commented Jun 25, 2019 •

edited

Loading

Schultzer commented Jul 2, 2019 •

edited

Loading

Schultzer commented Jul 2, 2019 •

edited

Loading