Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix sum() accuracy #10927

Merged
merged 1 commit into from
Dec 19, 2013
Merged

Fix sum() accuracy #10927

merged 1 commit into from
Dec 19, 2013

Conversation

g3xzh
Copy link
Contributor

@g3xzh g3xzh commented Dec 12, 2013

[1e20, 1.0, -1e20].sum() returns 0.0. This happens because during
the summation, 1.0 is too small relative to 1e20, making it
negligible.

I have tried Kahan summation but it hasn't fixed the problem.
Therefore, I've used Python's fsum() implementation.
For more details, read:
www.cs.cmu.edu/~quake-papers/robust-arithmetic.ps
#10851

Python's fsum (msum)
http://code.activestate.com/recipes/393090/

@huonw, your feedback is more than welcome.
It looks unpolished; Do you have suggestions how to make it more beautiful and elegant?

Thanks in advance,

for &(i1, f1) in modif_tuple.iter() {
partials[i1] = f1;
}
if (partials.len() > i as uint) {
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

These parens are not necessary.

@g3xzh
Copy link
Contributor Author

g3xzh commented Dec 12, 2013

I am not sure if the benchmark tests are good enough.

fn sum(self) -> f64 {
self.iter().fold(0.0, |p,q| p + *q)
/*
* Note: this method sacrifices performance at the altar of accuracy
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This should be a doc comment on the method definition in the trait, otherwise, unfortunately, it's not visible in the docs. That is, something like,

    /// The sum of the samples.
    ///
    /// Note: this method sacrifices performance at the altar of 
    /// accuracy. Depends... [etc.]
    fn sum(self) -> f64;

(Also, we might as well cite this properly to help guard against the links going dead (i.e. still make it possible to find the paper), e.g. like I have been doing in std::rand, so something like Shewchuk J.R., ["Adaptive Precision Floating-Point Arithmetic and Fast Robust Geometric Predicates"](http://www.cs.cmu.edu/~quake-papers/robust-arithmetic.ps) *Discrete & Computational Geometry 18*, 3 (Oct 1997), 305-363. (the page I'm basing this off, feel free to use some other citation format.))

@huonw
Copy link
Member

huonw commented Dec 12, 2013

Looks pretty good; address my comments, and rebase on top of master and we should be good to go!

@jfager
Copy link
Contributor

jfager commented Dec 14, 2013

For reference, here is how this got added to Python: http://bugs.python.org/issue2819. Their final C implementation handles subtleties with overflow and rounding that direct translation of Raymond Hettinger's python code misses.

@huonw
Copy link
Member

huonw commented Dec 14, 2013

Ah, thanks for that @jfager; it looks like this python code is referenced from the current code in Python, and is presumably what the C implementation was adapted from.

@jfager
Copy link
Contributor

jfager commented Dec 14, 2013

This algo happens to be one that I implement when I'm picking up a new language, so I've got a Rust version that doesn't need the modifications vec. I've put up a comparison of the two approaches at https://gist.github.com/jfager/7955901; it's about 2x as fast w/o the extra vec.

Still needs the overflow and rounding fixes, though.

@huonw
Copy link
Member

huonw commented Dec 14, 2013

I believe the implementation you took from this PR is actually incorrect, and better comparison would be with the change I suggested here. (e: performing that change still shows a similar performance gap. e2: for larger cases, the constant overhead of the second vector is reduced to make the performance gap about 1.5× (which is still pretty significant), also, using unsafe_set and unsafe_get instead of the checked indexing gives no appreciable difference.)

@jfager
Copy link
Contributor

jfager commented Dec 14, 2013

Updated the gist w/ your suggestion, check it (or feel free to fork it) to make sure I'm understanding right. It's still quite a bit slower than w/o modifications.

@@ -146,9 +147,45 @@ impl Summary {
}

impl<'self> Stats for &'self [f64] {

/// Note: this method sacrifices performance at the altar of accuracy
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This comment needs to add to the declaration of this method in the trait, i.e. there is a line above that says fn sum(self) -> f64; with a short comment above it. This should become:

/// <old comment>
///
/// <this comment>
fn sum(self) -> f64;

so that this text is the second paragraph of the "main" doc-comment.

@g3xzh
Copy link
Contributor Author

g3xzh commented Dec 18, 2013

OK, fixed indentation as @adridu59 has mentioned, and ran "make check".

@huonw, waiting for your feedback too.

😃

let mut j = 0;
// This inner loop applies `hi`/`lo` summation to each
// partial so that the list of partial sums remains exact.
while i < partials.len() {
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This can just be for i in range(0, partials.len()).

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Got so much to learn. Thanks for your review, though.
;-)

@huonw
Copy link
Member

huonw commented Dec 18, 2013

Looks good, however there are still some deficiencies with "extreme" values. In the following example, all print nan (the comment on the right is the correct value):

let nums = [
     &[f64::NAN, 1.0], // nan
     &[f64::INFINITY, 1.0], // inf
     &[1e308, 1e308, -1e308], // intermediate overflow, 1e308
     &[1e308, 1e308], // inf
     &[0.0, f64::NEG_INFINITY]]; // -inf
for v in nums.iter() {
    println!("{}", v.sum());
}

See this comment for a way to fix these; however, I think I'm ok with approving this if you open an issue saying "Stats.sum doesn't handle NaNs, infinities and overflow correctly" with the links from my comment and leave a comment like // FIXME #the_number_of_that_bug handle NaN, inf and overflow at the top of the method implementation.

@huonw
Copy link
Member

huonw commented Dec 18, 2013

Also, for merging, we'll want this a single commit, which you can do by squashing them.

let mut partials : ~[f64] = ~[];

for &mut x in self.iter() {
let mut i = 0;
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Now that you're using range you don't need this separate i variable.

`[1e20, 1.0, -1e20].sum()` returns `0.0`. This happens because during
the summation, `1.0` is too small relative to `1e20`, making it
negligible.

I have tried Kahan summation but it hasn't fixed the problem.
Therefore, I've used Python's `fsum()` implementation with some
help from Jason Fager and Huon Wilson.
For more details, read:
www.cs.cmu.edu/~quake-papers/robust-arithmetic.ps

Moreover, benchmark and unit tests were added.

Note: `Status.sum` is still not fully fixed. It doesn't handle
NaNs, infinities and overflow correctly. See issue 11059:
rust-lang#11059
bors added a commit that referenced this pull request Dec 19, 2013
`[1e20, 1.0, -1e20].sum()` returns `0.0`. This happens because during
the summation, `1.0` is too small relative to `1e20`, making it
negligible.

I have tried Kahan summation but it hasn't fixed the problem.
Therefore, I've used Python's `fsum()` implementation.
For more details, read:
www.cs.cmu.edu/~quake-papers/robust-arithmetic.ps
#10851

Python's fsum (msum)
http://code.activestate.com/recipes/393090/

@huonw, your feedback is more than welcome.
It looks unpolished; Do you have suggestions how to make it more beautiful and elegant?

Thanks in advance,
@bors bors closed this Dec 19, 2013
@bors bors merged commit 05395cb into rust-lang:master Dec 19, 2013
@g3xzh g3xzh deleted the sum_bugfix branch December 25, 2013 00:30
flip1995 pushed a commit to flip1995/rust that referenced this pull request Jun 30, 2023
Ignore more type aliases in unnecessary_cast

Fixes rust-lang#10555

changelog: [`unnecessary_cast`]: No longer lints cast from locals that are type aliases
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants