-
Notifications
You must be signed in to change notification settings - Fork 13.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix sum()
accuracy
#10927
Fix sum()
accuracy
#10927
Conversation
for &(i1, f1) in modif_tuple.iter() { | ||
partials[i1] = f1; | ||
} | ||
if (partials.len() > i as uint) { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
These parens are not necessary.
I am not sure if the benchmark tests are good enough. |
fn sum(self) -> f64 { | ||
self.iter().fold(0.0, |p,q| p + *q) | ||
/* | ||
* Note: this method sacrifices performance at the altar of accuracy |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This should be a doc comment on the method definition in the trait, otherwise, unfortunately, it's not visible in the docs. That is, something like,
/// The sum of the samples.
///
/// Note: this method sacrifices performance at the altar of
/// accuracy. Depends... [etc.]
fn sum(self) -> f64;
(Also, we might as well cite this properly to help guard against the links going dead (i.e. still make it possible to find the paper), e.g. like I have been doing in std::rand
, so something like Shewchuk J.R., ["Adaptive Precision Floating-Point Arithmetic and Fast Robust Geometric Predicates"](http://www.cs.cmu.edu/~quake-papers/robust-arithmetic.ps) *Discrete & Computational Geometry 18*, 3 (Oct 1997), 305-363.
(the page I'm basing this off, feel free to use some other citation format.))
Looks pretty good; address my comments, and rebase on top of master and we should be good to go! |
For reference, here is how this got added to Python: http://bugs.python.org/issue2819. Their final C implementation handles subtleties with overflow and rounding that direct translation of Raymond Hettinger's python code misses. |
Ah, thanks for that @jfager; it looks like this python code is referenced from the current code in Python, and is presumably what the C implementation was adapted from. |
This algo happens to be one that I implement when I'm picking up a new language, so I've got a Rust version that doesn't need the Still needs the overflow and rounding fixes, though. |
I believe the implementation you took from this PR is actually incorrect, and better comparison would be with the change I suggested here. (e: performing that change still shows a similar performance gap. e2: for larger cases, the constant overhead of the second vector is reduced to make the performance gap about 1.5× (which is still pretty significant), also, using |
Updated the gist w/ your suggestion, check it (or feel free to fork it) to make sure I'm understanding right. It's still quite a bit slower than w/o |
@@ -146,9 +147,45 @@ impl Summary { | |||
} | |||
|
|||
impl<'self> Stats for &'self [f64] { | |||
|
|||
/// Note: this method sacrifices performance at the altar of accuracy |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This comment needs to add to the declaration of this method in the trait, i.e. there is a line above that says fn sum(self) -> f64;
with a short comment above it. This should become:
/// <old comment>
///
/// <this comment>
fn sum(self) -> f64;
so that this text is the second paragraph of the "main" doc-comment.
OK, fixed indentation as @adridu59 has mentioned, and ran "make check". @huonw, waiting for your feedback too. 😃 |
let mut j = 0; | ||
// This inner loop applies `hi`/`lo` summation to each | ||
// partial so that the list of partial sums remains exact. | ||
while i < partials.len() { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This can just be for i in range(0, partials.len())
.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Got so much to learn. Thanks for your review, though.
;-)
Looks good, however there are still some deficiencies with "extreme" values. In the following example, all print let nums = [
&[f64::NAN, 1.0], // nan
&[f64::INFINITY, 1.0], // inf
&[1e308, 1e308, -1e308], // intermediate overflow, 1e308
&[1e308, 1e308], // inf
&[0.0, f64::NEG_INFINITY]]; // -inf
for v in nums.iter() {
println!("{}", v.sum());
} See this comment for a way to fix these; however, I think I'm ok with approving this if you open an issue saying " |
Also, for merging, we'll want this a single commit, which you can do by squashing them. |
let mut partials : ~[f64] = ~[]; | ||
|
||
for &mut x in self.iter() { | ||
let mut i = 0; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Now that you're using range
you don't need this separate i
variable.
`[1e20, 1.0, -1e20].sum()` returns `0.0`. This happens because during the summation, `1.0` is too small relative to `1e20`, making it negligible. I have tried Kahan summation but it hasn't fixed the problem. Therefore, I've used Python's `fsum()` implementation with some help from Jason Fager and Huon Wilson. For more details, read: www.cs.cmu.edu/~quake-papers/robust-arithmetic.ps Moreover, benchmark and unit tests were added. Note: `Status.sum` is still not fully fixed. It doesn't handle NaNs, infinities and overflow correctly. See issue 11059: rust-lang#11059
`[1e20, 1.0, -1e20].sum()` returns `0.0`. This happens because during the summation, `1.0` is too small relative to `1e20`, making it negligible. I have tried Kahan summation but it hasn't fixed the problem. Therefore, I've used Python's `fsum()` implementation. For more details, read: www.cs.cmu.edu/~quake-papers/robust-arithmetic.ps #10851 Python's fsum (msum) http://code.activestate.com/recipes/393090/ @huonw, your feedback is more than welcome. It looks unpolished; Do you have suggestions how to make it more beautiful and elegant? Thanks in advance,
Ignore more type aliases in unnecessary_cast Fixes rust-lang#10555 changelog: [`unnecessary_cast`]: No longer lints cast from locals that are type aliases
[1e20, 1.0, -1e20].sum()
returns0.0
. This happens because duringthe summation,
1.0
is too small relative to1e20
, making itnegligible.
I have tried Kahan summation but it hasn't fixed the problem.
Therefore, I've used Python's
fsum()
implementation.For more details, read:
www.cs.cmu.edu/~quake-papers/robust-arithmetic.ps
#10851
Python's fsum (msum)
http://code.activestate.com/recipes/393090/
@huonw, your feedback is more than welcome.
It looks unpolished; Do you have suggestions how to make it more beautiful and elegant?
Thanks in advance,