Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Switch the Standard distribution for floats to [0, 1) #420

Merged
merged 2 commits into from
May 3, 2018

Conversation

pitdicker
Copy link
Contributor

See the discussion in #416. cc @sbarral

This moves the current Standard distribution for generating floats to Open01.
Standard instead samples from the closed-open interval [0, 1) using a multiply-based method.

I did not change the Ziggurat and Uniform implementations, they continue to use the transmute-based methods. An implementation of Uniform optimized for the other method would look quite different.

I hope just those two distributions, Standard and Open01, are good enough for the 0.5 release. I am not convinced adding an explicit ClosedOpen01 adds much benefit, but it can easily be added in the future. I am also not convinced on the usefullness of OpenClosed01, but added the suggestion of using 1.0 - rng.gen() in the documentation.

Copy link
Member

@dhardy dhardy left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good!

I still think there's some benefit to explicit OpenClosed01 and ClosedOpen01 distributions but I don't know whether there is sufficient justification to add them.

/// The `Open01` distribution provides an alternative: it generates values in
/// the open interval (0, 1), with one less random bit. It uses the slightly
/// faster transmute-based approach for the conversion to a floating point
/// value.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

"Slightly faster" isn't really correct; I just benchmarked and found no difference. You could however say (which may be faster on some architectures).

/// ```
/// use rand::{thread_rng, Rng};
/// `Rng::gen_range(0, 1)` also uses the faster transmute-based method, but
/// produces values in a half-open interval just like `Standard`.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Just remove 'faster' here?

/// ```
/// As a final alternative, if you require the floating point value to never be
/// exactly `0.0`: consider using `1.0 - rng.gen()`, which transforms the
/// half-open `Standard` distribution to the `(0, 1]` interval.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I would prefer to assume any users who care know what they want:

If you wish to sample from the (0, 1] half-open interval consider using 1.0 - rng.gen().

Looking at the code, I couldn't actually think of an alternative which samples with this precision and doesn't use an extra op (though of course one could add ε/2 or 1 before conversion to float).

Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looking at the code, I couldn't actually think of an alternative which samples with this precision and doesn't use an extra op (though of course one could add ε/2 or 1 before conversion to float).

Explicit OpenClosed01 does have advantages though:

  • the intent is clearer and the code less cluttered,
  • the +1 integer addition before conversion is faster than the 1.0 - u user-side workaround

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I agree with you. There is also some benefit to an explicit ClosedOpen01, even if just for local documentation (this value is strictly less than 1).

Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, this is also something that hit me when looking at the diff: if not for @pitdicker's perceptivity, subtle bugs would have been introduced in Ziggurat and Gamma where rng.gen() was previously used. When I read rng.gen() it is always unclear to me if the author of the code did so for the convenient notation or whether the code actually relies on the guarantees regarding the 0.0 and 1.0 bounds.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I suppose that makes for a good argument.

In this case I had it easy, as I also was the one introducing the ambiguity in #274 😄.

@pitdicker pitdicker force-pushed the closed_open_float_conversion branch from 53e8ebd to caaaf0a Compare May 3, 2018 10:48
@pitdicker
Copy link
Contributor Author

pitdicker commented May 3, 2018

Changed the documentation as suggested and added back the benchmarks.

I am not against OpenClosed01 and ClosedOpen01, but I don't see them as something very useful. Is it ok to merge this PR, and I leave implementing those two to someone who cares more for them? This PR is tackling a breaking change, the other two are only additions.

@dhardy
Copy link
Member

dhardy commented May 3, 2018

@pitdicker can you pull my commit? I added OpenClosed01 but not the other one (partly because a completely redundant distribution may cause confusion as to which to use).

I think I'm ready to merge after that.

And CI works again! 🎉

@pitdicker
Copy link
Contributor Author

Looks good!

@dhardy dhardy merged commit 0950203 into rust-random:master May 3, 2018
@pitdicker pitdicker deleted the closed_open_float_conversion branch May 3, 2018 15:00
@vks
Copy link
Collaborator

vks commented May 3, 2018

Does it make sense to have some alias ClosedOpen01? This is more explicit (documenting intent) and makes code more forward compatible in case we change Standard again.

@dhardy
Copy link
Member

dhardy commented May 3, 2018

Yes and no: it is more explicit, but redundant and not very useful. I don't think we're likely to move Standard away from [0, 1) after all this discussion.

If you really want to see it open a PR, but I'm not sure there's sufficient motive.

@vks
Copy link
Collaborator

vks commented May 3, 2018

I would prefer

let r = rng.sample(ClosedOpen01);

over

let r = rng.sample(Standard);  // Can never be 1.0

but I can't think of a use case. (It would be different if Standard was like OpenClosed01.) If that ever changes, I will open a new issue/PR.

benjamin-lieser pushed a commit to benjamin-lieser/rand that referenced this pull request Feb 5, 2025
…onversion

Switch the `Standard` distribution for floats to [0, 1)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants