-
Notifications
You must be signed in to change notification settings - Fork 431
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add distribution plots to rand_distr
documentation
#1434
Add distribution plots to rand_distr
documentation
#1434
Conversation
Need to look into it more, but maybe whatever solution we decide on here could also be used for #278 |
Not an issue: you already feature-gated it.
Good to have in the same repo, but it doesn't need to be in the generated crate package. Could you try investigating why this happens? We already have an explicit
We don't package any of these files except by embedding, so we don't need an Or, simpler: forget Bernoulli (doesn't really need it) and put all the rest under PlotsBernoulliX-axis values should be false, true; not 0, 1. BinomialCan we add a second parameterisation, e.g. n=6, p=0.5? Cauchy👍 (straight copy from WP, I see) |
I think SVG would be a better fit (separate files, not embedded into HTML). You should be able to generate the same plots with Python without issues. |
@newpavlov you missed the discussions in #131 (specifically, how to make this work on docs.rs). SVG appears larger in some cases, smaller in others. It does scale better. |
Ah, I misread the discussion and thought it was about embedding SVG into HTML. I don't think the terrible diff objection is important, plus we probably should use compression either way (i.e. use SVGZ instead of plain SVG). Though there may be some issues with browser support of SVGZ files. In RustCrypto we use links to images which reside in the RustCrypto/media repository, e.g. see docs for the |
I think it may be because the folder was named
Got it. That also removes the need for the added dependency on
I went ahead and added n=10, p=0.6 because I think it looks better next to n=10, p=0.2. If you think having a demonstration of a different
I prefer SVG as well. However, I just did the switch and the diff count exploded to +11,000. Wowza. |
The Python script folder should probably be explicitly excluded from packaged crate (check if it gets included by running It also may be worth to consider the separate repo approach described above. It will be a simpler solution than relying on |
Ah, thanks for the tip! According to this
Certainly. I'm not attached to embedding them if we find a better solution. |
Diffs normally ignore binary files; this is just because SVG is text based? If so, switching to SVGZ may avoid this?
Aha!
The only real issue is versioning. I see usage of commit hashes in the URLs that There is also the minor issue that repo clones don't include all docs, but we already have that with the book. The other issue is that anyone trying to make themselves a copy of the docs for offline usage likely won't have the plots, but that's not critical. Considering these plots aren't directly tied to the Rust code in any way, a separate repo probably is the way to go. |
Yes, SVGZ is better in this regard. However embed-doc-image doesn't support it.
Okay then! Want to create it? |
|
Can you authorize me to push in that repo, or should I make a fork? |
@MichaelOwenDyer you should already have admin privileges on that repo. |
Hm, strange, I am getting a 403 whenever I try to push. Edit: Nevermind, I logged in using a GitHub token and then it worked. |
We can tell Git via attributes to treat the files as binary if necessary. |
# Conflicts: # rand_distr/src/geometric.rs # rand_distr/src/hypergeometric.rs
The way Wikipedia puts it is that there are two definitions of the distribution. The code and the PMF count the number of failures, while your plot and description count until the first success. |
That's presumably an error in the documentation since the code just uses |
Okay, yes I saw that too. I will adjust the x-axis of the plot to start at 0.
I'll fix it then 🤠 |
Another thing I noticed: We are using Also, should I make an effort to embed some more links to Wikipedia across the documentation? Some already exist. |
@vks implemented this (#1136). @vks?
WP is a useful resource; I guess this would be convenient for some people but not important. |
@MichaelOwenDyer Can you be more specific? The notation and naming are not very consistent across the literature. For example, what we call Zeta is sometimes called Zipf, or vice versa. |
@vks I'm referring to the struct at |
I guess the I'm fine with changing it to |
Okay. We may want to do that in a different PR since ZetaError::ATooSmall is pub, and I believe renaming that would be a breaking change. |
@MichaelOwenDyer I think this just needs a |
Thanks for the nudge, I've had a lot on my plate recently. Will finalize this today |
- Change parameter of Gumbel from σ to β - Skew normal now uses the correct symbols ξ, ω, α - Other tweaks
Okay @dhardy, I've made my final changes. I added Wikipedia links to the documentation and tried to make it look as consistent as possible. I would appreciate you taking one last look at it and if it looks good to you then I would say its ready to merge :) |
rand_distr/src/pareto.rs
Outdated
//! The Pareto distribution. | ||
//! The Pareto distribution `Pareto(α, xₘ)`. | ||
|
||
use crate::{Distribution, OpenClosed01}; | ||
use core::fmt; | ||
use num_traits::Float; | ||
use rand::Rng; | ||
|
||
/// The Pareto distribution `Pareto(scale, shape)`. | ||
/// The Pareto distribution `Pareto(α, xₘ)`. | ||
/// | ||
/// The Pareto distribution is a continuous probability distribution with | ||
/// parameters `scale` (`α`) and `shape` (`x`<sub>`m`</sub> or `k`). | ||
/// parameters `scale` (`α`) and `shape` (`xₘ` or `k`). | ||
/// | ||
/// # Plot | ||
/// | ||
/// The following plot shows the Pareto distribution with various values of | ||
/// `scale` and `shape`. | ||
/// Note how the scale parameter `α` corresponds to the height of the jump | ||
/// in density at `x = x`<sub>`m`</sub>, and to the rate of decay in the tail. | ||
/// in density at `x = xₘ`, and to the rate of decay in the tail. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think the parameter names got confused somewhere — looks like α
is shape
. Also, match argument order of the constructor: scale
comes first.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Okay, I think I fixed it; please take another look. The plot still lists them in the other order (α, xₘ) but I guess that's okay.
Okay, I'm going to go ahead and merge this :D |
CHANGELOG.md
entrySummary
This PR adds some graphs to the documentation of different distributions in
rand_distr
to illustrate their behavior.Motivation
Make the distributions in this crate easier to understand via visual aids.
Details
The diagrams will be hosted in https://github.com/rust-random/charts (see this PR) and links will be inserted into the documentation.
Closes #131.
Documentation Progress:
binomial::Binomial
cauchy::Cauchy
dirichlet::Dirichlet
exponential::Exp
exponential::Exp1
frechet::Frechet
gamma::Gamma
gamma::ChiSquared
gamma::FisherF
gamma::StudentT
gamma::Beta
geometric::Geometric
geometric::StandardGeometric
gumbel::Gumbel
hypergeometric::Hypergeometric
inverse_gaussian::InverseGaussian
normal::Normal
normal::StandardNormal
normal::LogNormal
normal_inverse_gaussian::NormalInverseGaussian
pareto::Pareto
pert::Pert
poisson::Poisson
skew_normal::SkewNormal
triangular::Triangular
unit_ball::UnitBall
unit_circle::UnitCircle
unit_disc::UnitDisc
unit_sphere::UnitSphere
weibull::Weibull
zipf::Zeta
zipf::Zipf