Use `f64` instead of `usize` for fragment widths #421

mgeisler · 2022-01-02T01:09:35Z

This changes the type used for internal width computations in the wrap algorithms. Before, we used usize to represent the fragment widths and for the line widths. This could make the optimal-fit wrapping algorithm overflow when it tries to compute the optimal wrapping cost. The problem is that the algorithm computes a cost using integer values formed by

(line_width - target_width)**2

When line_width is near usize::MAX, this computation can easily overflow.

By using an f64 for the cost computation, we achieve two things:

A much larger range for the cost computation: f64::MAX is about 1.8e308 whereas u64::MAX is only 1.8e19. Computing the cost with a fragment width in the range of u64, will thus not exceed 3e38, something which is easily represented with a f64. This means that wrapping fragments derived from a &str cannot overflow.

Overflows can still be triggered when fragments with extreme proportions are formed directly. The boundary seems to be around 1e170 with fragment widths above this limit triggering overflows.
Applications which wrap text using proportional fonts will already be operating with widths measured in floating point units. Using such units internally makes life easier for such applications, as shown by the changes in the Wasm demo.

Fixes #247
Fixes #416

The optimization problem solved by the optimal-fit algorithm is fundamentally a minimization problem. It is therefore not sensible to allow negative penalties since all penalties are there to discourage certain features: * `nline_penalty` discourages breaks with more lines than necessary, * `overflow_penalty` discourages lines longer than the line width, * `short_last_line_penalty` discourages short last lines, * `hyphen_penalty` discourages hyphenation Making this change surfaces the overflow bug behind #247 and #416. This will be fixed next via #421 and this commit can be seen as a way of simplifying that PR.

This changes the type used for internal width computations in the wrap algorithms. Before, we used `usize` to represent the fragment widths and for the line widths. This could make the optimal-fit wrapping algorithm overflow when it tries to compute the optimal wrapping cost. The problem is that the algorithm computes a cost using integer values formed by (line_width - target_width)**2 When `line_width` is near `usize::MAX`, this computation can easily overflow. By using an `f64` for the cost computation, we achieve two things: * A much larger range for the cost computation: `f64::MAX` is about 1.8e308 whereas `u64::MAX` is only 1.8e19. Computing the cost with a fragment width in the range of `u64`, will thus not exceed 3e38, something which is easily represented with a `f64`. This means that wrapping fragments derived from a `&str` cannot overflow. Overflows can still be triggered when fragments with extreme proportions are formed directly. The boundary seems to be around 1e170 with fragment widths above this limit triggering overflows. * Applications which wrap text using proportional fonts will already be operating with widths measured in floating point units. Using such units internally makes life easier for such applications, as shown by the changes in the Wasm demo. Fixes #247 Fixes #416

This tests the wrapping using fragments with widths which could come from a &str.

This changes the panics in `wrap_optimal_fit` to a `Result` type, allowing clients to catch them.

Following the advice from [1], this PR updates Cargo.toml to use precise version numbers for all dependencies. The latest versions at the time of writing are used. It turns out that we made precisely the mistake mentioned in the post: ``` % cargo -Z minimal-versions update % cargo test ``` failed on nightly because of the dependency on smawk 0.3: we need 0.3.1 after #421. [1]: https://users.rust-lang.org/t/psa-please-specify-precise-dependency-versions-in-cargo-toml/71277

mgeisler force-pushed the f64-fragment-widths branch 2 times, most recently from 6948a3d to 023bb76 Compare January 3, 2022 01:09

This was referenced Jan 3, 2022

Prevent overflows by using u32 instead of usize to represent the line width #420

Closed

Text is wrapped at small and inconsistent widths when attempting to wrap at a very large width. #416

Closed

Integer size and overflow when calculating penalty #247

Closed

mgeisler mentioned this pull request Jan 9, 2022

Change penalties to non-negative numbers #424

Merged

mgeisler force-pushed the f64-fragment-widths branch 2 times, most recently from dbcc245 to 432f20d Compare January 9, 2022 10:37

mgeisler added 3 commits January 9, 2022 11:44

Add optimal-fit fuzz test which uses integers

5da12a7

This tests the wrapping using fragments with widths which could come from a &str.

Introduce OverflowError

d380b95

This changes the panics in `wrap_optimal_fit` to a `Result` type, allowing clients to catch them.

mgeisler force-pushed the f64-fragment-widths branch from 432f20d to d380b95 Compare January 9, 2022 10:55

mgeisler merged commit 0f2183e into master Jan 9, 2022

mgeisler deleted the f64-fragment-widths branch January 9, 2022 11:06

This was referenced Jan 9, 2022

wrap_optimal_fit() - Checked arithmetic #392

Closed

Switch to f64 for computing penalties in wrap_optimal_fit #289

Closed

Handle overflows in wrap_optimal_fit by divide-and-conquer #259

Closed

mgeisler mentioned this pull request Feb 5, 2022

Use precise dependency versions in Cargo.toml #432

Merged

mgeisler mentioned this pull request Feb 15, 2022

add fill_optimal_fit fuzz target to CI #400

Closed

mgeisler mentioned this pull request Feb 26, 2022

Make type used for line width computations generic #327

Closed

github-actions bot mentioned this pull request Feb 27, 2022

Release 0.15.0 #443

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use `f64` instead of `usize` for fragment widths #421

Use `f64` instead of `usize` for fragment widths #421

mgeisler commented Jan 2, 2022 •

edited

Loading

Use f64 instead of usize for fragment widths #421

Use f64 instead of usize for fragment widths #421

Conversation

mgeisler commented Jan 2, 2022 • edited Loading

Use `f64` instead of `usize` for fragment widths #421

Use `f64` instead of `usize` for fragment widths #421

mgeisler commented Jan 2, 2022 •

edited

Loading