`lexical` and `fast-float` might soon not be needed. #1010

ghuls · 2021-07-20T11:41:18Z

Describe your feature request

lexical and fast-float might soon not be needed anymore as fast-float like algorithm is merged in the standard library.

rust-lang/rust#86761
https://www.reddit.com/r/rust/comments/omelz4/making_rust_float_parsing_fast_libcore_edition/

The text was updated successfully, but these errors were encountered:

ritchie46 · 2021-07-20T16:21:32Z

I saw! 🙂

We still need lexical for the integer parsing, but fast-float is out.

Alexhuszagh · 2021-07-21T15:31:02Z

I saw! slightly_smiling_face

We still need lexical for the integer parsing, but fast-float is out.

I'll be working on making lexical lighter for integer parsing (as in, using workspaces for each different component). So compile times should decrease. And, ideally, bring those back into Rust core as well.

ritchie46 · 2021-07-22T08:36:28Z

I'll be working on making lexical lighter for integer parsing (as in, using workspaces for each different component). So compile times should decrease. And, ideally, bring those back into Rust core as well.

Very nice. Great work on the float parsing. 🚀

Alexhuszagh · 2021-09-06T16:07:38Z

Just an FYI: the optimized versions of the new integer and float-parsers have been implemented as of v0.8, and the API is identical. However, it does require a fairly recent Rust compiler (1.51.0), due to the requirement of const generics. I'm currently also trying to integrate the further improvements back into Rust core now.

ritchie46 · 2021-09-06T17:27:52Z

Just an FYI: the optimized versions of the new integer and float-parsers have been implemented as of v0.8, and the API is identical. However, it does require a fairly recent Rust compiler (1.51.0), due to the requirement of const generics. I'm currently also trying to integrate the further improvements back into Rust core now.

Great! I will update! Thank you for your great work on lexical.
I don't know the ins and outs, so I have a few questions:

How does lexical float parsing now compares to fast-float.
Is the lexical float parsing algorithm the one that gets embedding in rust std?

Alexhuszagh · 2021-09-06T18:17:02Z

Just an FYI: the optimized versions of the new integer and float-parsers have been implemented as of v0.8, and the API is identical. However, it does require a fairly recent Rust compiler (1.51.0), due to the requirement of const generics. I'm currently also trying to integrate the further improvements back into Rust core now.

Great! I will update! Thank you for your great work on lexical.
I don't know the ins and outs, so I have a few questions:
* How does lexical float parsing now compares to `fast-float`.

They're practically identical, except for rare cases. The extensive benchmarks compared to rust std can be found here, and the results for rust std are practically identical to fast-float-rust.

* Is the lexical float parsing algorithm the one that gets embedding in rust std.

Yes it is, except for the slow algorithm. In fact, I currently have issues (in core and fast-float-rust) and will be writing PRs (and an upstream PR for the reference C++ implementation). This can be quite a difference in performance for very rare cases, but can have impacts on a few real-world datasets (as shown below, in mesh).

So short term, they're similar, but lexical has a few optimizations the others don't have. Long-term, they will ideally be identical, because I want everyone to benefit. If you want a detailed explanation of what lexical does currently that the other two don't do right now, read the issue I've opened in core. Hopefully, this will be integrated shortly.

Detailed Benchmarks

The most relevant result is this, which benchmarks against a few real-world datasets:

Canada is practically identical, while for earth and most important mesh, lexical is faster. The datasets can be found here:

The biggest difference is in near-halfway cases, otherwise, the performance is nearly identical. The near-halfway cases with differing digit counts are as follows. Note that for contrived, the performance of halfway and moderate would be identical with fast-float-rust and lexical, just due to less inlining, core is slightly slower. This has no impact on any real-world dataset, however.

These are obviously contrived cases, but meant to demonstrate worst-case scenarios and performance of specific algorithms.

ghuls mentioned this issue Sep 14, 2021

fast-float can be removed when using ezrosent/frawk#72

Closed

matteosantama mentioned this issue Aug 21, 2022

Housekeeping: Issues that can be closed #4519

Closed

ghuls closed this as completed Aug 22, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`lexical` and `fast-float` might soon not be needed. #1010

`lexical` and `fast-float` might soon not be needed. #1010

ghuls commented Jul 20, 2021

ritchie46 commented Jul 20, 2021

Alexhuszagh commented Jul 21, 2021 •

edited

Loading

ritchie46 commented Jul 22, 2021

Alexhuszagh commented Sep 6, 2021

ritchie46 commented Sep 6, 2021

Alexhuszagh commented Sep 6, 2021 •

edited

Loading

lexical and fast-float might soon not be needed. #1010

lexical and fast-float might soon not be needed. #1010

Comments

ghuls commented Jul 20, 2021

Describe your feature request

ritchie46 commented Jul 20, 2021

Alexhuszagh commented Jul 21, 2021 • edited Loading

ritchie46 commented Jul 22, 2021

Alexhuszagh commented Sep 6, 2021

ritchie46 commented Sep 6, 2021

Alexhuszagh commented Sep 6, 2021 • edited Loading

Detailed Benchmarks

`lexical` and `fast-float` might soon not be needed. #1010

`lexical` and `fast-float` might soon not be needed. #1010

Alexhuszagh commented Jul 21, 2021 •

edited

Loading

Alexhuszagh commented Sep 6, 2021 •

edited

Loading