Investigate the Ryū algorithm for a simpler/faster implementation of float -> string conversion #52811

bstrie · 2018-07-28T18:41:37Z

There's a new paper making the rounds on the topic of converting floats to their decimal string representations that claims to be both simpler and faster than prior algorithms: https://pldi18.sigplan.org/event/pldi-2018-papers-ry-fast-float-to-string-conversion . I'm particularly interested in the simplicity aspect, since I recall some old conversations regarding our current machinery for this being somewhat subtle and complicated (for speed purposes, I imagine). If we could drastically simplify the implementation without sacrificing speed, that might be a win. Good student or intern project, methinks.

(Apologies for how wishlisty this issue is, I've no clue who might be the presiding expert on our current float->string implementation.)

Mark-Simulacrum · 2018-07-28T18:45:28Z

cc @dtolnay in case this is of interest for serde

eddyb · 2018-07-28T18:51:11Z

cc @lifthrasiir @rkruppe

varkor · 2018-07-28T19:21:01Z

The repository can be found at https://github.com/ulfjack/ryu and the paper at https://dl.acm.org/citation.cfm?id=3192369.

dtolnay · 2018-07-29T01:06:46Z

I did a line-by-line translation of the C implementation to unsafe Rust in https://github.com/dtolnay/ryu. It performs better than dtoa on some inputs but the output is less human readable. For example dtoa and serde_json prefer to print 0.3 as 0.3, while Ryū currently prints it as 3E-1.

Input f64	dtoa	Ryū
0.1234	47 ns	66 ns
2.718281828459045	64 ns	40 ns
1.7976931348623157e308	73 ns	42 ns

Benchmark commit: dtolnay/dtoa@655152b

matthieu-m · 2018-07-29T10:57:18Z

@dtolnay : I would imagine that changing the output format would be trivial, no?

On the other hand, the fact that it performs worse on 0.1234 (+40%) is a tad concerning. I know that in some application domains the numbers are decimals by nature, and only floating point representation for performance reasons. If Ryu performs worse on numbers with a small number of digits, then dtoa could be preferable.

dtolnay · 2018-07-29T16:12:58Z

As I understand it, the fact that short numbers perform worse is an inherent aspect of the algorithm. Dtoa goes left to right and avoids generating too many noise digits, so the performance is better on short numbers as reflected in the table. Ryū generates an exact representation and then goes right to left removing noise digits, so the performance is better on the longest numbers.

The Ryū benchmarks all focus on a random distribution of numbers, and those have on average a lot of digits.

matthieu-m · 2018-07-29T16:16:04Z

As I understand it, the fact that short numbers perform worse is an inherent aspect of the algorithm. Dtoa goes left to right and avoids generating too many noise digits, so the performance is better on short numbers as reflected in the table. Ryū generates an exact representation and then goes right to left removing noise digits, so the performance is better on the longest numbers.

Thank you for confirming my suspicion.

The Ryū benchmarks all focus on a random distribution of numbers, and those have on average a lot of digits.

I would guess that whether the number of digits is random, or not, varies by application. This makes it quite difficult to pick one algorithm or the other.

dzamlo · 2018-07-29T18:00:52Z

Ryū is seems to be slower in a few case versus dtoa, but it seems to always be faster than the std implementation (which use grisu3 and dragon as a fallback if I'm not mistaken).

What trade-off does dtoa makes to be faster than std? And wouldn't it be fairer to compare Ryū with the std implementation ? As far as I understand Ryū should produce the same output as the std implementation.

prefer to print 0.3 as 0.3, while Ryū currently prints it as 3E-1

From what I understand, both Ryū and the std implementation compute the digits to print and the 10 exponent. And from that you can produce a string representation. It just happen that the implementation of the second part provided in the Ryū make a different choice, but using what we have now for that is very likely to be trivial.

nagisa · 2018-07-30T19:40:18Z

Retagging as T-libs, T-compiler hardly has anything to do with libstd matters.

dtolnay · 2018-08-14T17:35:11Z

I switched from dtoa (Grisu2) to the ryu crate in serde_json for a nice improvement in benchmarks.

steveklabnik · 2019-10-15T13:00:19Z

Triage: not aware of any changes here

matthieu-m · 2019-10-15T16:06:03Z

Note: at CppCon19, STL presented the culmination of his work on C++17 <charconv> for MSVC, resulting in Ryu being used for to_chars. He also mentioned that he had submitted a number of small performance improvement to Ulf's repository, and thought there were still possibilities to fine-tune the implementation.

wooster0 · 2021-07-16T17:12:33Z

Has the Dragonbox algorithm ever been discussed? https://github.com/jk-jeon/dragonbox/
From https://github.com/jk-jeon/dragonbox/#performance:

In my machine (Intel Core i7-7700HQ 2.80GHz, Windows 10), it defeats or is on par with other contemporary algorithms including Grisu-Exact, Ryu, and Schubfach.

It looks like if the float to string conversion algorithm is going to be rewritten, the Dragonbox algorithm might be an even better candidate than Ryu.

blueglyph · 2022-10-14T07:10:47Z

Has the Dragonbox algorithm ever been discussed? https://github.com/jk-jeon/dragonbox/ From https://github.com/jk-jeon/dragonbox/#performance:

In my machine (Intel Core i7-7700HQ 2.80GHz, Windows 10), it defeats or is on par with other contemporary algorithms including Grisu-Exact, Ryu, and Schubfach.

It looks like if the float to string conversion algorithm is going to be rewritten, the Dragonbox algorithm might be an even better candidate than Ryu.

I was wondering the same. Maybe it's a code size issue? EDIT: it's fairly recent too. Was there an intention to rewrite the ftl2dec string conversion code?

Here is a paper with a more detailed explanation of the algorithm.
Dragonbox: A New Floating-Point Binary-to-Decimal Conversion Algorithm

The current Grisu3 algorithm implementation seems to have issues, but it doesn't look like a fix has been proposed. Still, it has to fallback on Dragon4 sometimes, I don't know if DragonBox would avoid this altogether. Maybe I'll check it out, though I'm sure the developers have already thought about other algorithms.

This lib provides a few pointers and implementations for the different algorithms, as a complement to the link provided by @r00ster91 (which provides detailed performance graphs): https://github.com/abolz/Drachennest

kyrias · 2023-08-18T13:01:58Z

In addition to being faster, it would also be helpful on embedded systems from a stack perspective as the current implementation uses over 1K of stack which leads to formatting strings with floats on embedded is a common source for stack overflows. Meanwhile dtolnay's Ryū implementation only requires a 24 byte buffer, and doesn't look to require significantly more for local variables.

bstrie added the T-compiler Relevant to the compiler team, which will review and decide on the PR/issue. label Jul 28, 2018

dtolnay mentioned this issue Jul 28, 2018

Investigate switching to Ryū algorithm dtolnay/dtoa#14

Closed

bmahler mentioned this issue Jul 31, 2018

Consider adopting Ryu for faster floating point to string conversion. Tencent/rapidjson#1332

Open

plokhotnyuk mentioned this issue May 25, 2019

Missing loop in two-digit optimization? ulfjack/ryu#85

Open

varkor mentioned this issue Jan 20, 2020

Float parsing can fail on valid float literals #31407

Closed

timotheecour mentioned this issue Feb 10, 2020

notes / questions disruptek/ryu#1

Open

Tropid mentioned this issue Jun 14, 2020

Use correct exponential representation for rational values boa-dev/boa#493

Merged

bstrie mentioned this issue Jul 17, 2021

Update Rust Float-Parsing Algorithms to use the Eisel-Lemire algorithm. #86761

Merged

lnicola mentioned this issue Sep 25, 2024

Use ryu for faster float-to-string conversion georust/wkt#118

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Investigate the Ryū algorithm for a simpler/faster implementation of float -> string conversion #52811

Investigate the Ryū algorithm for a simpler/faster implementation of float -> string conversion #52811

bstrie commented Jul 28, 2018

Mark-Simulacrum commented Jul 28, 2018

eddyb commented Jul 28, 2018

varkor commented Jul 28, 2018

dtolnay commented Jul 29, 2018

matthieu-m commented Jul 29, 2018

dtolnay commented Jul 29, 2018

matthieu-m commented Jul 29, 2018

dzamlo commented Jul 29, 2018

nagisa commented Jul 30, 2018

dtolnay commented Aug 14, 2018

steveklabnik commented Oct 15, 2019

matthieu-m commented Oct 15, 2019

wooster0 commented Jul 16, 2021 •

edited

Loading

blueglyph commented Oct 14, 2022 •

edited

Loading

kyrias commented Aug 18, 2023

Investigate the Ryū algorithm for a simpler/faster implementation of float -> string conversion #52811

Investigate the Ryū algorithm for a simpler/faster implementation of float -> string conversion #52811

Comments

bstrie commented Jul 28, 2018

Mark-Simulacrum commented Jul 28, 2018

eddyb commented Jul 28, 2018

varkor commented Jul 28, 2018

dtolnay commented Jul 29, 2018

matthieu-m commented Jul 29, 2018

dtolnay commented Jul 29, 2018

matthieu-m commented Jul 29, 2018

dzamlo commented Jul 29, 2018

nagisa commented Jul 30, 2018

dtolnay commented Aug 14, 2018

steveklabnik commented Oct 15, 2019

matthieu-m commented Oct 15, 2019

wooster0 commented Jul 16, 2021 • edited Loading

blueglyph commented Oct 14, 2022 • edited Loading

kyrias commented Aug 18, 2023

wooster0 commented Jul 16, 2021 •

edited

Loading

blueglyph commented Oct 14, 2022 •

edited

Loading