[FEA] Implement a more accurate float to decimal conversion that supports rounding instead of truncation #16155

ttnghia · 2024-07-01T23:42:18Z

Currently, there is no accurate float-to-decimal conversion. The closest operation is cudf::round, which does not always produce correct results. As a result, we have issues like NVIDIA/spark-rapids#9682 and NVIDIA/spark-rapids#10809.

The new dedicated conversion code in #15905 is supposed to add special handling for float-to-decimal conversion. Unfortunately, it performs truncation instead of rounding. It would be great to add an optional flag to that code that selects either truncation or rounding, depending on the application.
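To illustrate the difference between the two modes, here is a minimal sketch in Python using the standard `decimal` module. It is not the cuDF implementation; the function name `float_to_decimal` and its `rounding` flag are hypothetical, chosen only to mirror the optional flag requested above. The sketch also assumes ties round away from zero (`ROUND_HALF_UP`), which may or may not be the tie-breaking rule a given application needs.

```python
from decimal import Decimal, ROUND_DOWN, ROUND_HALF_UP


def float_to_decimal(value: float, places: int, rounding: bool = False) -> Decimal:
    """Convert a binary float to a decimal with `places` fractional digits.

    Hypothetical helper for illustration only, not a cuDF API.
    """
    # Decimal(float) captures the exact binary value, including the
    # representation error (0.29 is stored as 0.28999999999999998...).
    exact = Decimal(value)
    # Quantum with the target exponent, e.g. places=2 -> Decimal("0.01").
    quantum = Decimal(1).scaleb(-places)
    # ROUND_DOWN truncates toward zero; ROUND_HALF_UP rounds to nearest,
    # with ties going away from zero (an assumption of this sketch).
    mode = ROUND_HALF_UP if rounding else ROUND_DOWN
    return exact.quantize(quantum, rounding=mode)


print(float_to_decimal(0.29, 2))                 # 0.28 (truncation)
print(float_to_decimal(0.29, 2, rounding=True))  # 0.29 (rounding)
```

The value 0.29 cannot be represented exactly in binary floating point and is stored slightly below 0.29, so truncation yields 0.28 while rounding recovers the value the user most likely intended.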

pmattione-nvidia · 2024-07-02T14:04:25Z

I'm working on composing a solution for this. A question in the meantime: if the result under- or overflows, what behavior is needed to match spark-rapids?

E.g. for floating -> decimal, should we return one of 0 / INT_MIN / INT_MAX? For decimal -> floating, do we set 0 / +-inf? Or do we null the field entirely?
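For the floating -> decimal overflow case, the candidate behaviors can be sketched as follows. This is only an illustration of the options being asked about, assuming a decimal backed by a 32-bit integer; the function `store_unscaled` and the policy names are hypothetical, and nothing here claims to be what Spark or cuDF actually does.

```python
from typing import Optional

# Range of a 32-bit signed integer, the assumed backing storage.
INT32_MIN, INT32_MAX = -2**31, 2**31 - 1


def store_unscaled(unscaled: int, policy: str) -> Optional[int]:
    """Fit an unscaled decimal value into 32-bit storage.

    Hypothetical helper: `policy` selects one of the candidate
    overflow behaviors under discussion.
    """
    if INT32_MIN <= unscaled <= INT32_MAX:
        return unscaled  # fits, no special handling needed
    if policy == "saturate":
        # Clamp to the nearest representable bound.
        return INT32_MAX if unscaled > 0 else INT32_MIN
    if policy == "null":
        # Mark the field as null (represented here by None).
        return None
    raise ValueError(f"unknown policy: {policy}")


print(store_unscaled(3_000_000_000, "saturate"))  # 2147483647
print(store_unscaled(3_000_000_000, "null"))      # None
```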

pmattione-nvidia · 2024-07-09T19:20:14Z

The code in the new conversion PR has been modified so that the cuDF-specific code wraps around the core of the conversion algorithm. In spark-rapids-jni, we can similarly wrap around this core to perform the spark-specific rounding that we need.

This cuDF draft PR has the code for the spark-specific rounding that we need, wrapping around the core of the cuDF conversion code. This code is just in cuDF for my ease of testing, and should be migrated to spark-rapids-jni for full integration.

pmattione-nvidia · 2024-07-11T14:40:19Z

The cuDF conversion PR has been merged.

ttnghia added the "feature request" label Jul 1, 2024

github-project-automation bot added this to cuDF/Dask/Numba/UCX Jul 1, 2024

github-project-automation bot moved this to In Progress in cuDF/Dask/Numba/UCX Jul 1, 2024

ttnghia changed the title ~~[FEA] Implement a more accurate float to decimal conversion that support rounding instead of flooring~~ [FEA] Implement a more accurate float to decimal conversion that supports rounding instead of truncation Jul 1, 2024
