Optimise decimal casting for infallible conversions #7021

aweltsch · 2025-01-25T14:15:17Z

Which issue does this PR close?

Closes Optimise Decimal Casting #6877.

Rationale for this change

As mentioned in the issue there are certain conversions for which we don't need to check precision / overflow and can thus improve performance by using the infallible unary kernel.

Explanation of the optimisation:

Case 1, increasing scale:

Increasing the scale will result in multiplication with powers of 10, thus being equivalent to a "shift left" of the digits.
Every number will thus "gain" additional digits. The operation is safe when the output type gives enough precision (i.e. digits) to contain the original digits, as well as the digits gained.
This can be boiled down to the condition input_prec + (output_scale - input_scale) <= output_prec

Example:

consider a conversion from Decimal(5, 0) to Decimal(8, 3) with the number 12345 * 10^0 = 12345000 * 10^(-3)
If we are starting with any number xxxxx, then an increase of scale by 3 will have the following effect on the representation: [xxxxx] -> [xxxxx000].
So for this to work, every output type needs to have at least 8 digits of precision.

Case 2, decreasing scale:

Decreasing the scale will result in division with powers of 10, thus "shifting right" of the digits and adding a rounding term. We usually need less precision to represent the result. By shifting right we lose input_scale - output_scale digits, but adding a rounding term can result in one additional digit being required, therefore we can boil this down to
input_prec - (input_scale - output_scale) < output_prec

Example:

consider a conversion from Decimal(5, 0) to Decimal(3, -3) with the number 99900 * 10^0 = 99.9 * 10^(3) which is then rounded to 100 * 10^3 = 100000
If we are starting with any number xxxxx, then and decrease the scale by 3 will have the following effect on the representation: [xxxxx] -> [xx] (+ 1 possibly). In the example plus one adds an additional digit, so the conversion to work, every output type needs to have at least 3 digits of precision.
A conversion to Decimal(2, -3) would not be possible.

Performance impact

The only cases affected are between decimal128 types, for increasing scale there is a considerable improvements that I measured of around 80%.
For decreasing the scale there was an improvement of around 25%.

cast decimal128 to decimal128 512                                  6.98      2.8±0.00µs        ? ?/sec    1.00    402.4±0.35ns        ? ?/sec
cast decimal128 to decimal128 512 lower precision                  1.01      3.0±0.00µs        ? ?/sec    1.00      2.9±0.01µs        ? ?/sec
cast decimal128 to decimal128 512 with lower scale (infallible)    1.31      3.3±0.00µs        ? ?/sec    1.00      2.5±0.00µs        ? ?/sec
cast decimal128 to decimal128 512 with same scale                  1.07     54.2±1.47ns        ? ?/sec    1.00     50.5±1.41ns        ? ?/sec
cast decimal128 to decimal256 512                                  1.00      6.1±0.01µs        ? ?/sec    1.00      6.1±0.01µs        ? ?/sec
cast decimal256 to decimal128 512                                  1.02     13.2±0.67µs        ? ?/sec    1.00     12.9±0.12µs        ? ?/sec
cast decimal256 to decimal256 512                                  1.01      6.1±0.02µs        ? ?/sec    1.00      6.1±0.03µs        ? ?/sec
cast decimal256 to decimal256 512 with same scale                  1.05     54.1±1.98ns        ? ?/sec    1.00     51.6±1.15ns        ? ?/sec

What changes are included in this PR?

I've added a new specialization for dealing with "safe" casts between the same decimal type both when reducing and increasing scale
A new benchmark for reducing scale

Are there any user-facing changes?

No, the behavior of the casts should stay the same.

alamb · 2025-01-25T17:12:52Z

Thank @aweltsch -- I started the CI on this PR

aweltsch added 2 commits January 25, 2025 15:04

Implement fast path for infallible case.

30d3fc3

Add benchmark for infallible casting to smaller scale.

69a665b

github-actions bot added the arrow Changes to the arrow crate label Jan 25, 2025

aweltsch mentioned this pull request Jan 25, 2025

Optimise Decimal Casting #6877

Open

Format benchmark.

e7736bb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimise decimal casting for infallible conversions #7021

Optimise decimal casting for infallible conversions #7021

aweltsch commented Jan 25, 2025 •

edited by alamb

Loading

alamb commented Jan 25, 2025

Optimise decimal casting for infallible conversions #7021

Are you sure you want to change the base?

Optimise decimal casting for infallible conversions #7021

Conversation

aweltsch commented Jan 25, 2025 • edited by alamb Loading

Which issue does this PR close?

Rationale for this change

Explanation of the optimisation:

Case 1, increasing scale:

Example:

Case 2, decreasing scale:

Example:

Performance impact

What changes are included in this PR?

Are there any user-facing changes?

alamb commented Jan 25, 2025

aweltsch commented Jan 25, 2025 •

edited by alamb

Loading