Draft impl of JP06 pseudo-uniform sieve #65

dvdplm · 2024-11-11T21:45:37Z

Plenty to do here still, but I wanted to check the temperature on adding something like this. I have some preliminary benchmarks but need more, as well as some sort of measurement of the quality of the candidate distribution.

Benchmarks:

Uniform sieve/(U128) Random prime
                        time:   [16.186 µs 16.271 µs 16.355 µs]
                        change: [-57.114% -56.686% -56.285%] (p = 0.00 < 0.05)
                        Performance has improved.
Found 5 outliers among 100 measurements (5.00%)
  2 (2.00%) low mild
  3 (3.00%) high mild
Uniform sieve/(U1024) Random prime
                        time:   [3.7322 ms 3.8515 ms 3.9741 ms]
                        change: [-4.1998% -0.0392% +4.4595%] (p = 0.99 > 0.05)
                        No change in performance detected.
Found 1 outliers among 100 measurements (1.00%)
  1 (1.00%) high mild
Uniform sieve/(U2048) Random prime
                        time:   [44.705 ms 48.435 ms 52.268 ms]
                        change: [-16.090% -6.0587% +4.9886%] (p = 0.27 > 0.05)
                        No change in performance detected.
Found 2 outliers among 100 measurements (2.00%)
  2 (2.00%) high mild

Presets/(U128) Random prime
                        time:   [29.215 µs 29.282 µs 29.350 µs]
                        change: [-0.6125% -0.0475% +0.5168%] (p = 0.87 > 0.05)
                        No change in performance detected.
Found 5 outliers among 100 measurements (5.00%)
  1 (1.00%) low severe
  2 (2.00%) low mild
  2 (2.00%) high mild
Presets/(U1024) Random prime
                        time:   [16.208 ms 17.986 ms 19.881 ms]
                        change: [-14.583% -1.9649% +13.994%] (p = 0.79 > 0.05)
                        No change in performance detected.
Found 3 outliers among 100 measurements (3.00%)
  2 (2.00%) high mild
  1 (1.00%) high severe
Benchmarking Presets/(U2048) Random prime: Warming up for 3.0000 s
Warning: Unable to complete 100 samples in 5.0s. You may wish to increase target time to 8.2s, or reduce sample count to 60.
Presets/(U2048) Random prime
                        time:   [58.075 ms 68.075 ms 78.841 ms]
                        change: [-19.450% -0.2867% +23.699%] (p = 0.98 > 0.05)
                        No change in performance detected.
Found 7 outliers among 100 measurements (7.00%)
  6 (6.00%) high mild
  1 (1.00%) high severe

The above benchmark results are quite surprising to me. I expected the uniform sieve to be significantly slower than our PRIMEINC sieve. One odd thing is that I discovered by accident that when finding the upper bounds for algorithm2, using T::random_mod is significantly slower than T::random_bits (code here vs here). I can't quite explain why this is and have not (yet) benchmarked crypto-bigint.

…thod

Avoid a clone

codecov · 2024-11-11T21:49:18Z

Codecov Report

Attention: Patch coverage is 47.74775% with 58 lines in your changes missing coverage. Please review.

Project coverage is 95.25%. Comparing base (4ea84ca) to head (82aa804).

Files with missing lines	Patch %	Lines
src/uniform_sieve.rs	47.27%	58 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##           master      #65      +/-   ##
==========================================
- Coverage   99.37%   95.25%   -4.13%     
==========================================
  Files           9       10       +1     
  Lines        1280     1390     +110     
==========================================
+ Hits         1272     1324      +52     
- Misses          8       66      +58

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

fjarri · 2024-11-11T22:53:29Z

src/uniform_sieve.rs

+//                          λ(m) = 0x0000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000099F002D0502D70DF7B64150D19B477A781987C56EBCDB7637842794D22D0D022010DC9BA1DB141FC9D07A9587E7A130D9DF6F9B67812800D00
+// TODO(dp): Would love for this to be `const`. Or not here at all. Does it have any value, in some form?
+#[allow(unused)]
+fn calculate_m_and_lambda_m<T>() -> (T, T)


How long does that take, compared to prime searching? If it's not too long, we can just generate it in the sieve on every call. If it is, could be put in https://doc.rust-lang.org/std/sync/struct.LazyLock.html, but that requires std, so will have to be gated behind a feature.

I had a thought about moving it to a build.rs and spit out a file on disk with the values. That way it wouldn't bloat the binary. But that sucks in a lot of ways.

Another thought I had was to remove the code completely and add a git commit hash as a comment in the code, providing future readers a way to check how the constants were generated if they really care.

dvdplm added 19 commits October 21, 2024 15:44

Add a "rayon" feature that enables a parallelized prime generation me…

5c7b5f4

…thod

Clippy

52024e3

Drop the threadpool when recursing

f0e5edf

Avoid a clone

Add parallel safe prime generation

e4b1adb

Let users pick the thread count

7491a06

Fix tests

ca8b74d

Document available features and provide an example

9dac4e9

No need for num_cpus

be85f48

CHANGELOG entry

2be5919

Fix doctest in README

5929045

Criterion benches for the rayon feature

2934030

Add parallel versions of generate_prime and generate_safe_prime

61a860b

Rename the feature to "multicore" and other review feedback

d994c51

Missed feature gated NonZeroU32s

0e2e7e6

Merge branch 'master' into dp-rayon-prime-gen

b273519

working draft

70cda7d

working draft

09d04b8

draft of JP06 pseudo-uniform prime generation

c11db6a

Merge branch 'master' into dp-uniform-sieve

ce62bf5

dvdplm requested a review from fjarri November 11, 2024 21:47

Minor cleanup

db21ed3

fjarri mentioned this pull request Nov 11, 2024

Generalize sieving #64

Open

fjarri reviewed Nov 11, 2024

View reviewed changes

dvdplm added 6 commits November 13, 2024 13:46

Add test-log crate for easy tracing setup in tests

7f5697c

Add benches for uniform prime gen to compare with "presets"

b05bf23

Let tests use std

c5735a5

Proper bounds check (but why is random_bits so much faster?)

23d3cb2

Use a bigger l for U128 (4)

a4b4546

Let users pass the CSPRNG

ddc4730

Add back generate_prime + some docs + fix tests

82aa804

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Draft impl of JP06 pseudo-uniform sieve #65

Draft impl of JP06 pseudo-uniform sieve #65

dvdplm commented Nov 11, 2024 •

edited

Loading

codecov bot commented Nov 11, 2024 •

edited

Loading

fjarri Nov 11, 2024

dvdplm Nov 12, 2024

Draft impl of JP06 pseudo-uniform sieve #65

Are you sure you want to change the base?

Draft impl of JP06 pseudo-uniform sieve #65

Conversation

dvdplm commented Nov 11, 2024 • edited Loading

codecov bot commented Nov 11, 2024 • edited Loading

Codecov Report

fjarri Nov 11, 2024

Choose a reason for hiding this comment

dvdplm Nov 12, 2024

Choose a reason for hiding this comment

dvdplm commented Nov 11, 2024 •

edited

Loading

codecov bot commented Nov 11, 2024 •

edited

Loading