[crypto] Trial division for RSA key generation. #21086

jadephilipoom · 2024-01-29T11:07:50Z

Filter out 62.1% of composite numbers by using arithmetic tricks specific to certain small primes. This should speed up RSA key generation by something close to 60% in expectation, because primality testing is the vast majority of the runtime and this lets us skip it in 60% of cases where it will eventually fail.

rswarbrick

I've just had a really enjoyable few minutes reading through some of this code! (I must say, I hadn't thought of using a ones-register to do masking like this: it's neat!)

A couple of nitty comments on the bits that I'd read.

rswarbrick · 2024-01-29T11:24:20Z

sw/otbn/crypto/rsa_keygen.s

@@ -1003,8 +1004,7 @@ check_p:
  /* Get the FG0.Z flag into a register.
       x2 <= (CSRs[FG0] >> 3) & 1 = FG0.Z */


I think this might need tweaking to match the "& 8" that's now in the code?

Looking at the rest of the file, it looks like there are several other places where we could make the corresponding change. Maybe that would be worth doing uniformly as a separate commit?

Maybe so, yes. I think I was looking to save code size here.

rswarbrick · 2024-01-29T12:21:42Z

sw/otbn/crypto/rsa_keygen.s


  /* Add the lower 16 bits of the sum to the highest 3 bits to get a 17-bit
     result.
       w22 <= w22 + (w23 >> 32) */
  bn.add   w22, w22, w23 >> 32

-  /* The sum from the previous addition is < 2 * F4, so a modular addition with
-     zero is sufficient to fully reduce.
+  /* The sum from the previous addition is <= 2^16 - 1 + 2^3 - 1 < 2 * F4, so a


Maybe "at most" instead of "<=", since those symbols are also being used for assignment?

rswarbrick · 2024-01-29T12:22:26Z

sw/otbn/crypto/rsa_keygen.s

+ * 35-bit result and then fold the number a few more times to get a 9-bit
+ * result.
+ *
+ * Testing 


"Testing "??

(I think the rest of the paragraph appears in the next commit?)

Not sure what was going on there! Anyway, I double-checked and it does seem like it's filled in on the next commit. The full paragraph is:

* Testing for these primes will catch approximately: * 1 - ((1 - 1/3) * (1 - 1/5) * ... * (1 - 1/31)) * = 62.1% of composite numbers.

Maybe I just hadn't done the calculation yet 🙂

jadephilipoom · 2025-02-21T09:56:20Z

I was searching for my own open PRs recently and noticed this one was never merged! Looks like it was approved the day before I changed employers and lost my open tabs so I think I know how that happened 😉 Luckily, the code hasn't changed much in the meantime, so I've rebased it and unless there are objections I'll merge once it passes CI.

These small primes have the convenient property that 2^8 mod p = 1, which significantly speeds up the check and allows it to share code with relprime_f4. Signed-off-by: Jade Philipoom <[email protected]>

All of these small primes have the nice property that 2^32 mod p = 4. Signed-off-by: Jade Philipoom <[email protected]>

jadephilipoom · 2025-02-21T11:11:45Z

Since we now have nicer profiling targets than we did when I first made this PR, I also ran some quick benchmarks. I modified the RSA-2048 keygen test to generate keys 10 times and continue recording the OTBN instruction count (a good proxy for the cycle count, and more accurate on FPGA tests given RSA keygen is too slow for Verilator). The results were:

>>> master = [728920834,626858263,844018075,212064556,1295722633,1265332333,1395617782,798422326,1964599999,225096775]
>>> trial_div = [518505838,279511222,642358674,344699495,1109487354,1467981868,223009163,527194053,118732085,535880793]
>>> avg(master)
935665357.6
>>> avg(trial_div)
576736054.5
>>> avg(master) / avg(trial_div)
1.6223458726040163

In short, I observed a 62% speedup, a nice confirmation of the 61% estimate. I think I just got lucky with how exact it is; the standard deviation on these measurements is large because RSA timing is so variable and I only had the patience for 10 samples. But still -- at least it confirms that we see the significant average speedups we expect.

jadephilipoom requested a review from moidx January 29, 2024 11:07

rswarbrick reviewed Jan 29, 2024

View reviewed changes

moidx approved these changes Jan 30, 2024

View reviewed changes

jadephilipoom force-pushed the trial-div branch from 50891e0 to 5d80df5 Compare February 21, 2025 09:55

[crypto] Check for multiples of 3, 5, and 17 in RSA keygen.

1ba1b59

These small primes have the convenient property that 2^8 mod p = 1, which significantly speeds up the check and allows it to share code with relprime_f4. Signed-off-by: Jade Philipoom <[email protected]>

jadephilipoom force-pushed the trial-div branch 4 times, most recently from 4599b4d to 689ba1e Compare February 21, 2025 10:12

[crypto] Check for multiples of 7, 11, and 31 in RSA keygen.

a2e535a

All of these small primes have the nice property that 2^32 mod p = 4. Signed-off-by: Jade Philipoom <[email protected]>

jadephilipoom force-pushed the trial-div branch from 689ba1e to a2e535a Compare February 21, 2025 11:10

jadephilipoom merged commit 3a5cca7 into lowRISC:master Feb 24, 2025
42 checks passed

jadephilipoom deleted the trial-div branch February 24, 2025 07:58

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[crypto] Trial division for RSA key generation. #21086

[crypto] Trial division for RSA key generation. #21086

jadephilipoom commented Jan 29, 2024

rswarbrick left a comment

rswarbrick Jan 29, 2024

jadephilipoom Jan 30, 2024

rswarbrick Jan 29, 2024

rswarbrick Jan 29, 2024

jadephilipoom Jan 30, 2024

jadephilipoom commented Feb 21, 2025 •

edited

Loading

jadephilipoom commented Feb 21, 2025 •

edited

Loading

		@@ -1003,8 +1004,7 @@ check_p:
		/* Get the FG0.Z flag into a register.
		x2 <= (CSRs[FG0] >> 3) & 1 = FG0.Z */

[crypto] Trial division for RSA key generation. #21086

[crypto] Trial division for RSA key generation. #21086

Conversation

jadephilipoom commented Jan 29, 2024

rswarbrick left a comment

Choose a reason for hiding this comment

rswarbrick Jan 29, 2024

Choose a reason for hiding this comment

jadephilipoom Jan 30, 2024

Choose a reason for hiding this comment

rswarbrick Jan 29, 2024

Choose a reason for hiding this comment

rswarbrick Jan 29, 2024

Choose a reason for hiding this comment

jadephilipoom Jan 30, 2024

Choose a reason for hiding this comment

jadephilipoom commented Feb 21, 2025 • edited Loading

jadephilipoom commented Feb 21, 2025 • edited Loading

jadephilipoom commented Feb 21, 2025 •

edited

Loading

jadephilipoom commented Feb 21, 2025 •

edited

Loading