feat(gpu): implement fhe rand on gpu #1958
Conversation
Force-pushed from 349cfaa to 9088fb0
Hey @guillermo-oyarzun! Thanks a lot for this PR, here comes my review. My main question is about the par_generate... entry points: at the moment the CUDA calls are made on the same streams, which makes execution sequential. What we could do is use a different set of streams for each of the par_iter iterations, maybe? Wdyt?
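For concreteness, a minimal sketch of that idea; `Stream` and `launch_generate` are hypothetical stand-ins for the real CUDA backend types, not the actual API:

```rust
use rayon::prelude::*;

// Hypothetical stand-in for the backend's CUDA stream type.
struct Stream;

impl Stream {
    fn new(_gpu_index: u32) -> Self {
        Stream
    }
}

// Hypothetical kernel launch; in the real code this would be the
// generate call issued on the given stream.
fn launch_generate(_seed: u64, _stream: &Stream) {}

fn par_generate(seeds: Vec<u64>, gpu_index: u32) {
    // One stream per iteration, so launches from different iterations
    // can overlap instead of serializing on a single shared stream.
    let streams: Vec<Stream> = (0..seeds.len())
        .map(|_| Stream::new(gpu_index))
        .collect();

    seeds
        .into_par_iter()
        .zip(streams.par_iter())
        .for_each(|(seed, stream)| launch_generate(seed, stream));
}
```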
Force-pushed from 5d54d65 to 63eb9b8
Force-pushed from cd28cff to 29dcb98
.into_par_iter()
.enumerate()
.map(|(i, seed)| {
    let stream_index = i;
Here, if there are too many blocks this won't work, will it? We would need something like `let stream_index = i % streams.gpu_indexes.len()` instead.
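A tiny self-contained illustration of the suggested wrap-around; the pool size here is a made-up stand-in for `streams.gpu_indexes.len()`:

```rust
// Map a block index onto a fixed-size stream pool: the result always
// stays inside the pool, no matter how many blocks there are.
fn stream_for_block(block_index: usize, pool_size: usize) -> usize {
    block_index % pool_size
}

fn main() {
    let pool_size = 4; // hypothetical, stands in for streams.gpu_indexes.len()
    for i in 0..10 {
        // Blocks 0..3 hit streams 0..3, block 4 wraps back to stream 0, etc.
        println!("block {i} -> stream {}", stream_for_block(i, pool_size));
    }
}
```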
It will work because, beforehand, I generate a vector of streams with as many streams as there are blocks. This is executed on a single GPU, but with many streams.
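A hedged sketch of that setup, again with a hypothetical `Stream` type: one stream per block on a single GPU, so indexing by the block index is always in bounds.

```rust
// Hypothetical stand-in for the backend's CUDA stream type.
struct Stream {
    gpu_index: u32,
}

// Build one stream per block, all on the same GPU, so indexing the
// vector with the block index i can never go out of bounds.
fn streams_for_blocks(num_blocks: usize, gpu_index: u32) -> Vec<Stream> {
    (0..num_blocks).map(|_| Stream { gpu_index }).collect()
}

fn main() {
    let num_blocks = 8;
    let streams = streams_for_blocks(num_blocks, 0);
    assert_eq!(streams.len(), num_blocks); // stream_index = i is always valid
    assert!(streams.iter().all(|s| s.gpu_index == 0)); // all on one GPU
}
```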
Ah yes of course, thanks for the clarification!
Force-pushed from 29dcb98 to 17f5d05