-
Do you know how you would implement the program in CUDA? Basically, you need several streams, one per pipeline stage. At the beginning of each round, you inject one task into each stream. Then you drain all the streams, so that every injected task has finished before the next round starts. This is similar to the code here: https://github.com/sarchlab/mgpusim/blob/v3/benchmarks/heteromark/fir/fir.go#L132 Can you check the code and see if it helps?
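To make the inject-then-drain idea concrete, here is a rough CUDA sketch. It is only an illustration under assumptions, not the code from the linked benchmark and not real AES: `stageKernel`, `runPipeline`, and the constants `kStages`, `kChunks`, and `kChunkLen` are hypothetical names, and the kernel is a placeholder that adds `stage + 1` to each element where an AES stage would go. Error checking is left out to keep it short.

```cuda
#include <cstdio>
#include <vector>
#include <cuda_runtime.h>

constexpr int kStages = 3;      // hypothetical pipeline depth (one stream per stage)
constexpr int kChunks = 5;      // hypothetical number of independent data chunks
constexpr int kChunkLen = 256;  // elements per chunk

// Placeholder stage: a real pipeline would run one AES stage here.
__global__ void stageKernel(unsigned *chunk, int n, unsigned stage) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) chunk[i] += stage + 1;
}

// Runs the pipeline and returns the number of elements with a wrong value.
int runPipeline() {
    const int total = kChunks * kChunkLen;
    unsigned *dData = nullptr;
    cudaMalloc(&dData, total * sizeof(unsigned));
    cudaMemset(dData, 0, total * sizeof(unsigned));
    cudaDeviceSynchronize();  // make sure the memset is done before any stage runs

    cudaStream_t streams[kStages];
    for (int s = 0; s < kStages; s++) cudaStreamCreate(&streams[s]);

    const int threads = 128;
    const int blocks = (kChunkLen + threads - 1) / threads;

    // The pipeline needs kChunks + kStages - 1 rounds to fill and empty.
    for (int round = 0; round < kChunks + kStages - 1; round++) {
        // Inject: in this round, stage s works on chunk (round - s).
        for (int s = 0; s < kStages; s++) {
            int chunk = round - s;
            if (chunk < 0 || chunk >= kChunks) continue;  // stage idle this round
            stageKernel<<<blocks, threads, 0, streams[s]>>>(
                dData + chunk * kChunkLen, kChunkLen, s);
        }
        // Drain: wait for every stream, so stage s+1 sees stage s's output next round.
        for (int s = 0; s < kStages; s++) cudaStreamSynchronize(streams[s]);
    }

    std::vector<unsigned> host(total);
    cudaMemcpy(host.data(), dData, total * sizeof(unsigned), cudaMemcpyDeviceToHost);

    // Every element passed through all stages once: 1 + 2 + ... + kStages.
    const unsigned expected = kStages * (kStages + 1) / 2;
    int bad = 0;
    for (int i = 0; i < total; i++) {
        if (host[i] != expected) bad++;
    }

    for (int s = 0; s < kStages; s++) cudaStreamDestroy(streams[s]);
    cudaFree(dData);
    return bad;
}

int main() {
    int bad = runPipeline();
    printf("%d mismatched elements\n", bad);
    return bad == 0 ? 0 : 1;
}
```

Within a round, each stream works on a different chunk, so the stages can overlap on the GPU. The drain at the end of the round is what keeps the stages of a single chunk in order.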
-
Hello, Professor Sun. I would like to implement pipelined AES on MGPUSim, but everything is quite chaotic right now, and I’m not sure where to start. Could you give me some advice?