-
Notifications
You must be signed in to change notification settings - Fork 112
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
I am confused about the initElapsedS & readElapsedS. #173
Comments
I think the time read_arrays function consumed actually depend on the PCIe bandwidth? That function copy the data to Host from device |
Yes, Init and Read is a new thing we report that measures the setup and read-back time of the buffers.
What's the performance difference you're observing? |
@CKopoer the time intervals used for
The After #186 is merged, adding proper |
Fixed by #186 . |
So, As these lines shows, we get the initElapsedS & readElapsedS from the difference of each other. Is't a mistake or something meaningful I haven't understood?
Otherwise, I get the results on H800 using another closed-source NV-STREAM tool. It seems that it provided better bandwidth performance result compared with BabelStream because of the optimized block size parameters. What's more, it also and show Read & Write results. Could I take the Init_kernel as the Write result and read_arrays as Read result in BabelStream?
The text was updated successfully, but these errors were encountered: