-
Notifications
You must be signed in to change notification settings - Fork 738
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
SYCL runtime: Severe host overhead in sycl::get_kernel_bundle #15824
Comments
Hi @majing921201,
Is your complaint that To add some background here which will likely be enough for a high-level explanation of this (but not enough to say what exactly happens in your case):
|
We used aot with 'pvc' as target. And we didn't pass kernel boudle to |
For platform compatible, we didn't use device max work group size to launch kernel, and switch to query specific max work group size for kernel by SYCL API. following is our code example
We found this usage takes much host overhead in application. we measured one kernel CPU performance here, each API name in table maps example code:
<style> </style>We also file internal jira to track this issue. Can you help evaluate this slow performance.
The text was updated successfully, but these errors were encountered: