[0.10] Forbid divergent execution of work-group barriers #558
Benchmark Results
Benchmark Plots
A plot of the benchmark results has been uploaded as an artifact to the workflow run for this PR.
This currently doesn't lower correctly:
Codecov Report
Attention: Patch coverage is
Additional details and impacted files
@@         Coverage Diff          @@
##           main    #558   +/-   ##
=====================================
  Coverage  0.00%   0.00%
=====================================
  Files        21      21
  Lines      1584    1575    -9
=====================================
+ Misses     1584    1575    -9
With pocl#main only two tests are still failing.
As noted by @maleadt in JuliaGPU/OpenCL.jl#283 (comment)
Several backends have a strict requirement that barriers like @synchronize must be executed convergently, and the automatic bounds-checking in KA violates that.
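To make the problem concrete, here is a small conceptual simulation in Python. It is not KA code and not the lowering used by this PR. It assumes that the automatic bounds check wraps the whole kernel body, so work-items outside the ndrange skip the barrier entirely. The names `run_workgroup`, `guarded_kernel`, `convergent_kernel`, and `DivergentBarrier` are hypothetical. In this toy model a divergent barrier raises an error; on real hardware the result may be a hang or undefined behavior.

```python
BARRIER = object()  # marker yielded by a work-item when it reaches a barrier


class DivergentBarrier(Exception):
    """Raised when only part of the work-group reaches a barrier."""


def run_workgroup(kernel, group_size, *args):
    """Run one work-group in barrier-delimited phases (toy scheduler)."""
    items = [kernel(i, *args) for i in range(group_size)]
    while items:
        # Advance every live work-item to its next barrier or to completion.
        waiting = [it for it in items if next(it, None) is BARRIER]
        if waiting and len(waiting) != len(items):
            raise DivergentBarrier(
                f"{len(waiting)} of {len(items)} work-items reached the barrier"
            )
        items = waiting


def guarded_kernel(i, out, inp, n, shared):
    # Assumed shape: the bounds check wraps the entire body,
    # so out-of-bounds work-items never reach the barrier.
    if i < n:
        shared[i] = inp[i]
        yield BARRIER
        out[i] = shared[n - 1 - i]


def convergent_kernel(i, out, inp, n, shared):
    # One convergent alternative: guard only the memory accesses,
    # and let every work-item execute the barrier.
    if i < n:
        shared[i] = inp[i]
    yield BARRIER
    if i < n:
        out[i] = shared[n - 1 - i]


if __name__ == "__main__":
    out = [0, 0, 0]
    # Group of 4 work-items, but only 3 valid indices.
    run_workgroup(convergent_kernel, 4, out, [1, 2, 3], 3, [0] * 4)
    print(out)  # [3, 2, 1]
    try:
        run_workgroup(guarded_kernel, 4, [0] * 3, [1, 2, 3], 3, [0] * 4)
    except DivergentBarrier as err:
        print("divergent barrier:", err)
```

With a group size of 4 and 3 valid indices, the guarded variant has three work-items waiting at the barrier while the fourth has already returned, which is exactly the divergent case the backends forbid.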
Before, GPU kernels lowered like:
Now they lower to: