-
Notifications
You must be signed in to change notification settings - Fork 10
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
PArray coherence issue #149
Comments
Thanks for reporting this issues. Fine grained slicing in this branch has known bug which has been fixed in experiment-parla but has not sync back to this repo yet. Will make a PR to bring the patch back |
#150 is created which solves coherence bugs in PArray. Whats more, your second exmaple doesn't work since it voilate the PArray's restriction that doesn't allow moving multiple overlapping subarrays at the system without a writeback. That also includes the same subarray on different device (e.g. task_a might create arr[0] on gpu 0 but task_b will read arr[0] to gpu 1). This is a TODO that will be supported in next parla release but not yet. So to make you example work in current version, you need a
|
The changes have been merged into the main branch and the tutorial is also updated based on that. Will leave the issues as open for enhancement to get rid of the requirement of using writeback task. |
The merge does not resolve the issue, even with a writeback. Some objects are still None and the output is essentially the same.
|
My bad, has removed the debug string please pull the changes. I have reproduced the bug and looks like there is still a bug in current version of parla but not in new parla. Will look at it. |
When doing a reduce operation using PArrays, the data in follow-up operations is not consistent or None.
The following example without a reduction works fine:
The outputs are 1.0, 1.0, 1.0, 1.0.
Adding a reduce operation leads to wrong values, often 0.5, 0.0, 0.0, 0.0
When the acc operation binds the parameter via
inout
the values are sometimes None and yield runtime exceptions. This only happens on GPU, not on CPU.The text was updated successfully, but these errors were encountered: