-
Notifications
You must be signed in to change notification settings - Fork 68
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Clarify group synchronization behavior #499
Clarify group synchronization behavior #499
Conversation
The previous description of group_barrier required implementations to execute certain fences, which should be an implementation detail. This commit adds a "Synchronization" section to group_barrier and describes behavior in terms of a happens-before relationship. Implementations may choose to implement the barrier using any synchronization operations which satisfy this requirement.
State explicitly that: - Group algorithms block until the algorithm executes. - Group algorithms have similar happens-before implications to barriers. Closes internal issue 576.
C++ just says "blocks" rather than "blocks this thread", so we can drop "blocks this work-item".
Explains that a group function may define operation(s) that take place only when all work-items have arried at the group function's synchronization point. The intent is that group algorithms can then be considered a specific instance of these group functions, where the operation(s) constitute the algorithm that should be executed.
These were accidentally overlooked during the previous round of changes.
Co-authored-by: Greg Lueck <[email protected]>
This needs one more reviewer. Can someone volunteer? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
That looks good. Perhaps we could factorize somehow the basic synchronization description?
I agree that there are probably some opportunities to reduce the amount of duplicated information. I'd rather we do that in a follow-up PR, though, if everybody is okay with that. Ideally, I'd like us to agree that the specification for each overload is correct, and then we can look for opportunities to refactor things. |
Clarify group synchronization behavior
Clarify group synchronization behavior (cherry picked from commit fdc8a5e)
Clarify group synchronization behavior (cherry picked from commit fdc8a5e)
These changes build on #484 to clarify the expected synchronization behavior of group barriers and algorithms.
The wording proposed here deliberately does not specify how a SYCL implementation must satisfy certain guarantees. Previously, barriers were described in terms of fences and atomics, and we had discussed (offline) the possibility of defining algorithms similarly (or in terms of barriers). The approach taken by
std::barrier
in ISO C++ is much simpler, giving implementations the freedom to use any combination of synchronization operations that guarantee certain events to happen before other events.For example, it should be legal to implement
sycl::group_barrier
using any of the following:std::atomic_thread_fence
+ relaxedstd::atomic_ref
, release atomics paired with acquire atomics, a call tostd::barrier
, or any implementation-defined (backend-specific) functions that provide the desired memory ordering semantics.There's a lot of duplication here now, but there are two reasons for that:
I wanted some feedback on the wording, and specifically whether we would like the "Effects" for each algorithm to mention the algorithm by name.
I wanted to separate the review of the clarifications from any later formatting changes. The new table of overloads format that @gmlueck has been working on would allow us to combine some of these definitions.
Closes internal issue 576.