
CORE: fix bug for triggered colls in tl/ucp #757

Merged


samnordmann
Collaborator

What

QA reported a bug (#3338436):
TL/UCP reduce_scatter and reduce_scatterv segfault on CUDA when run in triggered mode.
The cause is that schedule->super.triggered_post is not set on the schedule.

How

Every task must set task.triggered_post = ucc_triggered_post, so this assignment is now done in ucc_coll_task_init.
Only TL/NCCL and TL/RCCL override this field.
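
To illustrate the pattern, here is a minimal, self-contained sketch. It is not the UCC source: every `sketch_*` type and function is a hypothetical stand-in, and it assumes only what is described above, namely that a schedule embeds a task as `super`, that the common init function installs a default `triggered_post` handler, and that a backend may replace it afterwards.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical, simplified stand-ins for illustration; not the real UCC types. */
typedef struct sketch_task sketch_task_t;
typedef int (*sketch_triggered_post_fn)(sketch_task_t *task);

struct sketch_task {
    sketch_triggered_post_fn triggered_post; /* called on the triggered path */
    int posted_by; /* 0 = nothing ran, 1 = default handler, 2 = backend handler */
};

typedef struct {
    sketch_task_t super; /* a schedule embeds a task, as in schedule->super */
    int n_tasks;
} sketch_schedule_t;

/* Stand-in for the generic handler (the role played by ucc_triggered_post). */
static int sketch_default_triggered_post(sketch_task_t *task)
{
    task->posted_by = 1;
    return 0;
}

/* Stand-in for a backend-specific handler (the role of TL/NCCL or TL/RCCL). */
static int sketch_backend_triggered_post(sketch_task_t *task)
{
    task->posted_by = 2;
    return 0;
}

/* Common init: every task, including one embedded in a schedule, gets the
 * default handler here, so no call site can forget to set it. */
static void sketch_task_init(sketch_task_t *task)
{
    assert(task != NULL);
    memset(task, 0, sizeof(*task));
    task->triggered_post = sketch_default_triggered_post;
}

/* A schedule initializes its embedded task through the same common init. */
static void sketch_schedule_init(sketch_schedule_t *schedule)
{
    sketch_task_init(&schedule->super);
    schedule->n_tasks = 0;
}

/* A backend runs the common init first, then overrides the field. */
static void sketch_backend_task_init(sketch_task_t *task)
{
    sketch_task_init(task);
    task->triggered_post = sketch_backend_triggered_post;
}

/* Triggered path: returns -1 for an unset handler instead of calling through
 * a null pointer, which is the crash this sketch is meant to illustrate. */
static int sketch_trigger(sketch_task_t *task)
{
    if (task->triggered_post == NULL) {
        return -1;
    }
    return task->triggered_post(task);
}
```

In this sketch, a schedule set up with `sketch_schedule_init` can be passed to `sketch_trigger` via `&schedule.super` and reaches the default handler, while a task set up with `sketch_backend_task_init` reaches the backend handler. A zeroed task that skipped the common init is the case that would otherwise dereference a null function pointer.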

@manjugv manjugv added this to the v1.2.0 Release milestone Mar 29, 2023
@Sergei-Lebedev Sergei-Lebedev merged commit a5d0b97 into openucx:master Apr 3, 2023
jeniaka pushed a commit to jeniaka/ucc that referenced this pull request Jun 21, 2023
janjust pushed a commit to janjust/ucc that referenced this pull request Jan 31, 2024