Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix bug for construct sempaphore #341

Merged
merged 12 commits into from
Sep 4, 2024
Merged

Fix bug for construct sempaphore #341

merged 12 commits into from
Sep 4, 2024

Conversation

Binyang2014
Copy link
Contributor

@Binyang2014 Binyang2014 commented Aug 23, 2024

Current semaphore construction requires two-way communication, e.g., to construct a semaphore signaling from rank 0 to rank 1, both rank 0 and rank 1 need to send a message to each other. This PR fixes an executor bug that fails to conduct two-way communication for constructing such one-way semaphores, and instead hangs during the semaphore construction. In the future, we may need to change the implementation to construct semaphore via one-way communication.

@Binyang2014 Binyang2014 marked this pull request as ready for review August 26, 2024 02:47
@Binyang2014 Binyang2014 requested review from chhwang and caiomcbr and removed request for chhwang August 26, 2024 02:53
Copy link
Contributor

@chhwang chhwang left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@chhwang chhwang merged commit 26a8753 into main Sep 4, 2024
17 checks passed
@chhwang chhwang deleted the binyli/bug-fix branch September 4, 2024 11:42
caiomcbr pushed a commit that referenced this pull request Sep 12, 2024
Current semaphore construction requires two-way communication, e.g., to
construct a semaphore signaling from rank 0 to rank 1, both rank 0 and
rank 1 need to send a message to each other. This PR fixes an executor
bug that fails to conduct two-way communication for constructing such
one-way semaphores, and instead hangs during the semaphore construction.
In the future, we may need to change the implementation to construct
semaphore via one-way communication.

---------

Co-authored-by: Changho Hwang <changhohwang@microsoft.com>
caiomcbr pushed a commit that referenced this pull request Sep 13, 2024
Current semaphore construction requires two-way communication, e.g., to
construct a semaphore signaling from rank 0 to rank 1, both rank 0 and
rank 1 need to send a message to each other. This PR fixes an executor
bug that fails to conduct two-way communication for constructing such
one-way semaphores, and instead hangs during the semaphore construction.
In the future, we may need to change the implementation to construct
semaphore via one-way communication.

---------

Co-authored-by: Changho Hwang <changhohwang@microsoft.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants