Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Debug progress TIMEOUTs #11

Open
wants to merge 2 commits into
base: main
Choose a base branch
from
Open

Debug progress TIMEOUTs #11

wants to merge 2 commits into from

Conversation

hzhou
Copy link
Owner

@hzhou hzhou commented May 15, 2021

Pull Request Description

~/work/pull_requests/debug_requests/test/mpi/pt2pt$ MPIR_CVAR_CH4_DEBUG_PROGRESS_TIMEOUT=1 mpirun -n 3 ./test_deadlock
1 pending requests in pool 0
    ac000000:
1 pending requests in pool 0
    ac000001:
1 pending requests in pool 0
    ac000000:
^C[mpiexec@Tiger] Sending Ctrl-C to processes as requested

Author Checklist

  • Provide Description
    Particularly focus on why, not what. Reference background, issues, test failures, xfail entries, etc.
  • Commits Follow Good Practice
    Commits are self-contained and do not do two things at once.
    Commit message is of the form: module: short description
    Commit message explains what's in the commit.
  • Passes All Tests
    Whitespace checker. Warnings test. Additional tests via comments.
  • Contribution Agreement
    For non-Argonne authors, check contribution agreement.
    If necessary, request an explicit comment from your companies PR approval manager.

@hzhou hzhou force-pushed the debug_requests branch 2 times, most recently from de82e58 to 3350ea4 Compare May 15, 2021 14:53
@hzhou hzhou force-pushed the debug_requests branch 2 times, most recently from dd3c897 to 7447f0e Compare November 8, 2022 15:03
When MPICH_DEBUG_REQUEST is defined (config option to be added), we add
an info string to the request objects that can be used to update
readable debug informations to the request.
When MPICH_DEBUG_REQUEST is defined and
MPIR_CVAR_CH4_DEBUG_PROGRESS_TIMEOUT is set to non-zero, we add debug
dump of outstanding request infomation when progress-loop is not making
progress for a while. This is helpful in debugging TIMEOUT errors.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant