Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve the MPI parcelport and change the zero-copy threshold to 8192 #6274

Merged
merged 1 commit into from
Jun 28, 2023

Conversation

JiakunYan
Copy link
Contributor

Improve the MPI parcelport:

  • Replace the lock-based tag provider with an atomic variable
  • Make the header message size dynamic

Since the maximum size of the dynamic header message is set to be the zero-copy serialization threshold, I also changed it from 128B to 8192B. (The message size that MPI switches from eager protocol to rendezvous protocol is usually around 8K - 64K).

Experiments: Octo-Tiger, max_level=6, SDSC Expanse, 32 nodes, 128 threads per node.
With hpx.parcel.mpi.sendimm=0, this PR improves the total execution time from 11.77s to 10.68s.
With hpx.parcel.mpi.sendimm=1, this PR improves the total execution time from ~60s to 11.72s.

@StellarBot
Copy link

Can one of the admins verify this patch?

{
chunk_buffers_.resize(num_zero_copy_chunks);
}
}
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.


void run() noexcept
{
util::mpi_environment::scoped_lock l;
get_next_free_tag(l);
}
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is acquiring this lock still necessary?

Comment on lines +33 to +35
int tag = next_tag.fetch_add(1, std::memory_order_relaxed) %
(util::mpi_environment::MPI_MAX_TAG - 1) +
1;
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should we add an assert when tag overflows? ... I'm not sure.

@JiakunYan JiakunYan force-pushed the opt-mpi branch 2 times, most recently from 1df2d4f to 1351724 Compare June 16, 2023 22:04
@JiakunYan
Copy link
Contributor Author

@hkaiser I think this PR is ready for you to take a look at again.

@hkaiser hkaiser modified the milestones: 1.10.0, 1.9.1 Jun 26, 2023
Copy link
Member

@hkaiser hkaiser left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, thanks!

@hkaiser
Copy link
Member

hkaiser commented Jun 26, 2023

@JiakunYan could you please rebase this PR to fix the issues reported (those were caused by problems on the master branch when you branched this off).

…tion threshold from 128 to 8192

- Replace the lock-based tag provider with an atomic variable
- Make the header message size dynamic
Copy link
Member

@hkaiser hkaiser left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, thanks!

@hkaiser
Copy link
Member

hkaiser commented Jun 28, 2023

bors merge

@bors
Copy link

bors bot commented Jun 28, 2023

Build succeeded!

The publicly hosted instance of bors-ng is deprecated and will go away soon.

If you want to self-host your own instance, instructions are here.
For more help, visit the forum.

If you want to switch to GitHub's built-in merge queue, visit their help page.

  • Bors

@bors bors bot merged commit 8a949a7 into STEllAR-GROUP:master Jun 28, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants