-
Notifications
You must be signed in to change notification settings - Fork 117
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
dstore: keeping job info in the dstor #217
Conversation
Need to fix: This is the reason of failing the Mellanox jenkins on test |
Reduction of memory usage on pmix-client's side by to not duplicate the job data for each client's: - Job info will not be sent to clients when connecting to the server. - The server will provide the job information into `dstore`. - The client will use the `dstore` for access to job info.
d22abbe
to
3761a90
Compare
bot:retest |
3761a90
to
1e8a9fb
Compare
Provide the information about process location into `dstor`
1e8a9fb
to
32c93e2
Compare
@karasevb do you have an estimate on when this will be merged in? |
@hppritcha What is really important - we need others to try this before merging since it's a reasonable change. Once we will have 👍 from others - we can merge. |
We have a good results at init stage with use dstor for job info.
Also, we need measure the memory footprint. |
Here is the memory consumption data (in KB) collected on 8 nodes, 16 ppn for the updated perf tool (PR #223). Here PSS (the process's proportional share of this mapping) was collected. |
@hppritcha @jjhursey @rhc54 @hjelmn We removed the "In-progress" label and done all of our testing. We need updates from your side so we can merge this. |
@karasevb could you please repeat the measurements with the key count = 0? Just out of curiosity. |
I'm planning to run some of these performance tests on some of our systems today. |
@jjhursey Note, that we haven't done anything in PPC optimization direction yet. But I'm still curious to see the results. |
Per today's call:
|
Passed MTT when installed in OMPI (dstore enabled by default):
|
Corresponds to PR openpmix#217 (cherry-picked dcb6967) (cherry-picked 32c93e2)
Corresponds to PR openpmix#217 (cherry-picked from dcb6967) (cherry-picked from 32c93e2)
Corresponds to PR openpmix#217 (cherry-picked from dcb6967) (cherry-picked from 32c93e2)
Corresponds to PR openpmix#217 (cherry-picked from dcb6967) (cherry-picked from 32c93e2)
Reduction of memory usage on pmix-client's side by to not duplicate
the job data for each client's:
dstore
.dstore
for access to job info.Fixes #144