
Distributed CUDAEnsemble #1073

Closed
Robadob opened this issue Jul 3, 2023 · 1 comment · Fixed by #1090

Robadob (Member) commented Jul 3, 2023

MPI-distributed ensemble: the RunPlanVector has its work split across multiple nodes.

Should be a relatively simple chance to try out MPI, and it would give us a better idea of what we'd be getting into if aiming for multi-node simulations.
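For illustration only (not the scheme proposed in the comment below), the simplest possible split would be static, with each rank taking every world_size-th plan. A minimal sketch, with a plain run count standing in for the RunPlanVector and a printf standing in for executing the plan:

#include <mpi.h>
#include <cstdio>

int main(int argc, char** argv) {
    MPI_Init(&argc, &argv);
    int world_rank = 0, world_size = 1;
    MPI_Comm_rank(MPI_COMM_WORLD, &world_rank);
    MPI_Comm_size(MPI_COMM_WORLD, &world_size);
    const int total_runs = 100;  // placeholder for plans.size()
    // Naive static split: rank r takes runs r, r + world_size, r + 2 * world_size, ...
    for (int i = world_rank; i < total_runs; i += world_size) {
        printf("Rank %d would execute run plan %d\n", world_rank, i);  // placeholder for running the plan
    }
    MPI_Finalize();
    return 0;
}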

Robadob self-assigned this Jul 10, 2023

Robadob (Member, Author) commented Jul 18, 2023

Rough plan for the basic process:

enum EnvelopeTag : int {
    RequestJob = 0,
    AssignJob = 1
};

Rank 0

MPI_Init(NULL, NULL);
int world_rank;  // This var will branch behaviour
MPI_Comm_rank(MPI_COMM_WORLD, &world_rank);
int world_size;
MPI_Comm_size(MPI_COMM_WORLD, &world_size);
int next_run = 0;       // Index of the next unassigned job
int finalize_size = 1;  // Ranks known to be finished (rank 0 counts itself)
MPI_Status status;
while (finalize_size < world_size) {
    memset(&status, 0, sizeof(MPI_Status));
    // Wait for job requests from any source, these have no data
    MPI_Recv(
        nullptr,                   // void* data
        0,                         // int count
        MPI_DATATYPE_NULL,         // MPI_Datatype datatype
        MPI_ANY_SOURCE,            // int source
        EnvelopeTag::RequestJob,   // int tag
        MPI_COMM_WORLD,            // MPI_Comm communicator
        &status);                  // MPI_Status* status
    // Select the next unassigned job
    int next_job_index = next_run++;
    // Respond to the sender with a job assignment
    MPI_Send(
        &next_job_index,           // void* data
        1,                         // int count
        MPI_INT,                   // MPI_Datatype datatype
        status.MPI_SOURCE,         // int destination
        EnvelopeTag::AssignJob,    // int tag
        MPI_COMM_WORLD);           // MPI_Comm communicator
    // An out-of-range index tells that worker to stop
    if (next_job_index >= plans.size())
        ++finalize_size;
}
MPI_Finalize();

Rank > 0

MPI_Init(NULL, NULL);
int world_rank;  // This var will branch behaviour
MPI_Comm_rank(MPI_COMM_WORLD, &world_rank);
int next_run = 0;  // Most recent job assignment
MPI_Status status;
while (next_run < plans.size()) {
    memset(&status, 0, sizeof(MPI_Status));
    // Send a job request to rank 0, these have no data
    MPI_Send(
        nullptr,                   // void* data
        0,                         // int count
        MPI_DATATYPE_NULL,         // MPI_Datatype datatype
        0,                         // int destination (rank 0 is the scheduler)
        EnvelopeTag::RequestJob,   // int tag
        MPI_COMM_WORLD);           // MPI_Comm communicator
    // Wait for a job assignment from rank 0
    MPI_Recv(
        &next_run,                 // void* data
        1,                         // int count
        MPI_INT,                   // MPI_Datatype datatype
        0,                         // int source (is there a better built in macro/enum for 0?)
        EnvelopeTag::AssignJob,    // int tag
        MPI_COMM_WORLD,            // MPI_Comm communicator
        &status);                  // MPI_Status* status
    // Process the job assignment (an out-of-range index ends the loop instead)
    if (next_run < plans.size())
        do_run();
}
MPI_Finalize();
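Putting the two roles into one SPMD binary (as the world_rank comments above imply), a compilable sketch with plans_size and do_run() as placeholders for the real RunPlanVector / simulation launch; names are illustrative, not the final API:

#include <mpi.h>
#include <cstdio>

enum EnvelopeTag : int { RequestJob = 0, AssignJob = 1 };

// Placeholder for launching the actual simulation of a given plan index
static void do_run(int run_index) {
    printf("Executing run plan %d\n", run_index);
}

int main(int argc, char** argv) {
    const int plans_size = 100;  // placeholder for plans.size()
    MPI_Init(&argc, &argv);
    int world_rank = 0, world_size = 1;
    MPI_Comm_rank(MPI_COMM_WORLD, &world_rank);
    MPI_Comm_size(MPI_COMM_WORLD, &world_size);
    if (world_rank == 0) {
        // Scheduler: hand out job indices until every worker has been told to stop
        int next_run = 0;
        int finalize_size = 1;  // rank 0 counts itself as finished
        while (finalize_size < world_size) {
            MPI_Status status;
            // Zero-length request (MPI_CHAR here, as MPI_DATATYPE_NULL may be rejected by strict implementations)
            MPI_Recv(nullptr, 0, MPI_CHAR, MPI_ANY_SOURCE,
                     EnvelopeTag::RequestJob, MPI_COMM_WORLD, &status);
            int next_job_index = next_run++;
            MPI_Send(&next_job_index, 1, MPI_INT, status.MPI_SOURCE,
                     EnvelopeTag::AssignJob, MPI_COMM_WORLD);
            if (next_job_index >= plans_size)
                ++finalize_size;  // that worker has been told to stop
        }
    } else {
        // Worker: request jobs until an out-of-range index is returned
        int next_run = 0;
        while (next_run < plans_size) {
            MPI_Send(nullptr, 0, MPI_CHAR, 0,
                     EnvelopeTag::RequestJob, MPI_COMM_WORLD);
            MPI_Recv(&next_run, 1, MPI_INT, 0,
                     EnvelopeTag::AssignJob, MPI_COMM_WORLD, MPI_STATUS_IGNORE);
            if (next_run < plans_size)
                do_run(next_run);
        }
    }
    MPI_Finalize();
    return 0;
}

Launched with e.g. mpirun -np 4 ./ensemble; note rank 0 acts purely as the scheduler here, so only world_size - 1 ranks would actually run simulations.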

Will require some more thought as to validation (e.g. checking that the received count matches what was expected) and how to separate it from the regular CUDAEnsemble (a subclass? a second runDistributed() method? automatically do everything via MPI when built with MPI?).
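For the receive-count validation, MPI_Get_count on the returned status should do; a fragment to sit after the worker's MPI_Recv (assumes <cstdio> and <cstdlib>):

    // Confirm the assignment actually carried exactly one int before using it
    int received = 0;
    MPI_Get_count(&status, MPI_INT, &received);
    if (received != 1) {
        fprintf(stderr, "Rank %d: malformed job assignment (count=%d)\n", world_rank, received);
        MPI_Abort(MPI_COMM_WORLD, EXIT_FAILURE);
    }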
