PM paper brainstorm
Phil Miller edited this page Jul 21, 2020 · 10 revisions
- Provides asynchronous execution and application-independent dynamic load balancing in a design that emphasizes close integration with, and embedding in, existing MPI code
- Includes consolidated tracing, instrumentation, and run-time error-checking across MPI and AMT execution
- Achieves portability by using strictly MPI for communication, not low-level hardware-specific interfaces
- Offers a consistent C++ API that balances convenience for application developers, through modern language facilities, against compatibility with the broad range of compilers used in distributed-memory HPC applications (e.g. Intel, Nvidia) (compare LSU HPX)
- Uses novel (relatively? check X10, Sterling HPX) high-performance implementations of composable distributed termination detection to support convenient, overarching high-level control-flow constructs
- Provides well-defined semantics for system-level distributed operations: asynchronous entity creation and destruction, reductions, broadcasts, ...
- Integrates with and supports an ecosystem of performance and correctness analysis tools, such as Projections and Nvidia's tools (anything from Teuchos or Kokkos?)
- Instruments application execution, including internal sub-structure, to support advanced load balancing development (cross-cite WIP LB paper? Include LB work?)