Make CUDA optional dependency #20

pelesh · 2023-10-20T04:29:52Z

This PR reduces internal dependencies on CUDA SDK libraries. Classes MatrixHandler and VectorHandler are refactored to allow CPU-only or CUDA implementation of matrix and vector operations to be selected at runtime. With these changes:

ReSolve can be built without CUDA
Hardware backend (CPU or CUDA) is selected at runtime
The code now builds on Linux and MacOS
Fixed bug in FGMRES destructor

The build configuration allows for multiple hardware backends to coexist. In theory, we should be able to add HIP backend and build it at the same time as CUDA backend. If no GPU backend is built, a dummy device backend needs to be built instead (see CpuMemory.hpp).

Known issues not addressed in this PR:

matvec function implemented in MatrixHandlerCuda class corrupts matrix descriptor object.
There are memory leaks in unit tests for CUDA implementations of matrix and vector handlers as CUDA workspace is not deleted.

… input.

cameronrutherford · 2023-10-20T17:56:46Z

@pelesh PNNL GitHub supports MacOS and Windows runners. Can we add that for ReSolve in GitHub actions?

Additionally, now ReSolve supports building on Linux w/ Spack in GitHub actions, so we should add a pipeline and testing for those builds without CUDA.

kswirydo

I thought about it for some time and there was probably an easier solution to make all lin alg operatios (in VectorHandler and MatrixHandler) run with various backends, but it would had been a CMake nightmare and not necessarily backward compatible. I think the stuff in this PR is pretty good. I had some minor comments. Also, why IndexPlusValue changed its name?

examples/r_KLU_KLU_standalone.cpp

resolve/vector/VectorHandler.cpp

tests/functionality/testKLU_Rf.cpp

cameronrutherford

With better docs will come enlightenment. LGTM

EDIT: We should make issues for:

Future work
Known bugs
CI/CD additions

resolve/matrix/MatrixHandler.cpp

resolve/vector/CMakeLists.txt

resolve/workspace/CMakeLists.txt

resolve/workspace/LinAlgWorkspace.hpp

tests/functionality/testKLU.cpp

ryandanehy · 2023-10-23T16:35:26Z

@pelesh PNNL GitHub supports MacOS and Windows runners. Can we add that for ReSolve in GitHub actions?

Additionally, now ReSolve supports building on Linux w/ Spack in GitHub actions, so we should add a pipeline and testing for those builds without CUDA.

Will that mean we have to setup a mirror to a PNNL github given this under ORNL?

pelesh · 2023-10-23T17:22:11Z

@pelesh PNNL GitHub supports MacOS and Windows runners. Can we add that for ReSolve in GitHub actions?
Additionally, now ReSolve supports building on Linux w/ Spack in GitHub actions, so we should add a pipeline and testing for those builds without CUDA.

Will that mean we have to setup a mirror to a PNNL github given this under ORNL?

I think so.

cameronrutherford · 2023-10-23T18:42:43Z

@pelesh PNNL GitHub supports MacOS and Windows runners. Can we add that for ReSolve in GitHub actions?
Additionally, now ReSolve supports building on Linux w/ Spack in GitHub actions, so we should add a pipeline and testing for those builds without CUDA.

Will that mean we have to setup a mirror to a PNNL github given this under ORNL?

Mirror from ORNL GitHub to PNNL GitLab should be sufficient without PNNL GitHub in between

pelesh added 14 commits October 16, 2023 17:33

Don't build CUDA examples and tests when CUDA is disabled.

d806abf

Create modular (and eventually portable) workspace.

5ddd386

Enforce dependencies between CMake options.

d8cf0e6

Working PIMPL in MatrixHandler; needs lots of cleanup.

e40c43d

Separate CPU and CUDA mmethods for csc2csr conversion.

6ad87aa

Method setValuesChanged in MatrixHandler now requires memory space as…

23edbf7

… input.

Take helper class for index-value pairs outside MatrixHandler sources.

edc7e4c

Complete PIMPL implementation for MatrixHandler.

678a324

Use PIMPL in VectorHandler class.

cec1aa7

Remove workspace factory.

1e8f84a

Put preprocessor guards around CUDA code in unit tests.

7a920be

Make CUDA optional dependency (lots of cleanup needed.

5c5bdd8

Fixes to CMake

977e1da

Code cleanup.

3037513

pelesh added the enhancement New feature or request label Oct 20, 2023

pelesh added this to the Hackathon milestone Oct 20, 2023

pelesh requested review from maksud, rothpc, kswirydo, cameronrutherford and ryandanehy October 20, 2023 04:29

pelesh modified the milestones: Hackathon, First Release Oct 20, 2023

kswirydo reviewed Oct 21, 2023

View reviewed changes

Fix I/O in examples.

126b16a

kswirydo approved these changes Oct 22, 2023

View reviewed changes

cameronrutherford approved these changes Oct 23, 2023

View reviewed changes

ryandanehy closed this Oct 23, 2023

pelesh reopened this Oct 23, 2023

pelesh merged commit 395267b into develop Oct 23, 2023
2 checks passed

pelesh deleted the no-cuda-dev branch October 30, 2023 15:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make CUDA optional dependency #20

Make CUDA optional dependency #20

pelesh commented Oct 20, 2023 •

edited

Loading

cameronrutherford commented Oct 20, 2023

kswirydo left a comment

cameronrutherford left a comment •

edited

Loading

ryandanehy commented Oct 23, 2023

pelesh commented Oct 23, 2023

cameronrutherford commented Oct 23, 2023

Make CUDA optional dependency #20

Make CUDA optional dependency #20

Conversation

pelesh commented Oct 20, 2023 • edited Loading

cameronrutherford commented Oct 20, 2023

kswirydo left a comment

Choose a reason for hiding this comment

cameronrutherford left a comment • edited Loading

Choose a reason for hiding this comment

ryandanehy commented Oct 23, 2023

pelesh commented Oct 23, 2023

cameronrutherford commented Oct 23, 2023

pelesh commented Oct 20, 2023 •

edited

Loading

cameronrutherford left a comment •

edited

Loading