[RFC] Cross-Platform Refactor: CPU-only implementation #1021

Open
3 tasks
rickardp opened this issue Feb 3, 2024 · 2 comments

@rickardp
Contributor

rickardp commented Feb 3, 2024

Motivation

As we want to make this library portable, the first step would be to make 100% of it run correctly on CPU only (i.e., not requiring CUDA for any part of the functionality). This would serve two purposes:

  • Provide a baseline reference implementation for contributors porting the library to new platforms
  • Provide a fallback for hardware platforms that are only partially implemented

Proposed solution

  • Implement all the CUDA kernels in "normal" C++ (see the sketch after this list)
  • Make sure the unit tests all run on the CPU as well
  • Make sure unit test coverage is satisfactory
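
As a rough illustration of the first point, below is a minimal sketch of what a CUDA-style kernel could look like when ported to plain C++: a hypothetical blockwise absmax int8 quantization routine written as ordinary loops. The function name and signature are illustrative only and not taken from the existing codebase.

```cpp
// Hypothetical CPU port of a blockwise absmax int8 quantization kernel.
// All names here are illustrative; they do not mirror the real codebase.
#include <algorithm>
#include <cmath>
#include <cstddef>
#include <cstdint>
#include <cstdio>
#include <vector>

// Quantize each block of `block_size` floats to int8 using the block's absmax.
void quantize_blockwise_cpu(const float* in, std::int8_t* out, float* absmax,
                            std::size_t n, std::size_t block_size) {
    const std::size_t num_blocks = (n + block_size - 1) / block_size;
    for (std::size_t b = 0; b < num_blocks; ++b) {
        const std::size_t start = b * block_size;
        const std::size_t end = std::min(start + block_size, n);

        // Reduction that a CUDA kernel would perform per thread block.
        float m = 0.0f;
        for (std::size_t i = start; i < end; ++i)
            m = std::max(m, std::fabs(in[i]));
        absmax[b] = m;

        // Elementwise scaling that a CUDA kernel would perform per thread.
        const float scale = (m > 0.0f) ? 127.0f / m : 0.0f;
        for (std::size_t i = start; i < end; ++i)
            out[i] = static_cast<std::int8_t>(std::lround(in[i] * scale));
    }
}

int main() {
    std::vector<float> x = {0.5f, -1.0f, 0.25f, 2.0f, -0.125f, 1.5f};
    std::vector<std::int8_t> q(x.size());
    std::vector<float> absmax((x.size() + 3) / 4);
    quantize_blockwise_cpu(x.data(), q.data(), absmax.data(), x.size(), 4);
    for (std::int8_t v : q) std::printf("%d ", static_cast<int>(v));
    std::printf("\n");
}
```

A reference version like this is slow but easy to check against the CUDA kernels in the unit tests, which is all a baseline needs to do.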

Open questions

  • Which CPU architectures do we support (x86_64 and arm64 are givens, but are there any others)?
  • How do we deal with SIMD intrinsics? Build separate libraries for each SIMD architecture, or select at run time based on CPU features? (A run-time dispatch sketch follows this list.)
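
On the second question, one possible shape for run-time selection is sketched below, assuming GCC/Clang on x86_64 and their `__builtin_cpu_supports` builtin; the function names are hypothetical. A scalar fallback is always available, and a SIMD variant is picked through a function pointer at startup:

```cpp
// Sketch of run-time SIMD dispatch; names are hypothetical.
#include <cstddef>

// Scalar fallback, always available on every architecture.
void vector_add_scalar(const float* a, const float* b, float* out, std::size_t n) {
    for (std::size_t i = 0; i < n; ++i) out[i] = a[i] + b[i];
}

#if defined(__x86_64__) && (defined(__GNUC__) || defined(__clang__))
// AVX2 variant: the target attribute lets the compiler emit AVX2 code for this
// one function even if the rest of the library is built for baseline x86_64.
__attribute__((target("avx2")))
void vector_add_avx2(const float* a, const float* b, float* out, std::size_t n) {
    for (std::size_t i = 0; i < n; ++i) out[i] = a[i] + b[i];
}
#endif

using add_fn = void (*)(const float*, const float*, float*, std::size_t);

// Pick an implementation once, based on CPU features detected at run time.
add_fn select_vector_add() {
#if defined(__x86_64__) && (defined(__GNUC__) || defined(__clang__))
    if (__builtin_cpu_supports("avx2")) return vector_add_avx2;
#endif
    return vector_add_scalar;
}
```

The alternative from the same bullet, shipping one library per SIMD level and choosing at load time, avoids per-function dispatch but multiplies the build matrix; which is better probably depends on how many kernels end up needing intrinsics at all.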

@Titus-von-Koeller Feel free to edit this issue as you see fit, for example if you want a different structure for it.

@simepy

simepy commented Sep 6, 2024

@rickardp Where are we on this feature? Is some part of it already working, or are there other threads discussing it? There are not many comments here.

I'm especially interested in arm64 CPU-only support.

@rickardp
Contributor Author

Hi @simepy, sorry, not much to add here still. I am still up for contributing towards this when 1) I have time to do so and 2) the dependencies that I do not have time to contribute myself are ready to use. More specifically, the idea is to take a gradual approach and use the reference implementation where MPS acceleration is not yet implemented. Currently, large parts of this codebase require CUDA, which does not run on Apple silicon, making a partial implementation virtually unusable.
