
[RFC] CUDA code decoupling and directory refactoring #161954

@bjtuwjx

Description


For a long time, NVIDIA GPUs and the CUDA architecture have dominated the PyTorch ecosystem. However, as more vendors introduce their own high-performance AI chips, the current ecosystem reveals the following key issues:

  • Code coupling. The CUDA code is too tightly coupled with the PyTorch codebase, resulting in poor modularity and high maintenance costs.
  • Integration effort. Different hardware backends currently adopt different methods of integrating with PyTorch. The integration approaches and code lack standardization and consistency, leading to a significant amount of repetitive code and substantial integration effort.
  • Code migration. Because there is no specification for integration code, different hardware backends expose APIs with differing names and styles, imposing high code-migration costs on PyTorch users.

To address these issues, we propose an RFC covering the following work:

  • Decouple CUDA-related code from the main codebase at both inter-file and intra-file levels, reducing the PyTorch core framework's direct dependency on CUDA.
  • Propose a modularized and standardized directory hierarchy and consolidate all CUDA-related code within it as a reference for other third-party backend integration.
  • Redesign the build system to support standalone compilation of the CUDA backend, and develop a CMake wrapper toolkit to streamline the build process.

Please see the linked RFC document for more details.
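To make the third bullet's goal concrete: PyTorch already reserves the PrivateUse1 dispatch key for out-of-tree backends, and the decoupled CUDA layout proposed here is meant to serve as a reference for such integrations. Below is a minimal sketch of the user-facing side of that mechanism; the backend name "my_npu" is purely illustrative and not part of the RFC.

```python
# Sketch: renaming the reserved PrivateUse1 device type so an
# out-of-tree backend appears under a vendor-chosen name.
# "my_npu" is a hypothetical name, not something defined by this RFC.
import torch

# Map the PrivateUse1 device type to the custom backend name, so users
# can write torch.device("my_npu") instead of "privateuseone".
torch.utils.rename_privateuse1_backend("my_npu")

# After renaming, the custom name parses as an ordinary device type.
dev = torch.device("my_npu", 0)
print(dev.type, dev.index)
```

Actually running ops on such a device additionally requires registering kernels for the PrivateUse1 key (e.g. via `TORCH_LIBRARY_IMPL` in C++), which is the integration work the proposed directory hierarchy and build toolkit aim to standardize.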

cc @NmomoN @mengpenghui @fwenguang @cdzhan @1274085042 @PHLens @albanD

Metadata

Assignees: no one assigned

Labels: module: PrivateUse1, triaged (this issue has been looked at by a team member, and triaged and prioritized into an appropriate module)
