noemotiovon
Collaborator

Introduce a high-performance mode for the CANN backend. In this mode, intermediate computation states are stored in FP16, which improves execution performance at the cost of slightly reduced precision.
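
To make the precision trade-off concrete, here is a minimal sketch in plain Python. It is not code from this PR and does not use any CANN or ggml API. The helper names `to_fp16`, `to_fp32`, and `accumulate` are hypothetical and exist only for this illustration. The sketch keeps a running sum as the "intermediate state" and rounds it to FP16 or FP32 after every step, which is an assumed stand-in for what storing intermediates in a narrower type does to accuracy.

```python
import struct


def to_fp16(x: float) -> float:
    """Round x to the nearest IEEE 754 half-precision value (struct format 'e')."""
    return struct.unpack("<e", struct.pack("<e", x))[0]


def to_fp32(x: float) -> float:
    """Round x to the nearest IEEE 754 single-precision value (struct format 'f')."""
    return struct.unpack("<f", struct.pack("<f", x))[0]


def accumulate(values, store):
    """Sum values, passing the running total through `store` after each addition.

    `store` models the precision of the intermediate buffer (hypothetical helper).
    """
    total = 0.0
    for v in values:
        total = store(total + v)
    return total


values = [0.1] * 1000   # the exact sum would be 100.0
fp32_sum = accumulate(values, to_fp32)
fp16_sum = accumulate(values, to_fp16)

print("FP32 intermediate:", fp32_sum, "error:", abs(fp32_sum - 100.0))
print("FP16 intermediate:", fp16_sum, "error:", abs(fp16_sum - 100.0))
```

Long accumulations like this one are close to a worst case for FP16, so the gap shown here is likely larger than what a typical model would see. The actual effect of the mode depends on which intermediates the backend stores in FP16 and should be measured on real workloads.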


github-actions bot added the labels documentation, ggml, and Ascend NPU on Sep 25, 2025.
@noemotiovon
Collaborator Author

Further discussion at #16251
