Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add mixed precision ilu0 on gpu #5552

Draft
wants to merge 23 commits into
base: master
Choose a base branch
from

Conversation

multitalentloes
Copy link

@multitalentloes multitalentloes commented Aug 21, 2024

This PR will adds two mixed precision versions of ILU0 on GPU.
Both store compute the ILDU factorization as double, but casts it to floats afterwards, with the thinking that the apply will be faster if the matrix requires less memory movement. The two mixed precision variants differ in whether they do the apply computations in float or double as well.

@multitalentloes multitalentloes force-pushed the add_mixed_precision_ilu0_and_dilu_on_gpu branch from 2b0dfad to d292f24 Compare August 23, 2024 12:09
@multitalentloes multitalentloes changed the title Add mixed precision ilu0 and dilu on gpu Add mixed precision ilu0 on gpu Aug 23, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant