Skip to content

Conversation

@Kh4ster
Copy link
Contributor

@Kh4ster Kh4ster commented Jun 3, 2025

This PR aims at fixing the invalid operation while there is a graph capture we sometimes see when using batch solve.

Solution is to use a regular instead of a non-blocking stream to make sure that if any operation (like a cudaFree from Thrust) is being launched on the default stream, it will wait for all other operations on other stream to finish first, preventing any cudaMalloc/Free while another stream might be doing a CUDA Graph capture.

@Kh4ster Kh4ster requested a review from a team as a code owner June 3, 2025 08:54
@Kh4ster Kh4ster requested review from aliceb-nv, chris-maes, hlinsen and kaatish and removed request for hlinsen and kaatish June 3, 2025 08:54
@Kh4ster Kh4ster added bug Something isn't working non-breaking Introduces a non-breaking change pdlp labels Jun 3, 2025
@Kh4ster Kh4ster removed the request for review from chris-maes June 3, 2025 14:02
@Kh4ster
Copy link
Contributor Author

Kh4ster commented Jun 3, 2025

/merge

@Kh4ster Kh4ster requested a review from chris-maes June 3, 2025 14:02
@rgsl888prabhu
Copy link
Collaborator

@Kh4ster Just wanted to confirm if this needs to go to 25.05

@Kh4ster
Copy link
Contributor Author

Kh4ster commented Jun 3, 2025

@rgsl888prabhu if we can have it in 25.05 yes that's better, that avoid having a hard to read / understand and impossible to fix error when running in batch mode

@rgsl888prabhu
Copy link
Collaborator

I think there are few other bug fixes that needs to go in, may be we can add this as well.

@tmckayus
Copy link
Contributor

tmckayus commented Jun 3, 2025

I think there are few other bug fixes that needs to go in, may be we can add this as well.

I agree

@Kh4ster
Copy link
Contributor Author

Kh4ster commented Jun 3, 2025

@rgsl888prabhu so we can merge this right?

@rgsl888prabhu rgsl888prabhu merged commit 6853c5e into branch-25.05 Jun 3, 2025
112 of 113 checks passed
@rgsl888prabhu rgsl888prabhu deleted the pdlp_fix_batch_graph branch October 29, 2025 16:26
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

bug Something isn't working non-breaking Introduces a non-breaking change pdlp

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants