[Ansor][FLAKY] Bug fix for compute at mutation error #6557
Have you tried building a debug version of TVM and using |
The problem is that this is not always reproducible. The only sure thing is that the bug is caused by |
One thing you could do is remove |
I'll merge this PR first to remove the CI flakiness. We should definitely keep digging into InitChangeComputeLocation() to find the root cause.
Thanks @jcf94 @FrozenGene
Thanks.
Catching a general exception like this is not good for future maintenance. We should dig deeper to find the underlying cause. #6512 does not change any logic; it just moves some functions to a different location. Can you confirm whether this is caused by #6512 or #6529?
Bug fix for #6548.
From the error log:
we can see that the test failure was caused by an InferBound error. @merrymercy
It seems this bug was introduced by #6512; I'm not sure which part of this mutation rule now produces a wrong result.
It is strange, though, that this bug is not always reproducible: it occurs with a very small probability (which may be caused by multithreading?). There may still be some conditions we have missed in our random generator design.
cc @tqchen @comaniac @FrozenGene
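For anyone trying to reproduce the intermittent failure locally, one possible approach is to sweep over explicit seeds until a failure shows up and then replay that seed. The sketch below is plain Python and does not use any TVM API: `flaky_mutation` is a hypothetical stand-in for a random mutation rule that fails rarely, and the helper names are made up for illustration. It also assumes the failure depends only on the seed, which would not hold if multithreading is involved.

```python
import random


def flaky_mutation(rng):
    """Hypothetical stand-in for a random mutation rule that rarely fails."""
    value = rng.randrange(1000)
    if value == 0:
        # Simulates the rare bad state that triggers the downstream error.
        raise ValueError("simulated bound-inference failure")
    return value


def fails_with_seed(seed):
    """Return True if the mutation fails when driven by this seed."""
    try:
        flaky_mutation(random.Random(seed))
    except ValueError:
        return True
    return False


def find_failing_seed(max_seeds=100000):
    """Sweep seeds in order and return the first one that fails, or None."""
    for seed in range(max_seeds):
        if fails_with_seed(seed):
            return seed
    return None


if __name__ == "__main__":
    seed = find_failing_seed()
    print("first failing seed:", seed)
    if seed is not None:
        # Replaying the same seed should fail again, which makes the
        # rare case deterministic enough to debug.
        print("replay fails again:", fails_with_seed(seed))
```

If the real failure can be pinned to a seed in the same way, the failing state can be inspected directly instead of being caught by a general exception handler.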