Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Mfma prune #1577

Open
wants to merge 2 commits into
base: develop
Choose a base branch
from
Open

Mfma prune #1577

wants to merge 2 commits into from

Conversation

ethansaurusrex
Copy link

Try pruning configs that take too many registers as likely non-performant. This adds a tentative check in the XDLOPs specificCouldBePerformant() function to see if the output size (numOutputVectorElements) is greater than the number of available registers.

would cause register spills. Current value is hardcoded to 256, but should be generalized.
Copy link
Collaborator

@krzysz00 krzysz00 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

... I thought I'd sent the review in already

This seems fine - but do we have data that this is a good condition to impose? Are there cases where the number of outputs is > 256 but performance is good?

int64_t numOutputVectorElements = accelParams.numOutputVectorElements();

// would be best to have register count be a part of arch, is not necessarily totalVGPRPerEu
if(numOutputVectorElements > 256) {
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Let's go ahead and make that an arch field, then

@ethansaurusrex
Copy link
Author

... I thought I'd sent the review in already

This seems fine - but do we have data that this is a good condition to impose? Are there cases where the number of outputs is > 256 but performance is good?

I am doing another tuning run this weekend and testing it further, atm cannot tell. I should have results by Monday or Tuesday for that question.

@jerryyin jerryyin requested a review from djramic July 22, 2024 13:43
@krzysz00
Copy link
Collaborator

Did we ever get the experiment run?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants