[Operators] Conv2d gemm fp16 performance enhancement #285

Closed
wants to merge 37 commits into from

Conversation

Aalanli
Collaborator

@Aalanli Aalanli commented Jun 20, 2023

For pointwise (1x1) convolution, use matmul.
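
To illustrate why a pointwise convolution can be routed to matmul, here is a minimal pure-Python sketch. It is not the code from this PR: the function names are hypothetical, and it assumes a single image with no batch dimension, stride 1, no padding, and weights stored as `[C_out][C_in]`. With a 1x1 kernel, every output pixel is a dot product over input channels, so flattening the spatial dimensions turns the whole operation into one `[C_out, C_in] x [C_in, H*W]` matrix product.

```python
def pointwise_conv_direct(x, w):
    """Reference 1x1 convolution. x: [C_in][H][W], w: [C_out][C_in]."""
    c_in, h, wd = len(x), len(x[0]), len(x[0][0])
    return [
        [[sum(w[o][c] * x[c][i][j] for c in range(c_in)) for j in range(wd)]
         for i in range(h)]
        for o in range(len(w))
    ]


def matmul(a, b):
    """Plain matrix product. a: [M][K], b: [K][N] -> [M][N]."""
    k_dim, n_dim = len(b), len(b[0])
    return [
        [sum(row[k] * b[k][j] for k in range(k_dim)) for j in range(n_dim)]
        for row in a
    ]


def pointwise_conv_as_matmul(x, w):
    """Same result as pointwise_conv_direct, computed with one matmul."""
    c_in, h, wd = len(x), len(x[0]), len(x[0][0])
    # Flatten the spatial dimensions: [C_in][H][W] -> [C_in][H*W]
    x_flat = [[x[c][i][j] for i in range(h) for j in range(wd)]
              for c in range(c_in)]
    # [C_out][C_in] x [C_in][H*W] -> [C_out][H*W]
    y_flat = matmul(w, x_flat)
    # Restore the spatial layout: [C_out][H*W] -> [C_out][H][W]
    return [[row[i * wd:(i + 1) * wd] for i in range(h)] for row in y_flat]
```

In a real kernel the flatten and unflatten steps would be reshapes over contiguous memory rather than copies, which is what makes reusing an optimized fp16 matmul attractive for this case.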

@Aalanli
Collaborator Author

Aalanli commented Jun 20, 2023

Additionally, I removed an extra `cp_async` statement in `fp16_matmul`.

@Aalanli Aalanli closed this Jun 23, 2023
vadiklyutiy pushed a commit that referenced this pull request Jul 22, 2024
In parallel runs of the regression tests, the compilation server might
fail when checking/loading the config file.

This change handles the specific case where those files are checked and
found not to exist, but the subsequent attempt to create them raises
`FileExistsError`.

examples:
[1](https://github.com/CentML/playground/actions/runs/9601931045)
[2](https://github.com/CentML/playground/actions/runs/9601633511)

If you think this should be handled some other way, just let me know and
I'll make the change.
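
The race described above (a file is checked as missing, then another process creates it before this one does) can be sketched as follows. This is an illustrative sketch only, not the code from the referenced commit: `ensure_config` and its parameters are hypothetical names. It relies on two standard-library behaviours: `os.makedirs(..., exist_ok=True)` does not fail if the directory already exists, and `open(path, "x")` raises `FileExistsError` instead of overwriting an existing file.

```python
import os


def ensure_config(path, default_text=""):
    """Create `path` with `default_text` if it is missing, then return its contents.

    Hypothetical helper: tolerates another process creating the file between
    the existence check and the creation attempt.
    """
    # exist_ok=True avoids FileExistsError when a parallel run already made the directory.
    os.makedirs(os.path.dirname(path) or ".", exist_ok=True)
    try:
        # Mode "x" creates the file only if it does not exist yet, so there is
        # no separate check-then-create window to lose a race in.
        with open(path, "x") as f:
            f.write(default_text)
    except FileExistsError:
        # Another process won the race; keep its file instead of failing.
        pass
    with open(path) as f:
        return f.read()
```

One caveat with this approach: a reader may still see a file that the winning process has created but not finished writing, so a stricter version might write to a temporary file and rename it into place.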
vadiklyutiy pushed a commit that referenced this pull request Jul 23, 2024
vadiklyutiy pushed a commit that referenced this pull request Dec 26, 2024