[Operator][Backend] Add nvcc flags for faster math and update Attention schedule #221

Merged: 20 commits into hidet-org:main on May 11, 2023

@hjjq hjjq commented May 9, 2023

  • Make `-ftz=true` and `-prec-div=false` the default for all nvcc-compiled kernels
  • Update the Attention schedule template
  • Explicitly unroll repeat mappings by default when the extent is less than 4
  • Fix the erf test and increase the tolerance for the pool test
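
For readers unfamiliar with the two flags: `-ftz=true` flushes single-precision denormal values to zero, and `-prec-div=false` allows a faster, less precise single-precision division. Both trade a little accuracy for speed, which is why test tolerances may need loosening. The snippet below is only a minimal sketch of how such flags could be appended to an nvcc invocation. The helper `build_nvcc_command`, its parameters, the default architecture, and the other options on the command line are hypothetical and are not taken from this PR or from Hidet's code.

```python
from typing import List

# The two flags named in this PR. Both relax floating-point behaviour:
# -ftz=true flushes single-precision denormals to zero,
# -prec-div=false permits a faster, approximate single-precision division.
FAST_MATH_FLAGS = ["-ftz=true", "-prec-div=false"]


def build_nvcc_command(
    src_path: str,
    out_path: str,
    arch: str = "sm_80",      # assumed target architecture, for illustration only
    fast_math: bool = True,   # hypothetical switch; the PR makes the flags the default
) -> List[str]:
    """Hypothetical helper: assemble an nvcc invocation as an argument list."""
    command = ["nvcc", f"-arch={arch}", "-O3", "--shared", "-Xcompiler", "-fPIC"]
    if fast_math:
        command += FAST_MATH_FLAGS
    command += [src_path, "-o", out_path]
    return command


if __name__ == "__main__":
    # Print the command instead of running it, so no CUDA toolkit is needed.
    print(" ".join(build_nvcc_command("kernel.cu", "kernel.so")))
```

Returning an argument list rather than a shell string keeps the sketch easy to pass to `subprocess.run` without quoting issues.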

@hjjq hjjq changed the title [Operator] Add fast_math and update Attention schedule [Operator] Add nvcc flags for faster math and update Attention schedule May 11, 2023
@hjjq hjjq changed the title [Operator] Add nvcc flags for faster math and update Attention schedule [Operator][Backend] Add nvcc flags for faster math and update Attention schedule May 11, 2023
@hjjq hjjq merged commit 971bd01 into hidet-org:main May 11, 2023
vadiklyutiy pushed a commit that referenced this pull request Jul 22, 2024
Adding support for the operator `torch.as_tensor`, which was encountered
in #221

Also added more tests for `torch.argmax, torch.argmin` as discussed in
#234
vadiklyutiy pushed a commit that referenced this pull request Jul 22, 2024
…Tensor.new_full` (#238)

Adding two operators `torch.Tensor.max` and `torch.Tensor.new_full`
while attempting to compile models from TorchBench (mentioned in the
comments under #221).
vadiklyutiy pushed a commit that referenced this pull request Jul 23, 2024
Adding support for the operator `torch.as_tensor`, which was encountered
in #221

Also added more tests for `torch.argmax, torch.argmin` as discussed in
#234
vadiklyutiy pushed a commit that referenced this pull request Jul 23, 2024
…Tensor.new_full` (#238)

Adding two operators `torch.Tensor.max` and `torch.Tensor.new_full`
while attempting to compile models from TorchBench (mentioned in the
comments under #221).
vadiklyutiy pushed a commit that referenced this pull request Dec 26, 2024
Adding support for the operator `torch.as_tensor`, which was encountered
in #221

Also added more tests for `torch.argmax, torch.argmin` as discussed in
#234
vadiklyutiy pushed a commit that referenced this pull request Dec 26, 2024
…Tensor.new_full` (#238)

Adding two operators `torch.Tensor.max` and `torch.Tensor.new_full`
while attempting to compile models from TorchBench (mentioned in the
comments under #221).