Skip to content

Commit

Permalink
gpu: jit: conv_v2: refactor to reduce register usage
Browse files Browse the repository at this point in the history
- Introduced versioning for offsets to split offset allocations between
  main loop and epilogue
- Added loop_nest_t to simplify work with loop indices/bounds
  • Loading branch information
echeresh committed Apr 12, 2024
1 parent 50cc674 commit d419187
Showing 1 changed file with 239 additions and 192 deletions.
Loading

0 comments on commit d419187

Please sign in to comment.