AIRRtToNpuPass SHIM DMA BD optimization #550

Merged

erwei-xilinx
Collaborator

  • When splitting any wrap greater than 1023 into two wraps, use the largest integer factor as the outer wrap, so that the new stride is smaller and less likely to exceed 1M.
  • Code quality improvements.
  • Avoid the SymbolTable::getSymbolUses method, which turns out to be quite slow on large IRs.
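
The wrap-splitting rule in the first bullet can be illustrated with a small standalone sketch. This is not the code from the pass. The names `SplitWrap`, `splitLargeWrap`, `strideWithinLimit`, `kMaxWrap` and `kMaxStride` are hypothetical, the limits 1023 and 1M are taken from the description above (with 1M assumed to mean 2^20 and "within limit" assumed to mean less than or equal), and the sketch assumes the new outer stride equals the inner wrap times the original stride.

```cpp
#include <cassert>
#include <cstdint>
#include <optional>

// Assumed limits, taken from the PR description (exact units are an assumption).
constexpr int64_t kMaxWrap = 1023;
constexpr int64_t kMaxStride = int64_t{1} << 20;  // "1M", assumed to be 2^20

// Hypothetical result type: one (wrap, stride) dimension split into two.
struct SplitWrap {
  int64_t outerWrap;
  int64_t outerStride;
  int64_t innerWrap;
  int64_t innerStride;
};

// Splits a dimension whose wrap exceeds kMaxWrap into an outer and an inner
// dimension with outerWrap * innerWrap == wrap. The outer wrap is the largest
// integer factor of `wrap` that is <= kMaxWrap, which makes the inner wrap,
// and therefore the outer stride (innerWrap * stride), as small as possible.
// Returns std::nullopt if no split is needed (wrap <= kMaxWrap) or if no
// factorization keeps both wraps within kMaxWrap.
std::optional<SplitWrap> splitLargeWrap(int64_t wrap, int64_t stride) {
  assert(wrap > 0 && stride > 0);
  if (wrap <= kMaxWrap)
    return std::nullopt;  // nothing to split
  for (int64_t outer = kMaxWrap; outer >= 2; --outer) {
    if (wrap % outer != 0)
      continue;
    int64_t inner = wrap / outer;
    // A smaller outer factor would only make the inner wrap larger,
    // so if this one does not fit, none will.
    if (inner > kMaxWrap)
      return std::nullopt;
    return SplitWrap{outer, inner * stride, inner, stride};
  }
  return std::nullopt;  // wrap has no factor in [2, kMaxWrap], e.g. a prime
}

// Checks the assumed stride limit on both resulting dimensions.
bool strideWithinLimit(const SplitWrap &s) {
  return s.outerStride <= kMaxStride && s.innerStride <= kMaxStride;
}
```

Under these assumptions, a wrap of 2048 with stride 512 splits into an outer wrap of 512 with stride 2048 and an inner wrap of 4 with stride 512. Choosing 4 as the outer wrap instead would give an outer stride of 262144, which is much closer to the 1M limit.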

erwei-xilinx requested a review from newling on April 25, 2024 01:07
Collaborator

newling left a comment

Thanks!

erwei-xilinx merged commit 4faaa09 into Xilinx:main on April 25, 2024
9 checks passed
erwei-xilinx deleted the improve_aiex_npu_dma_gemm_size_limit branch on April 25, 2024 16:08