Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix CUDA int8x4 vectorize #3928

Merged
merged 5 commits into from
Sep 13, 2019
Merged

Fix CUDA int8x4 vectorize #3928

merged 5 commits into from
Sep 13, 2019

Conversation

llehtahw
Copy link
Contributor

  • fix CodeGenCUDA::PrintVecElemStore and CodeGenCUDA::PrintVecElemLoad
  • fix accumulation of shared/local memory usage with vector types
    • test updated

@vinx13 @nishi-t @tqchen

@tqchen
Copy link
Member

tqchen commented Sep 12, 2019

@vinx13 please manage this PR

@vinx13 vinx13 merged commit 195973c into apache:master Sep 13, 2019
@vinx13
Copy link
Member

vinx13 commented Sep 13, 2019

Thanks @llehtahw this is merged

@llehtahw llehtahw deleted the fix-int8x4-vectorize branch September 13, 2019 01:15
wweic pushed a commit to wweic/tvm that referenced this pull request Sep 16, 2019
* Fix int8x4 vectorize

* Fix gpu shared/local memory accumulate

* Add test_shared_memory for int8x4

* Adjust test format

* Fix cpplint
wweic pushed a commit to wweic/tvm that referenced this pull request Sep 16, 2019
* Fix int8x4 vectorize

* Fix gpu shared/local memory accumulate

* Add test_shared_memory for int8x4

* Adjust test format

* Fix cpplint
wweic pushed a commit to neo-ai/tvm that referenced this pull request Sep 16, 2019
* Fix int8x4 vectorize

* Fix gpu shared/local memory accumulate

* Add test_shared_memory for int8x4

* Adjust test format

* Fix cpplint
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants