Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement accurate max block size in hipFuncGetAttributes() #1676

Merged
merged 9 commits into from
Mar 18, 2020

Conversation

jglaser
Copy link

@jglaser jglaser commented Nov 20, 2019

This PR takes ensures that the maxThreadsPerBlock returned by hipFuncGetAttributes is both a multiple of the warp size and that the register usage of the maximum block does not exceed the number of available registers.

Currently targeting the 2.9.x branch because I based it off of that branch (see #1675). Can probably be rebased on master.

Fixes #1662

@jglaser
Copy link
Author

jglaser commented Nov 21, 2019

It looks like the calculation was wrong in the first attempt, and I fixed it. however now my own tests are passing.

@mangupta
Copy link
Contributor

@jglaser Thanks for the PR. Can you please rebase the PR against the master branch?

@jglaser jglaser force-pushed the hipfuncgetattributes branch from d3654b6 to 93c4944 Compare November 22, 2019 17:53
@jglaser jglaser changed the base branch from roc-2.9.x to master November 22, 2019 17:54
@jglaser
Copy link
Author

jglaser commented Dec 14, 2019

@mangupta It would be great if someone could review, thanks. This will be a blocker once we enable CI for HOOMD blue w/HIP glotzerlab/hoomd-blue#541

@jglaser
Copy link
Author

jglaser commented Jan 23, 2020

Any progress on this?

mangupta
mangupta previously approved these changes Feb 26, 2020
@mangupta mangupta added the pr:resolve_conflicts PR requires additional testing label Feb 26, 2020
@mangupta mangupta added pr:needs_updates PR initially approved. But needs rework and removed pr:resolve_conflicts PR requires additional testing labels Mar 16, 2020
@mangupta mangupta merged commit b5e683a into ROCm:master Mar 18, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
pr:needs_updates PR initially approved. But needs rework
Projects
None yet
Development

Successfully merging this pull request may close these issues.

hipFuncGetAttributes returns inaccurate maxThreadsPerBlock
2 participants