-
Notifications
You must be signed in to change notification settings - Fork 126
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Introduces new operator to get the lemma logits for factored vocabulary models for GPU inference #776
Open
rhenry-nv
wants to merge
18
commits into
marian-nmt:master
Choose a base branch
from
rhenry-nv:factorMaxesOp
base: master
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Commits on Dec 11, 2020
-
Introduces a special operator to handle getting factor logits for inf…
…erence when processing the lemmas
Configuration menu - View commit details
-
Copy full SHA for 1100596 - Browse repository at this point
Copy the full SHA 1100596View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8694fd9 - Browse repository at this point
Copy the full SHA 8694fd9View commit details -
Configuration menu - View commit details
-
Copy full SHA for 0fd9aa1 - Browse repository at this point
Copy the full SHA 0fd9aa1View commit details -
Configuration menu - View commit details
-
Copy full SHA for d700727 - Browse repository at this point
Copy the full SHA d700727View commit details -
Configuration menu - View commit details
-
Copy full SHA for f7aba6d - Browse repository at this point
Copy the full SHA f7aba6dView commit details -
Removes sync before free in addFactorMaxes. I think it is not needed …
…since the allocator has a memory pool that it manages for it won't get released by a cuda free. Additionally, two kernels may get the same pointer but they cannot execute concurrently since a single thread does not launch concurrent kernels. Since there is an allocator per thread, this means that no two kernels can ever race on the same pointer (I think). I have not seen any issues after removing this sync
Configuration menu - View commit details
-
Copy full SHA for 7aba3b4 - Browse repository at this point
Copy the full SHA 7aba3b4View commit details -
WIP - Rework addFactorMaxes. Starts splitting it into two kernels to …
…expose more parallelism when adding into the lemmas
Configuration menu - View commit details
-
Copy full SHA for baab83f - Browse repository at this point
Copy the full SHA baab83fView commit details -
Configuration menu - View commit details
-
Copy full SHA for f2d77e0 - Browse repository at this point
Copy the full SHA f2d77e0View commit details -
Configuration menu - View commit details
-
Copy full SHA for 706bbe7 - Browse repository at this point
Copy the full SHA 706bbe7View commit details -
Configuration menu - View commit details
-
Copy full SHA for edebe59 - Browse repository at this point
Copy the full SHA edebe59View commit details
Commits on Dec 12, 2020
-
Configuration menu - View commit details
-
Copy full SHA for 07de32d - Browse repository at this point
Copy the full SHA 07de32dView commit details -
Configuration menu - View commit details
-
Copy full SHA for eb49d6c - Browse repository at this point
Copy the full SHA eb49d6cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 975682b - Browse repository at this point
Copy the full SHA 975682bView commit details
Commits on Dec 13, 2020
-
Configuration menu - View commit details
-
Copy full SHA for f0ffee1 - Browse repository at this point
Copy the full SHA f0ffee1View commit details
Commits on Dec 14, 2020
-
Configuration menu - View commit details
-
Copy full SHA for 4f57fdb - Browse repository at this point
Copy the full SHA 4f57fdbView commit details
Commits on Dec 15, 2020
-
Configuration menu - View commit details
-
Copy full SHA for de5449b - Browse repository at this point
Copy the full SHA de5449bView commit details
Commits on Mar 6, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 22d13b3 - Browse repository at this point
Copy the full SHA 22d13b3View commit details
Commits on Jul 9, 2021
-
1
Configuration menu - View commit details
-
Copy full SHA for 7be492a - Browse repository at this point
Copy the full SHA 7be492aView commit details
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.