-
Notifications
You must be signed in to change notification settings - Fork 2.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[WIP] add buffered chunked streaming for nemo force aligner #6185
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank you for this PR Slyne. Please modify the FrameBatchASR parameter as described in the comment.
cc @jbalam-nv for visibility on changes to streaming_utils. I don't think they will break anything. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
Signed-off-by: Slyne Deng <slyned@nvidia.com>
Signed-off-by: Slyne Deng <slyned@nvidia.com>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM, this is the same as before
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Overall it looks awesome ! Minor comments, feel free to ignore for now
frame_buffers = self.frame_bufferer.get_buffers_batch() | ||
|
||
while len(frame_buffers) > 0: | ||
self.frame_buffers += frame_buffers[:] | ||
self.data_layer.set_signal(frame_buffers[:]) | ||
self._get_batch_preds() | ||
self._get_batch_preds(keep_logits) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is it possible to avoid changing signatures of these functions ? Ie set a bool value from config or some other way (class arg or setter function) and rest of the functions just use that ?
return hypothesis | ||
|
||
all_logits = [] | ||
for log_prob in self.all_logits: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Could you put this in a function? I feel like it gets repeated a lot
) * add nfa buffered streaming Signed-off-by: Slyne Deng <slyned@nvidia.com> * restore to previous __iter__ function Signed-off-by: Slyne Deng <slyned@nvidia.com> --------- Signed-off-by: Slyne Deng <slyned@nvidia.com> Co-authored-by: Slyne Deng <slyned@nvidia.com> Co-authored-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Signed-off-by: hsiehjackson <c2hsieh@ucsd.edu>
What does this PR do ?
To support buffered chunked streaming inference for nemo neural force aligner.
Collection: [Note which collection this PR will affect]
ASR
Changelog
FrameBatchASR
to keep the inference logits.Usage
Before your PR is "Ready for review"
Pre checks:
PR Type:
If you haven't finished some of the above items you can still open "Draft" PR.
Who can review?
Anyone in the community is free to review the PR once the checks have passed.
Contributor guidelines contains specific people who can review PRs to various areas.
Additional Information