[DRAFT] Feature/merge changes from upstream #3

mulhod · 2022-02-12T05:47:48Z

In this branch (which is a branch from the current master branch in this fork), I merged changes from the upstream master branch and also incremented the version of the Python bindings to 0.3.42. Check out the 3 commits at the very bottom to see the extra changes I put in. The merged-in changes help with the thread safety issue and I am able to get similar times for each request.

I will make this a draft PR for now until I can do some more testing with a running Sparkle engine. Thus far, I have only been testing this outside the engine using a custom-made script that runs the calculate_feature function concurrently. I can share this code if it will be interesting. With this script, I was able to reproduce the pattern we were seeing in the engine (some requests taking double the time). With my updates, concurrent executions are taking roughly the same time.

Batch GPU decoding

mulhod · 2022-02-14T04:51:15Z

Still running some tests with the new package and a running engine, but so far I can see that the running time of concurrent requests is pretty stable. Submitting 4 requests over and over again at the same time leads to total request times averaging about 45 seconds. The important detail here is that I am not seeing any request times taking roughly double that, as I was with the old package.

I also checked to see if the outputs being generated were still different (the non-deterministic issue). It appears that that has not been resolved. But, I wasn't really expecting that to be resolved with these changes, anyway.

desilinguist

Thanks for putting this together @mulhod! I was looking through the changes that are in here and it looks like the major change is now batch decoding that also seems to support the GPU which is cool! However, I couldn’t really find any changes that may be related directly to thread safety. Can you point me to where those are?

mulhod · 2022-02-14T16:27:30Z

@desilinguist Actually, I don't either. I was assuming that certain changes were already merged. Maybe they haven't yet. My first thought was to just merge in all changes since this branched off.

Anyway, I am now seeing a new pattern emerge that was unexpected and I now believe was making it seem as though the issue were resolved when it actually had not been.

Using the original package, I first tried to submit 1 request at a time just to make sure things were working. The requests were taking a stable amount of time and that's all that I noted. Then, when I submitted 3 requests at a time, I noted that certain requests were taking considerably longer than others. It looked like the same pattern from the engine context. However, the single requests were only taking about 30 seconds and the simultaneous requests were ranging from 30 seconds to a minute.

When I made the new package with the changes in this PR, I tried 4 requests simultaneously and noted that they were all taking about 55 seconds. So, I thought this was resolved because the timing was stable. However, I went back and tried with the original 0.3.29 package again to run the same experiment and found the same stability. And I could get the same instability with 3 simultaneous requests with the new package, actually.

Long story short, I don't believe this is resolving anything unfortunately. I will take a look at the PR that I thought was merged that related to thread safety issues. In the meantime, though, it could still be a good idea to update the package. It wouldn't do any harm, at least...

desilinguist · 2022-02-14T16:37:24Z

Thanks for the detailed update and for confirming my intuition @mulhod. It definitely makes sense to update the package with the latest upstream code! From what I remember, I think the thread-safety changes might have been in a fork ... may be we can incorporate them in our branch too?

mulhod · 2022-02-14T16:39:21Z

Yeah, I'm going to look into that shortly. I'll just leave this open/as a draft for now and work on it when I get a chance.

nshmyrev and others added 28 commits December 12, 2021 21:37

Batch recognizer draft

6977be7

Decoding works, results are empty yet

344e137

Reset lattice on endpoint

60f0396

Expose results in Python

848b2dc

Per-stream wait API

cb0f8e6

Bigger frames per chunk for our big models

93e81c3

Put the demo into main folder

72bf210

Compile without CUDA too

525b722

Merge pull request alphacep#800 from alphacep/batch

ed4c15b

Batch GPU decoding

Round times

5428d36

Update README with Japanese

70d5cbd

Merge branch 'master' of github.com:alphacep/vosk-api

64dfc65

Add Esperanto

a1eac01

Fix branch name and add implib dump

c320997

Don't close channel which not yet started

c6fab36

Add libs as dependencies in Makefile

9861be2

Implement wave chunking for cuda decoder

6f86944

Put stream information in a single structure

2135223

Set soname for Android library

b090341

Read list of files from arguments

d2c11a6

Add NLSML output

79b8395

Don't add space before string

a561c2d

Emtpy result should be also xml

f574d89

Rename according to Kaldi changes

1f447a8

Merge upstream master branch

a768afe

Change KaldiRecognizer reference to Recognizer in src/recognizer.cc

b63df75

Change KaldiRecognizer reference to Recognizer in src/vosk_api.cc

de0bec8

Increment version to 0.3.42

9732d1d

mulhod requested review from desilinguist and rutujaubale February 12, 2022 05:47

mulhod self-assigned this Feb 12, 2022

desilinguist reviewed Feb 14, 2022

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[DRAFT] Feature/merge changes from upstream #3

[DRAFT] Feature/merge changes from upstream #3

mulhod commented Feb 12, 2022

mulhod commented Feb 14, 2022

desilinguist left a comment

mulhod commented Feb 14, 2022 •

edited

Loading

desilinguist commented Feb 14, 2022

mulhod commented Feb 14, 2022 •

edited

Loading

[DRAFT] Feature/merge changes from upstream #3

Are you sure you want to change the base?

[DRAFT] Feature/merge changes from upstream #3

Conversation

mulhod commented Feb 12, 2022

mulhod commented Feb 14, 2022

desilinguist left a comment

Choose a reason for hiding this comment

mulhod commented Feb 14, 2022 • edited Loading

desilinguist commented Feb 14, 2022

mulhod commented Feb 14, 2022 • edited Loading

mulhod commented Feb 14, 2022 •

edited

Loading

mulhod commented Feb 14, 2022 •

edited

Loading