-
Notifications
You must be signed in to change notification settings - Fork 426
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BUG] Using -sc option crashes ccextractor #1115
Comments
I'm wanting to work on this. Any leads where to start off? |
Start by reproducing the issue locally.
…On Sun, Nov 3, 2019 at 5:24 AM sp2703 ***@***.***> wrote:
I'm wanting to work on this. Any leads where to start off?
—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub
<#1115?email_source=notifications&email_token=ABNMTWJTEU3QVCGOT4J2XADQR3GJJA5CNFSM4JBSNFSKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEC5SP3A#issuecomment-549136364>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ABNMTWPT7AIIVFOGTS6RBXTQR3GJJANCNFSM4JBSNFSA>
.
|
Patched here: |
Sentence case crash (-sc)
@cfsmp3 Carlos any chance of getting a new build? It's been a while and many fixes have gone in since the last release |
I'll bundle a new version after GSoC (in one month or so). These days I really don't have a lot of time on my hands I'm afraid. |
@cfsmp3 do you think it’s ready for a release? |
Well, there's lots of bugs, but there's no one doing active work these days, so they're not going to go away magically. |
@cfsmp3 were you able to get the new build released Carlos? |
@cfsmp3 bump on release |
@cfsmp3 can we expect a new release anytime soon? |
Around mid June
…On Sat, May 22, 2021, 08:05 rboy1 ***@***.***> wrote:
@cfsmp3 <https://github.com/cfsmp3> can we expect a new release anytime
soon?
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#1115 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ABNMTWLTPDHZRCMR6IV6CV3TO7B3XANCNFSM4JBSNFSA>
.
|
@cfsmp3 Mid June is here, looking forward to it :) |
https://github.com/CCExtractor/ccextractor/releases/tag/v0.89 :) Windows build is still WIP*. You can download the binaries here though (let me know if you can't): https://github.com/CCExtractor/ccextractor/suites/2983776538/artifacts/67339947 * We're working on a new installer and code signing, the latter is what's holding us back right now |
Awesome thanks. I noticed that it says it's compiled against tessdata 4.0alpha. Does it mean it won't work with tessdata 3? |
For Windows the libraries are embedded, so you're indeed stuck to that specific version. I noticed you had another comment a couple of minutes ago, but it seems to have vanished.
Was that when trying the standalone binary too, or caused by the GUI exe itself? Anyway, looks like we should add that dll to the generated artifacts too. |
Hmm, I tried using tessdata 3.04 and it seemed to work fine converting dvbsub to srt |
IIRC tessdata is not bound to the tesseract version, so that's indeed no problem :) |
On a side note I have some files with multiple dvbsub tracks but when I run ccextractor it only extracts the first track. Is there a way to get it to extract all tracks or maybe specify the track number? |
Maybe we should also figure out a way to build those libraries again from source :-) @Izaron did that work a few years ago and we haven't touched that ever since I think? |
Since tesseract 3 is no longer maintained at all, I think we should stick to 4 (which as can be see, supports the pattern recognition mode from v3, so no need to actually use v3). |
@cfsmp3 are you saying that we need to explicitly add --oem 0 to get it to work with Tesseract 3 because 0.89 is working (or am I missing something here). For future ref, wouldn't be better if ccextractor automatically detects if it's using Tesseract 3 or 4 with an option to override using the --oem? |
What I pasted comes from tesseract's website. v4 supports v3's legacy engine, so there's not reason to actually have v3 around at all. If you want to use the old system, just use --oem (If I remember correctly we do expose that argument in CCExtractor).
I don't want to support legacy versions of libraries. If the tesseract maintainers have decided to stop development of v3, what's the reason for us to bother supporting both? Just use v4 and use the legacy mode if it works better. |
Please prefix your issue with one of the following: [BUG], [PROPOSAL], [QUESTION].
CCExtractor version (using the --version parameter preferably) : 0.88
In raising this issue, I confirm the following (please check boxes, eg [X] - and delete unchecked ones):
My familiarity with the project is as follows (check one, eg [X] - and delete unchecked ones):
Necessary information
-sc
Additional information
Link to file which is causing the crash:
https://www.dropbox.com/s/4jiooj787e02kd3/CCExtractor%20crash.ts?dl=0
Output of cmd line:
The text was updated successfully, but these errors were encountered: