-
Notifications
You must be signed in to change notification settings - Fork 448
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
GSOC - Finish CEA-708 support #3
Comments
CEA-708 is impossible to support 100% exporting to .srt or transcript because it has a number of features (windows to name one) that just don't have a correlation in those formats. Still, for most of the real world data we can do a fairly good job. First order of business is fix the timing. It doesn't work, it's off by around 3 seconds. Probably not to hard to solve, just read the specs paying attention :-) Expect for that, I believe the internal status of the decoder is correct and it supports all the 708 commands (at least that I saw ever in use). What's missing is correctly exporting that status to files. |
Assigned to bigharshrag since he's the 708 man this GSoC :-) kisselef is mentoring. |
Additional info (pasted from a separate issue, I'm going to merge 708 stuff here). Someone just sent a number of useful samples. They're available in /repository/Cristiano708 MPEG-PS containing both CEA608 and CEA708 captions MPEG-TS containing both CEA608 and CEA708 captions CEA608 TTML file generated by Adobe Premiere (CEA608 track) CEA708 TTML file (slightly different from the CEA608 TTML): SCC File (CEA608) MCC File (CEA708) |
Additional: #178 |
Doesn't CEA-608 support case? |
In theory yes (check the character set here: https://en.wikipedia.org/wiki/EIA-608#Characters ) But if I recall correctly the very first decoders were uppercase only (I can't find the technical reference to support this claim though) but: https://audio-accessibility.com/news/2018/10/all-caps-vs-mixed-case-type-for-captions/ Also, this: |
Standard here: https://shop.cta.tech/products/line-21-data-services I'm assigning this to @PunitLodha since he's going to be working on it during GSoC 2021. |
I'm closing this since 708 is now in reasonably good shape now that @PunitLodha killed the known bugs :-) |
EIA-708 is the "new" standard for closed captioning. While the specification has been around for some years and support for it is mandatory in the US for both TV receivers and stations, until very recently almost all stations have just converted their CEA-608 data to 708; this means that none of the 708 features have actually been used, and you still see many captions in all uppercase (to mention just one thing). This is starting to change though, so it makes sense for CCExtractor to fully implement a 708 decoder. Some work was done already, and you can actually see 708 output in CCExtractor in debug mode. But it needs to be completed by adding the actual export features.
The text was updated successfully, but these errors were encountered: