feat: teletext formatting #1384

tobbee · 2024-04-12T07:53:52Z

This PR adds parsing of teletext styling, and rendering of the styling in output TTML and WebVTT subtitle tracks.

It is split into three commits, one for parsing teletext, one for TTML (stpp) generation and a third for WebVTT (wvtt) generation.

Beyond unit tests, I've used the sample https://drive.google.com/file/d/19ZYsoeUfH85gEilQkaAdLbPhC4CxhDEh/view?usp=sharing which has rather advanced subtitling with two separate rows at the same time, where one is left aligned and another is right aligned. This necessitates two parallel cues to be rendered. It also has some colored text.

This should solve #1335.

Commit 1: parse teletext styling and formatting

Extend the teletext parser to parse the teletext styling and formatting.
This includes translating rows into regions, calculating alignment
from start and stop position of the text, and extracting text and
background colors.

The colors are limited to full lines.
Both lines and regions are propagated in the TextSample structures.
This is because the number of lines may differ from different sources.
For teletext, there are 24 rows, but they are essentially always
used with double height, so the number of output lines is 12
from 0 to 11.
There are also corresponding regions are denoted "ttx_R",
where R is an integer row number. A renderer can use either
the line number or the region ID to render the text.

Commit 2: ttml generation for teletext to EBU-TT-D

Add support to render teletext input in EBU-TT-D (IMSC-1) format.
This includes appropriate regions ttx_0 to ttx_11 signalled
in the TextSamples, alignment and text and background colors.

The general TTML output has been changed to always include
metadata, layout, and styling nodes, even if they are empty.

EBU-TT-D is detected by the presence of "ttx_?" regions in the
samples. If detected, extra TTML elements will be added and
the EBU-TT-D linePadding used as well.

Appropriate styles for background and text colors are generated
depending on the color and backgroundColor attributes in the
text fragments.

Commit 3 fix: adapt WebVTT output to teletext TextSample.

Teletext input generates both a region with prefix ttx_
and a floating point line number (e.g. 9.5) in the
range 0 to 11.5 (due to input 0-23 as double lines).

The output is adopted to drop such regions
and convert the line number to an integer
since the standard only used floats for percent
values but not for plain line numbers.

cosmin

Please rebase as this will include integration tests in CI and hopefully fix the build failures on this PR on Windows and OS X.

packager/media/formats/mp2t/es_parser_teletext.cc

Extend the teletext parser to parse the teletext styling and formatting. This includes translating rows into regions, calculating alignment from start and stop position of the text, and extracting text and background colors. The colors are limited to full lines. Both lines and regions are propagated in the TextSample structures. This is because the number of lines may differ from different sources. For teletext, there are 24 rows, but they are essentially always used with double height, so the number of output lines is 12 from 0 to 11. There are also corresponding regions are denoted "ttx_R", where R is an integer row number. A renderer can use either the line number or the region ID to render the text.

Add support to render teletext input in EBU-TT-D (IMSC-1) format. This includes appropriate regions ttx_0 to ttx_11 signalled in the TextSamples, alignment and text and background colors. The general TTML output has been changed to always include metadata, layout, and styling nodes, even if they are empty. EBU-TT-D is detected by the presence of "ttx_?" regions in the samples. If detected, extra TTML elements will be added and the EBU-TT-D linePadding used as well. Appropriate styles for background and text colors are generated depending on the color and backgroundColor attributes in the text fragments.

Teletext input generates both a region with prefix ttx_ and a floating point line number (e.g. 9.5) in the range 0 to 11.5 (due to input 0-23 as double lines). The output is adopted to drop such regions and convert the line number to an integer since the standard only used floats for percent values but not for plain line numbers.

cosmin · 2024-04-22T16:16:31Z

Looks like the existing TTML integration tests are failing. You can run the tests locally with python3 build/packager/packager_test.py. If the expectation files need to be updated due to improved behavior, you can run python3 build/packager/packager_test.py --test_update_golden_files and then verify that all the produced differences are expected.

…ents

tobbee · 2024-04-23T10:34:15Z

@cosmin Thanks for the note. I only made the unit tests work and missed the integration test.

I've now updated all the test TTML assets. The main difference is that the head element of all TTML
output now contains head, styling, and layout elements. In the test material these are empty,
but I don't think it is worth the effort to remove them if empty, since they are standard elements of TTML.

cosmin · 2024-04-23T22:38:04Z

@tobbee thank you, if all the CI jobs pass I'll go ahead and merge this

cosmin · 2024-04-24T00:42:00Z

The build is failing on Windows

D:\a\shaka-packager\shaka-packager\packager\media\formats\ttml\ttml_generator.cc(59,14): error C2220: the following warning is treated as an error [D:\a\shaka-packager\shaka-packager\build\packager\media\formats\ttml\ttml.vcxproj]
D:\a\shaka-packager\shaka-packager\packager\media\formats\ttml\ttml_generator.cc(59,14): warning C4305: 'initializing': truncation from 'double' to 'float' [D:\a\shaka-packager\shaka-packager\build\packager\media\formats\ttml\ttml.vcxproj]

tobbee · 2024-04-25T05:39:21Z

I talked to a real subtitle expert, and we should suppress the warning for bad data length, since it is stuffing that should be there if the TS stream is compliant with the DVB teletext spec. Apparently, the spec says that one cannot stuff on PES level, but must fill an integral number of TS packets with teletext PES data. I checked and all the payload bytes of this trailing chunk of 136 bytes are indeed stuffing 0xff in my test stream.

I'll update this PR.

tobbee · 2024-04-25T20:09:00Z

Turns out that the stuffing has a different data_unit_id, so by first checking that value before the length, we get rid of the warnings and only have an error if a teletext data unit has wrong length. I added another commit to fix that so this PR should now be fine for merging.

🤖 I have created a release *beep* *boop* --- ## [3.1.0](v3.0.4...v3.1.0) (2024-05-03) ### Features * add missing DASH roles from ISO/IEC 23009-1 section 5.8.5.5 ([#1390](#1390)) ([fe885b3](fe885b3)) * get start number from muxer and specify initial sequence number ([#879](#879)) ([bb104fe](bb104fe)) * teletext formatting ([#1384](#1384)) ([4b5e80d](4b5e80d)) --- This PR was generated with [Release Please](https://github.com/googleapis/release-please). See [documentation](https://github.com/googleapis/release-please#release-please).

cosmin reviewed Apr 19, 2024

View reviewed changes

packager/media/formats/mp2t/es_parser_teletext.cc Outdated Show resolved Hide resolved

tobbee added 5 commits April 21, 2024 23:11

chore: update CONTRIBUTORS

97aaad3

fix: add warning if teletext data length is bad

7d1aef2

tobbee force-pushed the teletext-formatting branch from a957764 to 7d1aef2 Compare April 21, 2024 21:19

fix: update TTML testdata with (empty) metadata, styling, layout elem…

8203178

…ents

cosmin approved these changes Apr 23, 2024

View reviewed changes

fix: declare number as explicit float

6970703

tobbee marked this pull request as draft April 25, 2024 05:45

fix: check teletext data_unit_id before asserting length

4f19fa2

tobbee marked this pull request as ready for review April 25, 2024 20:09

cosmin added this to the v3.1 milestone Apr 26, 2024

cosmin added type: enhancement New feature or request component: text The issue involves text streams (subtitles or captions) labels Apr 26, 2024

cosmin merged commit 4b5e80d into shaka-project:main Apr 29, 2024
38 checks passed

shaka-bot mentioned this pull request Apr 29, 2024

chore(main): release 3.1.0 #1391

Merged

cosmin mentioned this pull request Apr 29, 2024

Queries on: Fixed position and style formatting for EBU-TT-D subtitles #1335

Closed

github-actions bot added the status: archived Archived and locked; will not be updated label Jun 28, 2024

github-actions bot locked as resolved and limited conversation to collaborators Jun 28, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: teletext formatting #1384

feat: teletext formatting #1384

tobbee commented Apr 12, 2024

cosmin left a comment

cosmin commented Apr 22, 2024

tobbee commented Apr 23, 2024 •

edited

Loading

cosmin commented Apr 23, 2024

cosmin commented Apr 24, 2024

tobbee commented Apr 25, 2024

tobbee commented Apr 25, 2024 •

edited

Loading

feat: teletext formatting #1384

feat: teletext formatting #1384

Conversation

tobbee commented Apr 12, 2024

Commit 1: parse teletext styling and formatting

Commit 2: ttml generation for teletext to EBU-TT-D

Commit 3 fix: adapt WebVTT output to teletext TextSample.

cosmin left a comment

Choose a reason for hiding this comment

cosmin commented Apr 22, 2024

tobbee commented Apr 23, 2024 • edited Loading

cosmin commented Apr 23, 2024

cosmin commented Apr 24, 2024

tobbee commented Apr 25, 2024

tobbee commented Apr 25, 2024 • edited Loading

tobbee commented Apr 23, 2024 •

edited

Loading

tobbee commented Apr 25, 2024 •

edited

Loading