This repository has been archived by the owner on Dec 14, 2023. It is now read-only.
Podcast transcoding fails for some episodes (sample episode that fails: https://traffic.megaphone.fm/ADV5935473959.mp3?updated=1628579716).
To make transcriptions work, we remove video streams from incoming episodes if we find any:
backend/apps/podcast-transcribe-episode/src/python/podcast_transcribe_episode/transcode.py
Lines 74 to 77 in f32b21b
Whether or not the episode has video streams is determined here:
backend/apps/podcast-transcribe-episode/src/python/podcast_transcribe_episode/media_info.py
Lines 184 to 185 in f32b21b
But it turns out that quite a few episodes have their static thumbnail attached as a "video" stream, e.g.:
$ ffmpeg -i ADV5935473959.mp3
<...>
[mp3 @ 0x55f3de42a2c0] Skipping 1 bytes of junk at 62145.
[mp3 @ 0x55f3de42a2c0] Estimating duration from bitrate, this may be inaccurate
Input #0, mp3, from 'ADV5935473959.mp3':
Metadata:
title : EVERYTHING YOU EVER WANTED TO KNOW ABOUT COVID THAT THE GOVERNMENT WON'T TELL YOU
lyrics-ENG : <p>INTRODUCTION; WHY OBESITY IS A BIG RISK FACTOR; ZINC AND ACTIVATORS; NUTRACEUTICALS AND BOTANICALS; GARLIC, A SUPERFOOD</p>
album : The Michael Savage Show
genre : Podcast
date : 2021
Duration: 00:59:06.10, start: 0.000000, bitrate: 192 kb/s
Stream #0:0: Audio: mp3, 44100 Hz, mono, fltp, 192 kb/s
Stream #0:1: Video: mjpeg (Progressive), yuvj420p(pc, bt470bg/unknown/unknown), 500x500 [SAR 72:72 DAR 1:1], 90k tbr, 90k tbn, 90k tbc (attached pic)
Metadata:
title : image
comment : Other
At least one output file must be specified
(That's Stream #0:1 here.)
FFmpeg advises us to "add a trailing '?' to the map", but that probably won't work with the speech-to-text engine, so let's remake transcode_file_if_needed() to remove all non-audio streams, e.g. video, attached JPEGs, text files, etc. (one can attach quite a few things to media files: https://ffmpeg.org/doxygen/trunk/group__lavu__misc.html#ga9a84bba4713dfced21a1a56163be1f48).
@jtotoole, could you:
1. Remake transcode_file_if_needed() to remove all non-audio streams instead of just video streams; and
2. Add a file to media-samples (which we use as a submodule: https://github.com/mediacloud/backend/tree/master/apps/podcast-transcribe-episode/tests/data) which would have similar structure to this sample file that's failing, i.e. a single audio stream and a "video" stream of type mjpeg, in order to confirm that we're in fact able to transcode those?
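For illustration only, here is a minimal sketch of what "remove all non-audio streams" could look like. It is not the code in transcode.py or media_info.py. The function names, the output file name, and the sample JSON are hypothetical, and the JSON is a hand-written stand-in for what `ffprobe -show_streams -of json` prints for a file like the failing one, not captured output. The sketch only builds the FFmpeg argument list and does not run anything.

```python
import json
from typing import Any, Dict, List


def non_audio_streams(ffprobe_json: str) -> List[Dict[str, Any]]:
    """Return every stream whose codec_type is not "audio".

    Expects the JSON printed by `ffprobe -show_streams -of json <file>`.
    """
    streams = json.loads(ffprobe_json).get("streams", [])
    return [s for s in streams if s.get("codec_type") != "audio"]


def audio_only_ffmpeg_args(input_path: str, output_path: str) -> List[str]:
    """Build an ffmpeg command line that keeps only the input's audio streams."""
    return [
        "ffmpeg",
        "-nostdin",        # never wait for interactive input
        "-y",              # overwrite the output file if it exists
        "-i", input_path,
        # Map audio streams only. Anything that is not mapped (video,
        # attached pictures, subtitles, data, attachments) is left out.
        # Without a trailing "?", ffmpeg fails if there is no audio at all.
        "-map", "0:a",
        output_path,
    ]


# Hand-written stand-in mirroring the two streams shown in the issue
# (assumption: not real ffprobe output).
SAMPLE_FFPROBE_JSON = json.dumps({
    "streams": [
        {"index": 0, "codec_name": "mp3", "codec_type": "audio"},
        {
            "index": 1,
            "codec_name": "mjpeg",
            "codec_type": "video",
            "disposition": {"attached_pic": 1},
        },
    ]
})

if __name__ == "__main__":
    for stream in non_audio_streams(SAMPLE_FFPROBE_JSON):
        print("would drop stream", stream["index"], stream["codec_name"])
    print(" ".join(audio_only_ffmpeg_args("ADV5935473959.mp3", "audio_only.mp3")))
```

Mapping by stream type rather than checking only for video means the same command covers attached JPEGs, text files, and other non-audio streams without enumerating them.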