Adds radio data recipe #1400

m-wiesner · 2024-10-10T03:29:32Z

This is the recipe for some radio data we collected and are able to distribute if someone emails us for it. We may possibly have a way of releasing it with some restrictions in the future, but for now it is just a prepare script. The paper it corresponds to is

https://aclanthology.org/2024.naacl-long.286.pdf

The main use is for geolocating speech, self-supervised model pretraining, or language id.

pzelasko

Thanks! Please fix the CI and the torchaudio import.

lhotse/recipes/radio.py

pzelasko · 2024-10-10T20:06:23Z

lhotse/recipes/radio.py

+from lhotse.supervision import SupervisionSegment, SupervisionSet
+from lhotse.utils import Pathlike
+
+set_ffmpeg_torchaudio_info_enabled(False)


One last request - don’t set these things in the global scope because it’ll be executed when Lhotse is imported. Move it to local scope (even better i think this one works like a context manager)

I think I fixed it now?

Almost - now you need to move to actual usage to inside the function that does the audio reading and use it with Python with statement

I actually forget why I started adding this in to all the Lhotse recipes. I think it fixed some problem I was having at some point, but I have no recollection of what that problem was. Do I actually need it? When I actually use it locally each time it gives me a ton of horrible logging messages.

You shouldn’t need it. We are using libsndfile now by default rather than torchaudio. Feel free to remove it if you prefer

That is what I ended up doing. I believe the current updated commit just removed this.

…enabled call. The recipe runs fine without it.

m-wiesner · 2024-10-21T16:19:41Z

Is there anything left to do for this one?

pzelasko · 2024-10-21T19:14:20Z

Looks good, thanks!

Adds radio data recipe

47c49df

pzelasko requested changes Oct 10, 2024

View reviewed changes

lhotse/recipes/radio.py Outdated Show resolved Hide resolved

m-wiesner added 2 commits October 9, 2024 23:59

Makes some small formatting changes

469ee97

Fixing black and isort formatting

a297416

pzelasko reviewed Oct 10, 2024

View reviewed changes

m-wiesner added 2 commits October 11, 2024 10:00

Fixes disable_ffmpeg_torchaudio_info to use contextmanager

6bdc207

Removes what appears to be an unnecessary set_ffmpeg_torchaudio_info_…

8361d32

…enabled call. The recipe runs fine without it.

pzelasko merged commit 25475d4 into lhotse-speech:master Oct 21, 2024
9 checks passed

pzelasko added this to the v1.28.0 milestone Oct 21, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adds radio data recipe #1400

Adds radio data recipe #1400

m-wiesner commented Oct 10, 2024

pzelasko left a comment

pzelasko Oct 10, 2024

m-wiesner Oct 11, 2024

pzelasko Oct 11, 2024

m-wiesner Oct 11, 2024

pzelasko Oct 11, 2024

m-wiesner Oct 11, 2024

m-wiesner commented Oct 21, 2024

pzelasko commented Oct 21, 2024

Adds radio data recipe #1400

Adds radio data recipe #1400

Conversation

m-wiesner commented Oct 10, 2024

pzelasko left a comment

Choose a reason for hiding this comment

pzelasko Oct 10, 2024

Choose a reason for hiding this comment

m-wiesner Oct 11, 2024

Choose a reason for hiding this comment

pzelasko Oct 11, 2024

Choose a reason for hiding this comment

m-wiesner Oct 11, 2024

Choose a reason for hiding this comment

pzelasko Oct 11, 2024

Choose a reason for hiding this comment

m-wiesner Oct 11, 2024

Choose a reason for hiding this comment

m-wiesner commented Oct 21, 2024

pzelasko commented Oct 21, 2024