add chunked sph file processing #367

videodanchik · 2021-08-12T15:49:26Z

This MR aims to provide an optimal way for reading SPHERE files, avoiding reading the whole file, when offset and duration are provided. These parameters can be specified directly in sph2pipe command which helps to load only chunk of audio in memory during lhotse.audio.read_audio function call.

pzelasko · 2021-08-12T15:51:17Z

lhotse/audio.py

@@ -15,6 +15,7 @@
 from typing import Any, Callable, Dict, Iterable, List, Mapping, NamedTuple, Optional, Sequence, Tuple, Union

 import numpy as np
+import soundfile as sf


Please stick to a local import, otherwise building the docs is going to fail (the import tries to load libsndfile.so into memory which is not available on read-the-docs servers).

Oh, sorry, will move it back

pzelasko · 2021-08-12T15:52:21Z

lhotse/audio.py

@@ -231,8 +232,7 @@ def from_file(
        else:
            try:
                # Try to parse the file using pysoundfile first.
-                import soundfile
-                info = soundfile.info(str(path))
+                info = sf.info(str(path))


is there a missing case for using sph_info?

Right, good catch.

lhotse/audio.py

pzelasko · 2021-08-12T15:57:08Z

Thanks that looks good! Please address my comments and we'll merge it.

Co-authored-by: Piotr Żelasko <petezor@gmail.com>

videodanchik · 2021-08-12T16:31:39Z

Import and SPHERE file handling in Recording are fixed.

videodanchik · 2021-08-13T13:58:50Z

@pzelasko Can we merge this as I want to add Fisher English that will depend on this PR?

pzelasko · 2021-08-13T14:34:13Z

Hmm, the tests are failing because there is no sph2pipe in the CI. Let me try to add it, and then we can merge.

pzelasko · 2021-08-13T19:33:00Z

@videodanchik I made a PR #370 that makes it easy to install sph2pipe in a way that enables Lhotse to auto-discover it. I'll wait for Dan to say what he thinks; as soon as it's merged, we can try re-run the CI for your PR again with no code changes -- it should help.

videodanchik · 2021-08-13T22:49:47Z

@pzelasko Oh, sorry that it caused this additional effort, I haven't even realized that you can't install sph2pipe via sudo apt install.

pzelasko · 2021-08-13T23:01:05Z

no worries, I think that it’s better for Lhotse users not to have to worry about how to find these esoteric programs…

pzelasko · 2021-08-14T12:34:32Z

Seems to work now, merging.

videodanchik added 2 commits August 12, 2021 17:38

add chunked sph file processing

5179291

Merge branch 'master' into feature/chunked-sph-file-load

e02f6ad

pzelasko reviewed Aug 12, 2021

View reviewed changes

videodanchik and others added 3 commits August 12, 2021 19:04

add check to subprocess run

4e5890f

Co-authored-by: Piotr Żelasko <petezor@gmail.com>

move back soundfile local import

40e8fda

add missing sph file handling in recording creation

0b889c7

Merge branch 'master' into feature/chunked-sph-file-load

bdf4ef2

Merge branch 'master' into feature/chunked-sph-file-load

6e35b57

pzelasko merged commit 82286dc into lhotse-speech:master Aug 14, 2021

pzelasko added this to the v0.8 milestone Aug 14, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add chunked sph file processing #367

add chunked sph file processing #367

videodanchik commented Aug 12, 2021

pzelasko Aug 12, 2021

videodanchik Aug 12, 2021

pzelasko Aug 12, 2021

videodanchik Aug 12, 2021

pzelasko commented Aug 12, 2021

videodanchik commented Aug 12, 2021

videodanchik commented Aug 13, 2021 •

edited

Loading

pzelasko commented Aug 13, 2021

pzelasko commented Aug 13, 2021

videodanchik commented Aug 13, 2021

pzelasko commented Aug 13, 2021

pzelasko commented Aug 14, 2021

add chunked sph file processing #367

add chunked sph file processing #367

Conversation

videodanchik commented Aug 12, 2021

pzelasko Aug 12, 2021

Choose a reason for hiding this comment

videodanchik Aug 12, 2021

Choose a reason for hiding this comment

pzelasko Aug 12, 2021

Choose a reason for hiding this comment

videodanchik Aug 12, 2021

Choose a reason for hiding this comment

pzelasko commented Aug 12, 2021

videodanchik commented Aug 12, 2021

videodanchik commented Aug 13, 2021 • edited Loading

pzelasko commented Aug 13, 2021

pzelasko commented Aug 13, 2021

videodanchik commented Aug 13, 2021

pzelasko commented Aug 13, 2021

pzelasko commented Aug 14, 2021

videodanchik commented Aug 13, 2021 •

edited

Loading