Data points remain tuples #330

vincentqb · 2019-11-05T21:14:16Z

Since tuples are what has been most used across domains, we keep tuples as the item that datasets return for now.

for CommonVoice:

    # dictionary with keys:
    # client_id, path, sentence, up_votes, down_votes, age, gender, accent
    return waveform, sample_rate, dictionary

for LibriSpeech:

    return (
        waveform,
        sample_rate,
        utterance,
        int(speaker_id),
        int(chapter_id),
        int(utterance_id),
    )

for VCTK:

    return waveform, sample_rate, utterance, speaker_id, utterance_id

for YesNo:

    # labels = [int(c), ...]
    return waveform, sample_rate, labels
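A rough sketch of what the tuple return values above mean for calling code. `FakeYesNo` is a hypothetical stand-in, not the torchaudio class, and a plain list is used in place of the waveform tensor so the snippet runs without any audio files:

```python
class FakeYesNo:
    """Hypothetical stand-in for a dataset that returns tuples."""

    def __init__(self):
        # Placeholder items: a list stands in for the waveform tensor.
        self._items = [
            ([0.0, 0.1, -0.1], 8000, [0, 0, 1, 1]),
            ([0.2, 0.0, 0.3], 8000, [1, 0, 1, 0]),
        ]

    def __getitem__(self, n):
        waveform, sample_rate, labels = self._items[n]
        return waveform, sample_rate, labels

    def __len__(self):
        return len(self._items)


dataset = FakeYesNo()

# Unpack by position, in the order the dataset returns its fields.
waveform, sample_rate, labels = dataset[0]

# Trailing fields can be collected with a starred target (Python 3 only).
waveform, sample_rate, *rest = dataset[1]
```

The trade-off is that callers depend on field order rather than field names, so adding a field to a dataset changes how its items must be unpacked.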

CC #303

vincentqb · 2019-11-05T21:16:19Z

torchaudio/datasets/librispeech.py

-        "waveform": waveform,
-        "sample_rate": sample_rate,
-    }
+    with open(file_text) as ft:


Adding this as part of this PR to make sure that the file is closed.
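As an illustration of the file-closing point, here is a minimal sketch that looks up one transcript line inside a `with` block. The helper name `find_utterance`, the error type, and the file contents are assumptions made for the example, not the code in this PR:

```python
import os
import tempfile


def find_utterance(file_text, fileid):
    """Return the text that follows fileid on its line of file_text."""
    # The context manager closes the file on normal exit, early return,
    # and when an exception propagates out of the block.
    with open(file_text) as ft:
        for line in ft:
            fileid_text, utterance = line.strip().split(" ", 1)
            if fileid_text == fileid:
                return utterance
    raise LookupError("no transcript line for " + fileid)


# Throwaway transcript file with made-up contents.
with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "example.trans.txt")
    with open(path, "w") as f:
        f.write("a-b-0000 HELLO WORLD\na-b-0001 GOODBYE\n")
    found = find_utterance(path, "a-b-0001")
```

Without the `with` block, an early `return` or an exception inside the loop would leave the handle open until garbage collection.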

vincentqb · 2019-11-05T21:17:09Z

torchaudio/datasets/librispeech.py

+        utterance,
+        int(speaker_id),
+        int(chapter_id),
+        int(utterance_id),


Also converting these to int, even though they come from file names.
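One consequence of the conversion, sketched under the assumption that a file id has the form `speaker-chapter-utterance` with a zero-padded utterance number. The helper `split_fileid`, the example id, and the padding width of 4 are illustrative assumptions:

```python
def split_fileid(fileid):
    """Split an id such as '1089-134686-0000' into three integers."""
    speaker_id, chapter_id, utterance_id = fileid.split("-")
    return int(speaker_id), int(chapter_id), int(utterance_id)


ids = split_fileid("1089-134686-0000")

# int() drops the zero padding, so rebuilding the file name needs the
# padding width to be supplied again (4 is assumed here).
rebuilt = "{}-{}-{:04d}".format(*ids)
```

Integers are convenient for comparison and batching, but the original string is only recoverable if the padding convention is known.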

vincentqb · 2019-11-05T21:17:17Z

torchaudio/datasets/yesno.py

@@ -11,16 +11,20 @@

 def load_yesno_item(fileid, path, ext_audio):
    # Read label
-    label = fileid.split("_")
+    labels = [int(c) for c in fileid.split("_")]


Also converting these to int, even though they come from file names.
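The changed line can be tried on its own. In this small sketch the wrapper `parse_labels` and the example file id are made up for illustration; only the list comprehension comes from the diff:

```python
def parse_labels(fileid):
    """Turn an underscore-separated file id into a list of int labels."""
    return [int(c) for c in fileid.split("_")]


labels = parse_labels("0_1_1_0_0_1_0_1")
```

A file id containing a non-numeric piece would raise `ValueError` here, whereas the previous `fileid.split("_")` returned the strings unchanged.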

vincentqb · 2019-11-06T17:15:16Z

Just modified CommonVoice to return a tuple (waveform, sample_rate, dictionary), where the dictionary contains the data and header from the tsv. This is closer to how the data is provided by CommonVoice. The only assumption on the tsv is that the second column is the name of the audio file to retrieve.
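A minimal sketch of that idea, using the standard `csv` module on an in-memory tsv. The row values are invented, and audio loading is left out; only the header-to-value mapping and the "second column is the file name" assumption are shown:

```python
import csv
import io

# Made-up tsv content: a header line followed by one data row.
tsv_text = (
    "client_id\tpath\tsentence\tup_votes\tdown_votes\tage\tgender\taccent\n"
    "abc123\tclip_0001.mp3\thello world\t2\t0\t\t\t\n"
)

reader = csv.reader(io.StringIO(tsv_text), delimiter="\t")
header = next(reader)  # first line holds the column names
rows = list(reader)

line = rows[0]
filename = line[1]  # the only positional assumption: column 2 is the audio file
dictionary = dict(zip(header, line))  # header field -> value for this row
```

Values stay as strings (for example `dictionary["up_votes"]` is `"2"`), and empty columns map to empty strings, so the dictionary mirrors the tsv without interpreting it.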

The previous iteration was:

    # waveform, sample_rate, client_id, path, sentence, up_votes, down_votes, age, gender, accent
    return(waveform, sample_rate, *line)

P.S. The notation with `*line` is not compatible with Python 2.
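For comparison, starred unpacking inside a tuple display requires Python 3.5 or later, while plain tuple concatenation gives the same result on older interpreters as well. The values below are placeholders:

```python
waveform = [0.0, 0.1]  # placeholder for the waveform tensor
sample_rate = 16000
line = ["abc123", "clip_0001.mp3", "hello world"]

# Python 3.5+ only: unpack the row in place.
starred = (waveform, sample_rate, *line)

# Equivalent form without starred unpacking.
concatenated = (waveform, sample_rate) + tuple(line)
```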

vincentqb added 2 commits November 5, 2019 16:06

close file.

564b80b

staying with datapoints as tuples until further notice.

2492ed4

vincentqb commented Nov 5, 2019

cpuhrsch approved these changes Nov 6, 2019

vincentqb added 2 commits November 6, 2019 11:49

loading tsv as dict instead.

e710040

change var name.

5b2fd93

vincentqb merged commit 38d1a9b into pytorch:master Nov 6, 2019

vincentqb deleted the notnamedtuple branch November 6, 2019 23:06

vincentqb mentioned this pull request Dec 20, 2019

Update audio preprocessing tutorial pytorch/tutorials#797

Merged

10 tasks

vincentqb mentioned this pull request Aug 17, 2020

Add tedlium dataset (all 3 releases) #882

Merged
