feat: update ApplyBackgroundNoise augmentation #48

hbredin · 2020-11-23T09:55:27Z

No description provided.

hbredin · 2020-11-23T09:55:53Z

This is a work in progress but I'd love to receive early feedback anyway.

torch_audiomentations/augmentations/background_noise.py

iver56

Add torchaudio>=0.6.0 to the list of required dependencies in setup.py

torch_audiomentations/augmentations/background_noise.py

iver56 · 2020-11-23T10:23:57Z

I can take another look when it's out of draft :)

Co-authored-by: Iver Jordal <1470603+iver56@users.noreply.github.com>

iver56 · 2020-11-30T15:20:26Z

There's a merge conflict, and this is the reason: https://github.com/asteroid-team/torch-audiomentations/pull/50/files#diff-6e77014151191ab9ff2d304e38e00227aacf5f13c96d97b95bb4ace36bf834c1

hbredin · 2020-11-30T15:51:00Z

This PR now fails because tests try to augment 2d samples.
Shouldn't we first come up with a PR that switches to 3d only (as discussed last week in slack)?

iver56 · 2020-11-30T16:04:34Z

This PR now fails because tests try to augment 2d samples.
Shouldn't we first come up with a PR that switches to 3d only (as discussed last week in slack)?

Yes, we should adapt (or remove) tests that currently provide 2d input. We can enforce/assert 3d input in a different pull request.

…ng changes in torchaudio.

iver56 · 2020-11-30T21:40:31Z

I tried to use this transform in the demo script. I proposed a few changes in your branch here: hbredin#1

I'm currently getting an exception like this:

Traceback (most recent call last):
  File "C:/Users/Iver/Code/torch-audiomentations/scripts/demo.py", line 139, in <module>
    samples=samples, sample_rate=SAMPLE_RATE
  File "C:\Users\Iver\Anaconda3\envs\torch-audiomentations-gpu\lib\site-packages\torch\nn\modules\module.py", line 722, in _call_impl
    result = self.forward(*input, **kwargs)
  File "C:\Users\Iver\Code\torch-audiomentations\torch_audiomentations\core\transforms_interface.py", line 189, in forward
    self.randomize_parameters(cloned_samples, sample_rate)
  File "C:\Users\Iver\Code\torch-audiomentations\torch_audiomentations\augmentations\background_noise.py", line 113, in randomize_parameters
    [self.random_background(audio, num_samples) for _ in range(batch_size)]
  File "C:\Users\Iver\Code\torch-audiomentations\torch_audiomentations\augmentations\background_noise.py", line 113, in <listcomp>
    [self.random_background(audio, num_samples) for _ in range(batch_size)]
  File "C:\Users\Iver\Code\torch-audiomentations\torch_audiomentations\augmentations\background_noise.py", line 80, in random_background
    0, background_num_samples - missing_num_samples
  File "C:\Users\Iver\Anaconda3\envs\torch-audiomentations-gpu\lib\random.py", line 222, in randint
    return self.randrange(a, b+1)
  File "C:\Users\Iver\Anaconda3\envs\torch-audiomentations-gpu\lib\random.py", line 195, in randrange
    raise ValueError("non-integer stop for randrange()")
ValueError: non-integer stop for randrange()

Edit: I think it's because get_num_samples sometimes doesn't return an int

iver56 · 2020-12-01T08:04:09Z

I tried to run the demo script, and I got an exception like this:

Traceback (most recent call last):
  File "C:/Users/Iver/Code/torch-audiomentations/scripts/demo.py", line 139, in <module>
    samples=samples, sample_rate=SAMPLE_RATE
  File "C:\Users\Iver\Anaconda3\envs\torch-audiomentations-gpu\lib\site-packages\torch\nn\modules\module.py", line 722, in _call_impl
    result = self.forward(*input, **kwargs)
  File "C:\Users\Iver\Code\torch-audiomentations\torch_audiomentations\core\transforms_interface.py", line 189, in forward
    self.randomize_parameters(cloned_samples, sample_rate)
  File "C:\Users\Iver\Code\torch-audiomentations\torch_audiomentations\augmentations\background_noise.py", line 113, in randomize_parameters
    [self.random_background(audio, num_samples) for _ in range(batch_size)]
  File "C:\Users\Iver\Code\torch-audiomentations\torch_audiomentations\augmentations\background_noise.py", line 113, in <listcomp>
    [self.random_background(audio, num_samples) for _ in range(batch_size)]
  File "C:\Users\Iver\Code\torch-audiomentations\torch_audiomentations\augmentations\background_noise.py", line 84, in random_background
    background_path, sample_offset=sample_offset, num_samples=num_samples,
  File "C:\Users\Iver\Code\torch-audiomentations\torch_audiomentations\utils\io.py", line 240, in __call__
    raise ValueError()
ValueError

Somehow the numbers don't add up in this case, and I think it's related to signal and noise having different original sample rates

        # io.py line 239-240
        if original_sample_offset + original_num_samples > original_total_num_samples:
            raise ValueError()

Could you try to run the demo script and see if you can reproduce it?

python -m scripts.demo

iver56 · 2020-12-01T11:51:50Z

LGTM! 🚀
Thanks for the contribution 😄

hbredin added 2 commits November 23, 2020 10:52

feat: add Audio IO class

423237e

wip: add first version of updated ApplyBackgroundNoise

32687d7

Merge branch 'master' into master

f04c36b

iver56 self-requested a review November 23, 2020 10:00

iver56 reviewed Nov 23, 2020

View reviewed changes

torch_audiomentations/augmentations/background_noise.py Outdated Show resolved Hide resolved

iver56 reviewed Nov 23, 2020

View reviewed changes

torch_audiomentations/augmentations/background_noise.py Show resolved Hide resolved

torch_audiomentations/augmentations/background_noise.py Outdated Show resolved Hide resolved

torch_audiomentations/augmentations/background_noise.py Outdated Show resolved Hide resolved

hbredin and others added 2 commits November 30, 2020 16:14

Update torch_audiomentations/augmentations/background_noise.py

dfc4f57

Co-authored-by: Iver Jordal <1470603+iver56@users.noreply.github.com>

Update torch_audiomentations/augmentations/background_noise.py

ae895d5

Co-authored-by: Iver Jordal <1470603+iver56@users.noreply.github.com>

hbredin and others added 5 commits November 30, 2020 16:25

refactor: use calculate_rms() instead of (now removed) self.rms()

571de06

setup: add torchaudio conda dependency

3cd44d9

feat: add ApplyBackgroundNoise to root module

e72bd13

chore: rename (old) parameters to (new) transform_parameters

935961f

Merge branch 'master' into master

6225c29

hbredin changed the title ~~wip: update ApplyBackgroundNoise augmentation~~ feat: update ApplyBackgroundNoise augmentation Nov 30, 2020

hbredin marked this pull request as ready for review November 30, 2020 15:34

hbredin requested a review from iver56 November 30, 2020 15:35

hbredin added 2 commits November 30, 2020 16:39

tests: add torchaudio requirement

ffbe180

setup: add torchaudio in setup.py

a94a05a

iver56 added 3 commits November 30, 2020 17:27

Refactor torchaudio.info usage and make it compatible with the breaki…

e6d3bba

…ng changes in torchaudio.

Run ApplyBackgroundNoise in the demo script

fb9a3a5

Add support for the legacy interface of torchaudio.load

335a880

hbredin and others added 2 commits December 1, 2020 08:41

fix: fix torchaudio compatibility

561bfc4

fix: ensure get_num_samples returns an int

e6b4060

hbredin added 3 commits December 1, 2020 12:27

fix: fix wrong get_audio_metadata output order

2f7a740

fix: fix rounding error

36067e1

fix: fix tests

b4f8d00

iver56 merged commit 062347f into asteroid-team:master Dec 1, 2020

iver56 removed their request for review December 1, 2020 11:52

iver56 mentioned this pull request Dec 7, 2020

Load audio with torchaudio, not librosa #9

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: update ApplyBackgroundNoise augmentation #48

feat: update ApplyBackgroundNoise augmentation #48

hbredin commented Nov 23, 2020

hbredin commented Nov 23, 2020

iver56 left a comment

iver56 commented Nov 23, 2020

iver56 commented Nov 30, 2020

hbredin commented Nov 30, 2020

iver56 commented Nov 30, 2020

iver56 commented Nov 30, 2020 •

edited

Loading

iver56 commented Dec 1, 2020 •

edited

Loading

iver56 commented Dec 1, 2020

feat: update ApplyBackgroundNoise augmentation #48

feat: update ApplyBackgroundNoise augmentation #48

Conversation

hbredin commented Nov 23, 2020

hbredin commented Nov 23, 2020

iver56 left a comment

Choose a reason for hiding this comment

iver56 commented Nov 23, 2020

iver56 commented Nov 30, 2020

hbredin commented Nov 30, 2020

iver56 commented Nov 30, 2020

iver56 commented Nov 30, 2020 • edited Loading

iver56 commented Dec 1, 2020 • edited Loading

iver56 commented Dec 1, 2020

iver56 commented Nov 30, 2020 •

edited

Loading

iver56 commented Dec 1, 2020 •

edited

Loading