-
Notifications
You must be signed in to change notification settings - Fork 704
Closed
Labels
Description
🚀 Feature
Convert sox effects into transforms. Ideally, we provide enough building blocks to make those transforms easy to assemble.
- allpass (with biquad)
- band (with biquad)
- bandpass/bandreject (with biquad)
- bass (with biquad)
- treble (with biquad)
- bend
- biquad
-
channels (we already decided downmix, etc, is easy to do by hand) - chorus
- compand/mcompand
- contrast
- dcshift
- deemph (with biquad, related to this)
- delay
- dither
- downsample/upsample (available with resample)
- earwax
- echo/echos
- equalizer (with biquad, see also here)
- fade
- fir
- flanger
- gain
- highpass/lowpass (with biquad)
- hilbert
- ladspa
- loudness
- mcompand
- noiseprof/noisered
-
norm (duplicate of gain) -
oops (simply the difference between two channels) - overdrive
-
pad/trim (easy to do without transforms) - phaser
- pitch
-
rate (resample already available) -
remix (we already decided downmix, etc, is easy to do by hand) -
repeat (tensor.repeat
or pad with replicate) - reverb
-
reverse - riaa (with biquad)
- silence
- sinc
- spectrogram
-
speed (change sample rate) - stretch/tempo (vocoder functional available)
- splice
-
stat/stats (only prints information about the waveform) -
swap - synth
- tremolo
- vad (in example)
- vol
Crossed out: can be done in a simple way with current pytorch operations, so we don't need a dedicated transform. Feel free to point out any updates recommended to this list.
Additional context
Remove dependency on sox in VCTKOpen files in format such as mp3, etc (see here and here)- Comment from Merging plan from torchaudio-contrib #110 linking conversation: fast, general audio read keunwoochoi/torchaudio-contrib#31
- Some transforms touch on Merging items torchaudio-contrib that are not yet in torchaudio #259
- Performance comparison between libraries for audio loading here
- internal doc
Mistobaan, keunwoochoi, eugene-kharitonov, tomassosorio, markovka17 and 1 morecpuhrsch