Skip to content

Commit

Permalink
Add instruction to implement audio augmentation using tourchaudio.sox…
Browse files Browse the repository at this point in the history
…_effects
  • Loading branch information
Le Vu Linh Chi authored and Le Vu Linh Chi committed Jan 2, 2025
1 parent b6d4675 commit 6217e24
Showing 1 changed file with 30 additions and 0 deletions.
30 changes: 30 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -83,3 +83,33 @@ The pre-trained models provided in this library may have their own licenses or t
For instance, SquimSubjective model is released under the Creative Commons Attribution Non Commercial 4.0 International (CC-BY-NC 4.0) license. See [the link](https://zenodo.org/record/4660670#.ZBtWPOxuerN) for additional details.

Other pre-trained models that have different license are noted in documentation. Please checkout the [documentation page](https://pytorch.org/audio/main/).

Audio augmentation using sox_effect
-------------------------
Step 1: Install torchaudio
pip install torchaudio

Step 2: Import Necessary Libraries
import torchaudio
import torchaudio.sox_effects as sox_effects

Step 3: Load an Audio File
waveform, sample_rate = torchaudio.load("path_to_audio_file")

Step 4: Define the SoX Effects and apply the effects to input audio. In this example, I apply time_stretch, loudness, and high-pass filter adjustment.

effects = [
["tempo", "1.25"], # Increase the playback speed (tempo) by 25%
["gain", "10"], # Amplify the audio by 10 dB
["highpass", "1000"], # Apply a high-pass filter with a cutoff frequency of 1000 Hz
]

augmented_waveform, augmented_sample_rate = sox_effects.apply_effects_tensor(
waveform,
sample_rate,
effects
)

Step 5: Save the Augmented Audio

torchaudio.save("augmented_audio.wav", augmented_waveform, augmented_sample_rate)

0 comments on commit 6217e24

Please sign in to comment.