You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I am working with the AudioMAE codebase and have a question about the implementation of the _roll_mag_aug function.
I've noticed that the index for rolling the waveform is determined by np.random.randint(len(waveform)).
However, for a waveform with the shape (1, 16000), this approach seems to always return 0, leading to no actual shift in the time axis of the waveform.
Could you please clarify if this is the intended behavior? If the goal is to roll along the time axis, would it be more appropriate to use np.random.randint(waveform.shape[1]) to ensure variability in the rolling process?
Any insights would be greatly appreciated, as I aim to understand and utilize this function correctly in my project.
Thank you for your time and the amazing work on AudioMAE !
The text was updated successfully, but these errors were encountered:
Hello,
I am working with the AudioMAE codebase and have a question about the implementation of the
_roll_mag_aug
function.I've noticed that the index for rolling the waveform is determined by
np.random.randint(len(waveform))
.However, for a waveform with the shape (1, 16000), this approach seems to always return 0, leading to no actual shift in the time axis of the waveform.
Could you please clarify if this is the intended behavior? If the goal is to roll along the time axis, would it be more appropriate to use
np.random.randint(waveform.shape[1])
to ensure variability in the rolling process?Any insights would be greatly appreciated, as I aim to understand and utilize this function correctly in my project.
Thank you for your time and the amazing work on AudioMAE !
The text was updated successfully, but these errors were encountered: