My research focuses on machine listening and improvisation.
During my PhD at Berkeley I was advised mainly by Sanjit Seshia and Edmund Campion.
Pinned
- NVIDIA/tacotron2
  Tacotron 2 - PyTorch implementation with faster-than-realtime inference
- NVIDIA/waveglow
  A Flow-based Generative Network for Speech Synthesis
- NVIDIA/mellotron
  Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data
- NVIDIA/flowtron
  Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style transfer