Pinned
- kyutai-labs/moshi: Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.
- kyutai-labs/hibiki: Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation, where one waits for the end of the source utterance to start translating, H…
- demucs (forked from facebookresearch/demucs): Code for the paper Hybrid Spectrogram and Waveform Source Separation.
- facebookresearch/audiocraft: Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
- facebookresearch/dora: Dora is an experiment management framework. It expresses grid searches as pure Python files that live in your repo, and it identifies each experiment by a unique hash signature. Scale up to hundreds of exp…