Ilya Shigabeev shigabeev

🥐

48 followers · 41 following

https://langswap.app

Stars

ai-forever / Real-ESRGAN

PyTorch implementation of Real-ESRGAN model

Python 525 150 Updated Apr 15, 2024

pytorch / torchcodec

PyTorch video decoding

Python 176 14 Updated Jan 1, 2025

colstone / SOFA_AI

SOFA_AI: Singing-Oriented Forced Aligner for Automatic Inference

Python 20 3 Updated May 28, 2024

Tera2Space / AudioAE

Simple audio AE

Python 11 Updated Nov 10, 2024

Audio-WestlakeU / audiossl

A library built for easier audio self-supervised training, downstream tasks evaluation

Python 110 10 Updated Aug 27, 2024

oss-roettger / XL-Textual-Inversion

Textual Inversion for Stable Diffusion XL 1.0

Jupyter Notebook 74 6 Updated Jan 6, 2024

advimman / lama

🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022

Jupyter Notebook 8,273 877 Updated Jul 26, 2024

MilanaShhanukova / speech_parameters

Parameters to analyse audio files

Python 1 1 Updated Dec 27, 2024

ictnlp / LLaMA-Omni

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Python 2,696 185 Updated Nov 14, 2024

DigitalPhonetics / IMS-Toucan

Controllable and fast Text-to-Speech for over 7000 languages!

Python 1,504 172 Updated Nov 7, 2024

ternaus / retinaface

The remake of the https://github.com/biubug6/Pytorch_Retinaface

Python 394 107 Updated Jan 27, 2023

karpathy / LLM101n

LLM101n: Let's build a Storyteller

30,796 1,682 Updated Aug 1, 2024

SpeechColab / GigaSpeech2

An evolving, large-scale and multi-domain ASR corpus for low-resource languages with automated crawling, transcription and refinement

Python 123 6 Updated Dec 29, 2024

yandexdataschool / speech_course

YSDA course in Speech Processing.

Jupyter Notebook 208 66 Updated Jul 1, 2024

line / LibriTTS-P

LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning

116 2 Updated Jun 13, 2024

AI4Bharat / IndicVoices-R

A Massive Multilingual Multi-speaker Speech Corpus for Scaling Indian TTS

Python 32 1 Updated Dec 11, 2024

ai-forever / DataProcessingFramework

Framework for processing and filtering datasets

Python 26 2 Updated Aug 1, 2024

radoondas / flask-elastic-image-search

Python 60 16 Updated Jun 12, 2024

bmaltais / kohya_ss

Python 9,877 1,271 Updated Jan 1, 2025

shang0712 / HierTTS

Python 44 10 Updated Apr 16, 2023

asmindev / image-scrapper

Image Scrapper (Unsplash and Pinterest)

Python 1 Updated Jul 23, 2023

ToTheBeginning / PuLID

[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment

Python 2,898 203 Updated Nov 27, 2024

fastai / lm-hackers

Hackers' Guide to Language Models

Jupyter Notebook 1,798 297 Updated Dec 13, 2024

gluonfield / enchanted

Enchanted is an iOS and macOS app for chatting with private, self-hosted language models such as Llama2, Mistral or Vicuna using Ollama.

Swift 4,131 258 Updated Nov 7, 2024

justinwlin / runpodWhisperx

Runpod WhisperX Docker Container Repo

Python 12 7 Updated Mar 10, 2024

IDRnD / VoxTube

The VoxTube dataset official repository

HTML 62 1 Updated Feb 14, 2024

lifeiteng / naturalspeech3_facodec

FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3

Python 181 12 Updated Apr 20, 2024

thepowerfuldeez / tacotron2_improved

This is my reimplementation of Tacotron2 based on nvidia implementation

Python 3 Updated Mar 28, 2024

xai-org / grok-1

Grok open release

Python 49,750 8,345 Updated Aug 30, 2024

k2-fsa / libriheavy

Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context

Python 182 11 Updated Sep 10, 2024
