Stars
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
Fully open reproduction of DeepSeek-R1
A course on aligning smol models.
Curated list of datasets and tools for post-training.
👻 Ghostty is a fast, feature-rich, and cross-platform terminal emulator that uses platform-native UI and GPU acceleration.
Examples and guides for using the Gemini API
The fast, Pythonic way to build Model Context Protocol servers 🚀
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
A full-featured, hackable Next.js AI chatbot built by Vercel
LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning
Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
Composable building blocks to build Llama Apps
A playbook for systematically maximizing the performance of deep learning models.
Efficient Triton Kernels for LLM Training
Install PyTorch distributions with computation backend auto-detection
The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬
LostRuins / koboldcpp
Forked from ggml-org/llama.cpp
Run GGUF models easily with a KoboldAI UI. One File. Zero Install.
A complete guide to start and improve your LLM skills in 2025 with little background in the field and stay up-to-date with the latest news and state-of-the-art techniques!
Vertex AI (GCP) Claude Proxy via Cloudflare workers
Port of OpenAI's Whisper model in C/C++
CloudflareSpeedTest: pushes a self-selected list of preferred IPs every 5 minutes. https://ip.164746.xyz