- Shenzhen, China
- https://www.fullstackmemo.com
-
s1 Public
Forked from simplescaling/s1
s1: Simple test-time scaling
Python Apache License 2.0 Updated Mar 6, 2025 -
evalscope Public
Forked from modelscope/evalscope
A streamlined and customizable framework for efficient large model evaluation and performance benchmarking
Python Apache License 2.0 Updated Mar 4, 2025 -
TransMLA Public
Forked from fxmeng/TransMLA
TransMLA: Multi-Head Latent Attention Is All You Need
Python MIT License Updated Mar 1, 2025 -
LLaDA Public
Forked from ML-GSAI/LLaDA
Official PyTorch implementation for "Large Language Diffusion Models"
Python MIT License Updated Feb 26, 2025 -
LiveCodeBench Public
Forked from LiveCodeBench/LiveCodeBench
Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"
Python MIT License Updated Feb 24, 2025 -
grpo-graph-extraction Public
Forked from wey-gu/grpo-graph-extraction
Qwen GRPO Graph Extraction RL Finetune
Jupyter Notebook Apache License 2.0 Updated Feb 23, 2025 -
LLaMA-Factory Public
Forked from hiyouga/LLaMA-Factory
Easy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM)
Python Apache License 2.0 Updated Feb 22, 2025 -
vscode Public
Forked from microsoft/vscode
Visual Studio Code
TypeScript MIT License Updated Feb 17, 2025 -
argilla Public
Forked from argilla-io/argilla
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
Python Apache License 2.0 Updated Feb 17, 2025 -
Kiln Public
Forked from Kiln-AI/Kiln
The easiest tool for fine-tuning LLM models, synthetic data generation, and collaborating on datasets.
Python Other Updated Feb 16, 2025 -
clearml Public
Forked from clearml/clearml
ClearML - Auto-Magical CI/CD to streamline your AI workload. Experiment Management, Data Management, Pipeline, Orchestration, Scheduling & Serving in one MLOps/LLMOps solution
Python Apache License 2.0 Updated Feb 16, 2025 -
deepscaler Public
Forked from agentica-project/deepscaler
Democratizing Reinforcement Learning for LLMs
Python MIT License Updated Feb 16, 2025 -
unsloth Public
Forked from unslothai/unsloth
Finetune Llama 3.3, DeepSeek-R1, Reasoning, Phi-4 & Gemma 2 LLMs 2x faster with 70% less memory
Python Apache License 2.0 Updated Feb 14, 2025 -
unsloth-zoo Public
Forked from unslothai/unsloth-zoo
Utils for Unsloth
Python GNU Lesser General Public License v3.0 Updated Feb 13, 2025 -
moe-pruner Public
Forked from gabrielolympie/moe-pruner
A repository aimed at pruning DeepSeek V3, R1 and R1-zero to a usable size
Python Updated Feb 13, 2025 -
open-webui Public
Forked from open-webui/open-webui
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
JavaScript BSD 3-Clause "New" or "Revised" License Updated Feb 12, 2025 -
chat-ui Public
Forked from huggingface/chat-ui
Open source codebase powering the HuggingChat app
TypeScript Apache License 2.0 Updated Feb 12, 2025 -
coai Public
Forked from coaidev/coai
🚀 Next Generation AI One-Stop Internationalization Solution. 🚀 A next-generation one-stop AI solution for business and consumer use, supporting OpenAI, Midjourney, Claude, iFlytek Spark, Stable Diffusion, DALL·E, ChatGLM, Tongyi Qianwen, Tencent Hunyuan, 360 Zhinao, Baichuan AI, Volcano Ark, New Bing, Gemini, Moonshot …
TypeScript Apache License 2.0 Updated Feb 12, 2025 -
OpenRLHF Public
Forked from OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Python Apache License 2.0 Updated Feb 10, 2025 -
leaked-system-prompts Public
Forked from jujumilk3/leaked-system-prompts
Collection of leaked system prompts
Updated Feb 10, 2025 -
simpleRL-reason Public
Forked from hkust-nlp/simpleRL-reason
This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data
Python MIT License Updated Feb 7, 2025 -
open-thoughts Public
Forked from open-thoughts/open-thoughts
Open Thoughts: Fully Open Data Curation for Thinking Models
Python Apache License 2.0 Updated Feb 7, 2025 -
oat-zero Public
Forked from sail-sg/oat-zero
A lightweight reproduction of DeepSeek-R1-Zero with indepth analysis of self-reflection behavior.
Python MIT License Updated Feb 6, 2025 -
ChatPPT-o1-mini Public
Forked from YOOTeam/ChatPPT-o1-mini
An LLM-Agent and PPT project: a native AI application that operates on PPT files from conversational requests. The model is DeepSeek; document operations are performed through LLM + VBA calls, as a lightweight ChatPPT service. An LLM-Agent and PPT project that supports native AI application projects that operate PPT based on conversati…
VBA GNU General Public License v3.0 Updated Feb 5, 2025 -
TinyZero Public
Forked from Jiayi-Pan/TinyZero
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
Python Apache License 2.0 Updated Feb 1, 2025 -
Thinking-Claude Public
Forked from richards199999/Thinking-Claude
Let your Claude able to think
TypeScript MIT License Updated Jan 23, 2025 -
CAG Public
Forked from hhhuang/CAG
Cache-Augmented Generation: A Simple, Efficient Alternative to RAG
Python MIT License Updated Jan 15, 2025 -
tiptap Public
Forked from ueberdosis/tiptap
The headless rich text editor framework for web artisans.
TypeScript MIT License Updated Jan 13, 2025