Skip to content
View zzxslp's full-sized avatar

Highlights

  • Pro

Block or report zzxslp

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Tiny PyTorch library for maintaining a moving average of a collection of parameters.

Python 422 26 Updated Oct 2, 2024

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. Cโ€ฆ

Jupyter Notebook 7,626 489 Updated Mar 7, 2025

Make websites accessible for AI agents

Python 35,746 3,704 Updated Mar 3, 2025

๐Ÿง‘โ€๐Ÿซ 60+ Implementations/tutorials of deep learning papers with side-by-side notes ๐Ÿ“; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gaโ€ฆ

Python 58,991 5,986 Updated Aug 24, 2024

Muon optimizer: +>30% sample efficiency with <3% wallclock overhead

Python 459 24 Updated Mar 1, 2025

FlashMLA: Efficient MLA decoding kernels

C++ 11,162 774 Updated Mar 1, 2025

๐Ÿค– ๐—Ÿ๐—ฒ๐—ฎ๐—ฟ๐—ป for ๐—ณ๐—ฟ๐—ฒ๐—ฒ how to ๐—ฏ๐˜‚๐—ถ๐—น๐—ฑ an end-to-end ๐—ฝ๐—ฟ๐—ผ๐—ฑ๐˜‚๐—ฐ๐˜๐—ถ๐—ผ๐—ป-๐—ฟ๐—ฒ๐—ฎ๐—ฑ๐˜† ๐—Ÿ๐—Ÿ๐—  & ๐—ฅ๐—”๐—š ๐˜€๐˜†๐˜€๐˜๐—ฒ๐—บ using ๐—Ÿ๐—Ÿ๐— ๐—ข๐—ฝ๐˜€ best practices: ~ ๐˜ด๐˜ฐ๐˜ถ๐˜ณ๐˜ค๐˜ฆ ๐˜ค๐˜ฐ๐˜ฅ๐˜ฆ + 12 ๐˜ฉ๐˜ข๐˜ฏ๐˜ฅ๐˜ด-๐˜ฐ๐˜ฏ ๐˜ญ๐˜ฆ๐˜ด๐˜ด๐˜ฐ๐˜ฏ๐˜ด

Python 3,673 599 Updated Mar 6, 2025

A minimal implementation of vllm.

Cuda 34 Updated Jul 27, 2024

verl: Volcano Engine Reinforcement Learning for LLMs

Python 4,390 407 Updated Mar 6, 2025

Official code for "DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps" (Neurips 2022 Oral)

Python 1,629 122 Updated Feb 6, 2024

An official implementation of VideoRoPE: What Makes for Good Video Rotary Position Embedding?

Python 92 Updated Mar 2, 2025

AISystem ไธป่ฆๆ˜ฏๆŒ‡AI็ณป็ปŸ๏ผŒๅŒ…ๆ‹ฌAI่Šฏ็‰‡ใ€AI็ผ–่ฏ‘ๅ™จใ€AIๆŽจ็†ๅ’Œ่ฎญ็ปƒๆก†ๆžถ็ญ‰AIๅ…จๆ ˆๅบ•ๅฑ‚ๆŠ€ๆœฏ

Jupyter Notebook 12,758 1,842 Updated Mar 1, 2025

Several simple examples for popular neural network toolkits calling custom CUDA operators.

Python 1,411 196 Updated Apr 29, 2021

Pytorch library for fast transformer implementations

Python 1,682 184 Updated Mar 23, 2023

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2โ€ฆ

Python 14,860 1,504 Updated Mar 6, 2025

This repository contains LLM (Large language model) interview question asked in top companies like Google, Nvidia , Meta , Microsoft & fortune 500 companies.

1,127 257 Updated Feb 12, 2025

Investigating CoT Reasoning in Autoregressive Image Generation

Python 524 19 Updated Feb 5, 2025

s1: Simple test-time scaling

Python 5,859 669 Updated Mar 6, 2025

ใ€Šไปฃ็ ้šๆƒณๅฝ•ใ€‹LeetCode ๅˆท้ข˜ๆ”ป็•ฅ๏ผš200้“็ปๅ…ธ้ข˜็›ฎๅˆท้ข˜้กบๅบ๏ผŒๅ…ฑ60wๅญ—็š„่ฏฆ็ป†ๅ›พ่งฃ๏ผŒ่ง†้ข‘้šพ็‚นๅ‰–ๆž๏ผŒ50ไฝ™ๅผ ๆ€็ปดๅฏผๅ›พ๏ผŒๆ”ฏๆŒC++๏ผŒJava๏ผŒPython๏ผŒGo๏ผŒJavaScript็ญ‰ๅคš่ฏญ่จ€็‰ˆๆœฌ๏ผŒไปŽๆญค็ฎ—ๆณ•ๅญฆไน ไธๅ†่ฟท่Œซ๏ผ๐Ÿ”ฅ๐Ÿ”ฅ ๆฅ็œ‹็œ‹๏ผŒไฝ ไผšๅ‘็Žฐ็›ธ่งๆจๆ™š๏ผ๐Ÿš€

Shell 54,456 11,860 Updated Mar 5, 2025
Python 90 6 Updated May 29, 2023

This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data

Python 3,076 228 Updated Feb 19, 2025

Open-Sora: Democratizing Efficient Video Production for All

Python 23,493 2,327 Updated Mar 6, 2025

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python 11,916 1,059 Updated Mar 6, 2025

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 10,895 1,039 Updated Mar 3, 2025

A curated list of recent diffusion models for video generation, editing, and various other applications.

4,100 238 Updated Mar 6, 2025

a family of versatile and state-of-the-art video tokenizers.

Python 348 19 Updated Jan 15, 2025

Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.

Python 706 49 Updated Sep 27, 2024

Module 0 - Fundamentals

Python 100 1,188 Updated Aug 30, 2024

(WIP) A small but powerful, homemade PyTorch from scratch.

C++ 530 24 Updated Mar 2, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 41,508 5,623 Updated Mar 5, 2025
Next
Showing results