-
Salesforce AI Research
- Palo Alto
- https://zzxslp.github.io/
Highlights
- Pro
Stars
Tiny PyTorch library for maintaining a moving average of a collection of parameters.
Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. Cโฆ
Make websites accessible for AI agents
๐งโ๐ซ 60+ Implementations/tutorials of deep learning papers with side-by-side notes ๐; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gaโฆ
Muon optimizer: +>30% sample efficiency with <3% wallclock overhead
๐ค ๐๐ฒ๐ฎ๐ฟ๐ป for ๐ณ๐ฟ๐ฒ๐ฒ how to ๐ฏ๐๐ถ๐น๐ฑ an end-to-end ๐ฝ๐ฟ๐ผ๐ฑ๐๐ฐ๐๐ถ๐ผ๐ป-๐ฟ๐ฒ๐ฎ๐ฑ๐ ๐๐๐ & ๐ฅ๐๐ ๐๐๐๐๐ฒ๐บ using ๐๐๐ ๐ข๐ฝ๐ best practices: ~ ๐ด๐ฐ๐ถ๐ณ๐ค๐ฆ ๐ค๐ฐ๐ฅ๐ฆ + 12 ๐ฉ๐ข๐ฏ๐ฅ๐ด-๐ฐ๐ฏ ๐ญ๐ฆ๐ด๐ด๐ฐ๐ฏ๐ด
verl: Volcano Engine Reinforcement Learning for LLMs
Official code for "DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps" (Neurips 2022 Oral)
An official implementation of VideoRoPE: What Makes for Good Video Rotary Position Embedding?
AISystem ไธป่ฆๆฏๆAI็ณป็ป๏ผๅ ๆฌAI่ฏ็ใAI็ผ่ฏๅจใAIๆจ็ๅ่ฎญ็ปๆกๆถ็ญAIๅ จๆ ๅบๅฑๆๆฏ
Several simple examples for popular neural network toolkits calling custom CUDA operators.
Pytorch library for fast transformer implementations
SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2โฆ
This repository contains LLM (Large language model) interview question asked in top companies like Google, Nvidia , Meta , Microsoft & fortune 500 companies.
Investigating CoT Reasoning in Autoregressive Image Generation
ใไปฃ็ ้ๆณๅฝใLeetCode ๅท้ขๆป็ฅ๏ผ200้็ปๅ ธ้ข็ฎๅท้ข้กบๅบ๏ผๅ ฑ60wๅญ็่ฏฆ็ปๅพ่งฃ๏ผ่ง้ข้พ็นๅๆ๏ผ50ไฝๅผ ๆ็ปดๅฏผๅพ๏ผๆฏๆC++๏ผJava๏ผPython๏ผGo๏ผJavaScript็ญๅค่ฏญ่จ็ๆฌ๏ผไปๆญค็ฎๆณๅญฆไน ไธๅ่ฟท่ซ๏ผ๐ฅ๐ฅ ๆฅ็็๏ผไฝ ไผๅ็ฐ็ธ่งๆจๆ๏ผ๐
This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data
Open-Sora: Democratizing Efficient Video Production for All
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
A curated list of recent diffusion models for video generation, editing, and various other applications.
a family of versatile and state-of-the-art video tokenizers.
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
(WIP) A small but powerful, homemade PyTorch from scratch.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step