zzxslp

An Yan zzxslp

Researcher. Vision-Language, Multimodal LLMs

38 followers · 12 following

Salesforce AI Research
Palo Alto
https://zzxslp.github.io/

Achievements

x2 x2

Achievements

x2 x2

Highlights

Stars

fadel / pytorch_ema

Tiny PyTorch library for maintaining a moving average of a collection of parameters.

Python 422 26 Updated Oct 2, 2024

NVIDIA / Cosmos

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Jupyter Notebook 7,626 489 Updated Mar 7, 2025

browser-use / browser-use

Make websites accessible for AI agents

Python 35,746 3,704 Updated Mar 3, 2025

labmlai / annotated_deep_learning_paper_implementations

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 58,991 5,986 Updated Aug 24, 2024

KellerJordan / Muon

Muon optimizer: +>30% sample efficiency with <3% wallclock overhead

Python 459 24 Updated Mar 1, 2025

deepseek-ai / FlashMLA

FlashMLA: Efficient MLA decoding kernels

C++ 11,162 774 Updated Mar 1, 2025

decodingml / llm-twin-course

🤖 𝗟𝗲𝗮𝗿𝗻 for 𝗳𝗿𝗲𝗲 how to 𝗯𝘂𝗶𝗹𝗱 an end-to-end 𝗽𝗿𝗼𝗱𝘂𝗰𝘁𝗶𝗼𝗻-𝗿𝗲𝗮𝗱𝘆 𝗟𝗟𝗠 & 𝗥𝗔𝗚 𝘀𝘆𝘀𝘁𝗲𝗺 using 𝗟𝗟𝗠𝗢𝗽𝘀 best practices: ~ 𝘴𝘰𝘶𝘳𝘤𝘦 𝘤𝘰𝘥𝘦 + 12 𝘩𝘢𝘯𝘥𝘴-𝘰𝘯 𝘭𝘦𝘴𝘴𝘰𝘯𝘴

Python 3,673 599 Updated Mar 6, 2025

MDK8888 / vllmini

A minimal implementation of vllm.

Cuda 34 Updated Jul 27, 2024

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 4,390 407 Updated Mar 6, 2025

LuChengTHU / dpm-solver

Official code for "DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps" (Neurips 2022 Oral)

Python 1,629 122 Updated Feb 6, 2024

Wiselnn570 / VideoRoPE

An official implementation of VideoRoPE: What Makes for Good Video Rotary Position Embedding?

Python 92 Updated Mar 2, 2025

chenzomi12 / AISystem

AISystem 主要是指AI系统，包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Jupyter Notebook 12,758 1,842 Updated Mar 1, 2025

godweiyang / NN-CUDA-Example

Several simple examples for popular neural network toolkits calling custom CUDA operators.

Python 1,411 196 Updated Apr 29, 2021

idiap / fast-transformers

Pytorch library for fast transformer implementations

Python 1,682 184 Updated Mar 23, 2023

SWE-agent / SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2…

Python 14,860 1,504 Updated Mar 6, 2025

llmgenai / LLMInterviewQuestions

This repository contains LLM (Large language model) interview question asked in top companies like Google, Nvidia , Meta , Microsoft & fortune 500 companies.

1,127 257 Updated Feb 12, 2025