Stars
📖A curated list of Awesome LLM/VLM Inference Papers with codes: WINT8/4, Flash-Attention, Paged-Attention, Parallelism, etc. 🎉🎉
A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".
[TMLR 2024] Efficient Large Language Models: A Survey
Refine high-quality datasets and visual AI models
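embedx is a fully self-developed distributed embedding training and inference framework written in C++. It currently supports graph models, deep ranking models, retrieval models, and joint training of graph and ranking models and of graph and retrieval models, among others.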
A scalable graph learning toolkit for extremely large graph datasets. (WWW'22, 🏆 Best Student Paper Award)
A programmer's guide to living longer
DeepRec is a high-performance recommendation deep learning framework based on TensorFlow. It is hosted in incubation in LF AI & Data Foundation.
A programmer's guide to cooking at home (Simplified Chinese only).
PKU-DAIR / Hetu
Forked from Hsword/Hetu
A high-performance distributed deep learning system targeting large-scale and automated distributed training.
PKU-DAIR / open-box
Forked from thomas-young-2013/open-box
Generalized and Efficient Blackbox Optimization System
Heterogeneous Information Network Datasets for Recommendation and Network Embedding
PyTorch implementations of Generative Adversarial Networks.
Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named Entity Recognition, Syntactic & Semantic Dependency Parsing, Document Classification
ECCV18 Workshops - Enhanced SRGAN. Champion PIRM Challenge on Perceptual Super-Resolution. The training codes are in BasicSR.
A computer algebra system written in pure Python
HugoZHL / ProbLM
Forked from yanlinf/ProbLM
Probabilistic counting for language modeling.