📖A curated list of Awesome LLM/VLM Inference Papers with codes: WINT8/4, Flash-Attention, Paged-Attention, Parallelism, etc. 🎉🎉

3,594 252 Updated Mar 4, 2025

A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.

Python 2,524 243 Updated Mar 5, 2025

FlashMLA: Efficient MLA decoding kernels

C++ 11,186 779 Updated Mar 1, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 4,818 469 Updated Mar 5, 2025

An AI Hedge Fund Team

Python 13,385 2,404 Updated Mar 7, 2025

Numbers every LLM developer should know

4,188 140 Updated Jan 16, 2024

Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".

1,479 106 Updated Aug 20, 2024

An Awesome Collection for LLM Survey

329 31 Updated Sep 12, 2024

[TMLR 2024] Efficient Large Language Models: A Survey

1,112 95 Updated Feb 27, 2025

Refine high-quality datasets and visual AI models

Python 9,261 606 Updated Mar 7, 2025

Learning healthily to age 150: an incomplete guide to tuning the human body system

13,527 990 Updated May 9, 2024

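embedx is a fully in-house distributed embedding training and inference framework written in C++. It currently supports graph models, deep ranking models, recall models, and joint training of graph-and-ranking and graph-and-recall models, among others.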

C++ 303 47 Updated May 27, 2024

A scalable graph learning toolkit for extremely large graph datasets. (WWW'22, 🏆 Best Student Paper Award)

Python 148 22 Updated May 10, 2024

A programmer's guide to living longer

31,031 2,160 Updated Jan 30, 2024

DeepRec is a high-performance recommendation deep learning framework based on TensorFlow. It is hosted in incubation in LF AI & Data Foundation.

C++ 1,079 359 Updated Jan 21, 2025

A programmer's guide to cooking at home (Simplified Chinese only).

Dockerfile 69,425 8,845 Updated Feb 22, 2025

a distributed deep learning platform

C++ 3,380 1,248 Updated Mar 7, 2025

A high-performance distributed deep learning system targeting large-scale and automated distributed training.

Python 289 32 Updated Feb 22, 2025

Generalized and Efficient Blackbox Optimization System

Python 398 54 Updated Oct 17, 2024

DMALab's reading group slides and papers.

17 3 Updated Jun 8, 2021

Heterogeneous Information Network Datasets for Recommendation and Network Embedding

340 92 Updated Dec 24, 2019

Heterogeneous Information Network Embedding

198 43 Updated Aug 28, 2021

PyTorch implementations of Generative Adversarial Networks.

Python 16,812 4,106 Updated Jun 18, 2024

Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named Entity Recognition, Syntactic & Semantic Dependency Parsing, Document Classification

Python 34,539 10,412 Updated Jan 15, 2025

ECCV18 Workshops - Enhanced SRGAN. Champion PIRM Challenge on Perceptual Super-Resolution. The training codes are in BasicSR.

Python 6,237 1,099 Updated Oct 19, 2022

A computer algebra system written in pure Python

Python 13,374 4,599 Updated Mar 7, 2025

Shōgun

C++ 3,039 1,035 Updated Dec 19, 2023

Jieba: Chinese word segmentation

Python 33,795 6,730 Updated Aug 21, 2024

probabilistic counting for language modeling.

Python 2 2 Updated Aug 15, 2018

probabilistic counting for language modeling.

Python 1 1 Updated Aug 15, 2018