Skip to content

Navigation Menu

Explore
By company size
By use case
By industry
View all solutions
Topics
- AI
- DevOps
- Security
- Software Development
- View all
Explore
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
Pricing

Search code, repositories, users, issues, pull requests...

Search

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

yushengsu-thu Follow

Overview Repositories 34 Projects 0 Packages 0 Stars 210

More

Overview
Repositories
Projects
Packages
Stars

yushengsu-thu

Follow

Yusheng (Ethan) Su yushengsu-thu

Follow

#ML #NLP #LLM Goal: Building a model toward AGI.

80 followers · 92 following

Tsinghua University (Graduated)
California, USA
04:57 - 7h behind
https://yushengsu-thu.github.io/
@thu_yushengsu

Achievements

Achievements

Highlights

Pro

Organizations

Block or report yushengsu-thu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Add an optional note:

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Overview Repositories 34 Projects 0 Packages 0 Stars 210

More

Overview
Repositories
Projects
Packages
Stars

Type All

Select type

All Sources Forks Archived Can be sponsored Mirrors Templates

Language All

Select language

All Python JavaScript Jupyter Notebook Cuda Vim Script Lua C++ HTML

Sort Last updated

Select order

Last updated Name Stars

verl Public
Forked from volcengine/verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 1 Apache License 2.0 Updated Mar 5, 2025
yushengsu-thu Public

Updated Jan 4, 2025
yushengsu-thu.github.io Public
Forked from academicpages/academicpages.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

JavaScript 1 MIT License Updated Oct 24, 2024
dolma Public
Forked from allenai/dolma

Data and tools for generating and inspecting OLMo pre-training data.

Python Apache License 2.0 Updated Sep 20, 2024
llama-recipes Public
Forked from meta-llama/llama-cookbook

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supportin…

Jupyter Notebook Updated Sep 18, 2024
Liger-Kernel Public
Forked from linkedin/Liger-Kernel

Efficient Triton Kernels for LLM Training

Python 1 BSD 2-Clause "Simplified" License Updated Sep 2, 2024
llm.c Public
Forked from karpathy/llm.c

LLM training in simple, raw C/CUDA

Cuda 1 MIT License Updated Aug 25, 2024
Megatron-LLM Public
Forked from epfLLM/Megatron-LLM

distributed trainer for LLMs

Python 1 Other Updated Aug 25, 2024
hack-vimrc Public

my vim configure

Vim Script 1 Updated Aug 22, 2024
PET_Scaling Public

Exploring the Impact of Model Scaling on Parameter-efficient Tuning Methods

Python 2 Updated Aug 9, 2024
mup_training Public

Explore the µP

Python 2 Updated Jul 13, 2024
pre-training_cook Public

pre-training_cook

1 1 Updated May 5, 2024
d2l-en Public
Forked from d2l-ai/d2l-en

Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.

Python Other Updated Mar 17, 2024
SuperAlignment_tuning Public

All Tuning works in SuperAlignment

2 Updated Jan 23, 2024
Embodied-Agents Public

This is a curated list of "Embodied Agents" research. Read this repository for the latest updates. Feel free to raise pull requests and launch the disscussion!

4 Updated Nov 8, 2023
Scaling-Science Public

Science driven scaling: to pursue scientific principles bebind scaling and use them to guide next-generation model development, where the subareas include data engineering, long context, efficiency…

3 Updated Nov 2, 2023
LLM-Advancing-from-Reasoning-to-Autonomous-Reasoning Public

LLM Reasoning

1 Updated Oct 31, 2023
LLM-Agent-Survey Public
Forked from Paitesanshi/LLM-Agent-Survey

Add AgentVerse paper link

Updated Sep 14, 2023
Voyager_fix_readme Public
Forked from MineDojo/Voyager

An Open-Ended Embodied Agent with Large Language Models

JavaScript MIT License Updated Aug 3, 2023
BMTrain Public
Forked from OpenBMB/BMTrain

Efficient Training (including pre-training and fine-tuning) for Big Models

Python Apache License 2.0 Updated Feb 9, 2023
FacebookChatBot Public

JavaScript 1 Updated Dec 7, 2022
LunarVim Public
Forked from LunarVim/LunarVim

An IDE layer for Neovim with sane defaults. Completely free and community driven.

Lua GNU General Public License v3.0 Updated Sep 18, 2022
ProKil Public
Forked from ProKil/ProKil

Updated Sep 15, 2022
transformers Public
Forked from huggingface/transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python Apache License 2.0 Updated Sep 7, 2022
nvimdots Public
Forked from ayamir/nvimdots

A well configured and structured Neovim.

Lua MIT License Updated Sep 5, 2022
lua-nvim-config Public
Forked from wsmbsbbz/lua-nvim-config

Lua Updated Aug 30, 2022
ModelCenter Public
Forked from OpenBMB/ModelCenter

Efficient, Low-Resource, Distributed transformer implementation based on BMTrain

Python Apache License 2.0 Updated May 22, 2022
DiffCSE Public
Forked from voidism/DiffCSE

Code for the NAACL 2022 long paper "DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings"

Python MIT License Updated May 4, 2022
PromptPapers Public
Forked from thunlp/PromptPapers

Must-read papers on prompt-based tuning for pre-trained language models.

Updated Apr 26, 2022
datasciencecoursera Public
Forked from geniayuan/datasciencecoursera

for Data Science class on Coursera

Updated Nov 26, 2019

Previous Next

Footer

© 2025 GitHub, Inc.

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.