- Tsinghua University (Graduated)
- California, USA
- https://yushengsu-thu.github.io/
- @thu_yushengsu
-
verl Public
Forked from volcengine/verl
verl: Volcano Engine Reinforcement Learning for LLMs
-
yushengsu-thu.github.io Public
Forked from academicpages/academicpages.github.io
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
-
dolma Public
Forked from allenai/dolma
Data and tools for generating and inspecting OLMo pre-training data.
Python Apache License 2.0 Updated Sep 20, 2024
-
llama-recipes Public
Forked from meta-llama/llama-cookbook
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supportin…
Jupyter Notebook Updated Sep 18, 2024
-
Liger-Kernel Public
Forked from linkedin/Liger-Kernel
Efficient Triton Kernels for LLM Training
-
llm.c Public
Forked from karpathy/llm.c
LLM training in simple, raw C/CUDA
-
Megatron-LLM Public
Forked from epfLLM/Megatron-LLM
Distributed trainer for LLMs
-
PET_Scaling Public
Exploring the Impact of Model Scaling on Parameter-efficient Tuning Methods
-
d2l-en Public
Forked from d2l-ai/d2l-en
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
Python Other Updated Mar 17, 2024
-
Embodied-Agents Public
A curated list of "Embodied Agents" research. Read this repository for the latest updates. Feel free to open pull requests and start a discussion!
4 Updated Nov 8, 2023
-
Scaling-Science Public
Science-driven scaling: to pursue the scientific principles behind scaling and use them to guide next-generation model development, where the subareas include data engineering, long context, efficiency…
3 Updated Nov 2, 2023
-
LLM-Agent-Survey Public
Forked from Paitesanshi/LLM-Agent-Survey
Add AgentVerse paper link
Updated Sep 14, 2023
-
Voyager_fix_readme Public
Forked from MineDojo/Voyager
An Open-Ended Embodied Agent with Large Language Models
JavaScript MIT License Updated Aug 3, 2023
-
BMTrain Public
Forked from OpenBMB/BMTrain
Efficient Training (including pre-training and fine-tuning) for Big Models
Python Apache License 2.0 Updated Feb 9, 2023
-
LunarVim Public
Forked from LunarVim/LunarVim
An IDE layer for Neovim with sane defaults. Completely free and community driven.
Lua GNU General Public License v3.0 Updated Sep 18, 2022
-
transformers Public
Forked from huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
Python Apache License 2.0 Updated Sep 7, 2022
-
nvimdots Public
Forked from ayamir/nvimdots
A well configured and structured Neovim.
Lua MIT License Updated Sep 5, 2022
-
ModelCenter Public
Forked from OpenBMB/ModelCenter
Efficient, Low-Resource, Distributed transformer implementation based on BMTrain
Python Apache License 2.0 Updated May 22, 2022
-
DiffCSE Public
Forked from voidism/DiffCSE
Code for the NAACL 2022 long paper "DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings"
Python MIT License Updated May 4, 2022
-
PromptPapers Public
Forked from thunlp/PromptPapers
Must-read papers on prompt-based tuning for pre-trained language models.
Updated Apr 26, 2022
-
datasciencecoursera Public
Forked from geniayuan/datasciencecoursera
For Data Science class on Coursera
Updated Nov 26, 2019