Skip to content

Navigation Menu

Explore
By company size
By use case
By industry
View all solutions
Topics
- AI
- DevOps
- Security
- Software Development
- View all
Explore
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
Pricing

Search code, repositories, users, issues, pull requests...

Search

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

Sign up

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

karpathy / llm.c Public

Notifications You must be signed in to change notification settings
Fork 3k
Star 26.3k

Code
Issues 79
Pull requests 116
Discussions
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Security
Insights

Pull requests: karpathy/llm.c

Labels 11 Milestones 0

Labels 11 Milestones 0

New pull request New

116 Open 486 Closed

116 Open 486 Closed

Author

Filter by author

Loading

Label

Filter by label

Loading

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Loading

Milestones

Filter by milestone

Loading

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Loading

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

WIP: Ngc92/llama3 dev

#802 opened Apr 13, 2025 by ngc92

Loading…

Fix gradient tests

#801 opened Apr 13, 2025 by ngc92

Loading…

Correction: memory_ops for encoder_forward

#797 opened Feb 26, 2025 by MasterSkepticista

Loading…

FIX TypeError: normal_() got an unexpected keyword argument 'generator' #723

#791 opened Dec 24, 2024 by earlytobed

Loading…

Update README.md

#789 opened Dec 14, 2024 by joeyabdalla

Loading…

A CS 231n-style port of this project, implementing LLMs solely with NumPy

#784 opened Nov 18, 2024 by davidtag

Loading…

Mapping "py" gpt2 functionalities to match "c"

#783 opened Oct 31, 2024 by omarswelam

Loading…

Verify vocab is padded before reshaping

#782 opened Oct 23, 2024 by austinleedavis

Loading…

FP32 FlashAttention

#781 opened Oct 20, 2024 by ssiu

Loading…

fix: false-positive check for nccl install on ubuntu

#775 opened Oct 2, 2024 by leiDnedyA

Loading…

1

Activation Checkpointing for Llama3 branch

#773 opened Oct 2, 2024 by ademeure

Loading…

Add repkv_backward_kernel2 and repkv_kernel2 (llama3 branch)

#771 opened Sep 28, 2024 by insop

Loading…

2

-pm -> -pi: typo in error_usage

#765 opened Sep 22, 2024 by thundergolfer

Loading…

Micro optimization for softmax_forward_kernel5

#762 opened Sep 20, 2024 by insop

Loading…

6

FP8 with Tensor Reorg

#760 opened Sep 19, 2024 by ademeure • Draft

Update download_starter_pack.sh

#758 opened Sep 18, 2024 by dongrixinyu

Loading…

Add RoPE positional encoding - llama3 feature branch

#756 opened Sep 13, 2024 by gordicaleksa

Loading…

1

Add SwiGLU support - llama3 feature branch

#755 opened Sep 13, 2024 by gordicaleksa

Loading…

3

add llama 3 support to llm.c

#754 opened Sep 13, 2024 by karpathy • Draft

Adamw thread coarsening kernel

#753 opened Sep 3, 2024 by saladpalad

Loading…

Fix sizing typo in train_gpt2_fp32.cu

#748 opened Aug 25, 2024 by gajanan-choudhary

Loading…

2

log with LINE and FILE for better addressing.

#746 opened Aug 22, 2024 by NEWPLAN

Loading…

1

Re: Fixed modal script for updated cudnn version, and read errors

#743 opened Aug 14, 2024 by vyom1611

Loading…

check libnccl instead of nccl to be more reliable

#742 opened Aug 14, 2024 by dengl11

Loading…

[WIP] initial curand implementation for model init

#741 opened Aug 13, 2024 by ngc92

Loading…

1

Previous 1 2 3 4 5 Next

Previous Next

ProTip! Follow long discussions with comments:>50.

Footer

© 2025 GitHub, Inc.

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.