Skip to content
View JenWei0312's full-sized avatar
:octocat:
Working from home
:octocat:
Working from home

Block or report JenWei0312

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. All_things_attention All_things_attention Public

    Comparison of different kinds of attentions

    Jupyter Notebook 1

  2. deepseek-moe deepseek-moe Public

    Python

  3. OLMo OLMo Public

    Forked from allenai/OLMo

    Modeling, training, eval, and inference code for OLMo

    Python

  4. huggingface/trl huggingface/trl Public

    Train transformer language models with reinforcement learning.

    Python 16.4k 2.3k

  5. allenai/OLMo allenai/OLMo Public

    Modeling, training, eval, and inference code for OLMo

    Python 6.2k 677

  6. deepseek-mla deepseek-mla Public

    Implementation of DeepSeek's Multihead Latent Attention architecture

    Python