Skip to content
@bigcode-project

BigCode Project

BigCode Project is an open scientific collaboration run by Hugging Face and ServiceNow Research, focused on open and responsible development of LLMs for code.

Pinned Loading

  1. starcoder2 starcoder2 Public

    Home of StarCoder2!

    Python 1.9k 171

  2. starcoder starcoder Public

    Home of StarCoder: fine-tuning & inference!

    Python 7.4k 525

  3. the-stack-v2 the-stack-v2 Public

    Code for the curation of The Stack v2 and StarCoder2 training data

    Jupyter Notebook 105 8

  4. bigcode-evaluation-harness bigcode-evaluation-harness Public

    A framework for the evaluation of autoregressive code generation language models.

    Python 938 242

  5. starcoder.cpp starcoder.cpp Public

    C++ implementation for šŸ’«StarCoder

    C 454 37

  6. bigcodebench bigcodebench Public

    [ICLR'25] BigCodeBench: Benchmarking Code Generation Towards AGI

    Python 356 43

Repositories

Showing 10 of 28 repositories
  • bigcode-website Public

    Source of the website of the BigCode project.

    bigcode-project/bigcode-website’s past year of commit activity
    HTML 21 MIT 4 0 5 Updated May 7, 2025
  • bigcodebench Public

    [ICLR'25] BigCodeBench: Benchmarking Code Generation Towards AGI

    bigcode-project/bigcodebench’s past year of commit activity
    Python 356 Apache-2.0 43 8 0 Updated Apr 11, 2025
  • selfcodealign Public

    [NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generation

    bigcode-project/selfcodealign’s past year of commit activity
    Python 307 Apache-2.0 21 3 0 Updated Feb 24, 2025
  • octopack Public

    šŸ™ OctoPack: Instruction Tuning Code Large Language Models

    bigcode-project/octopack’s past year of commit activity
    Jupyter Notebook 463 MIT 27 12 0 Updated Feb 6, 2025
  • bigcode-evaluation-harness Public

    A framework for the evaluation of autoregressive code generation language models.

    bigcode-project/bigcode-evaluation-harness’s past year of commit activity
    Python 938 Apache-2.0 242 59 (2 issues need help) 35 Updated Oct 31, 2024
  • Megatron-LM Public Forked from NVIDIA/Megatron-LM

    Ongoing research training transformer models at scale

    bigcode-project/Megatron-LM’s past year of commit activity
    Python 387 2,823 21 (1 issue needs help) 9 Updated Aug 20, 2024
  • bigcode-project/bigcode-dataset’s past year of commit activity
    Jupyter Notebook 449 Apache-2.0 76 9 7 Updated Aug 15, 2024
  • bigcode-project/bigcode-inference-benchmark’s past year of commit activity
    Python 19 Apache-2.0 4 3 2 Updated Aug 10, 2024
  • bigcodebench-annotation Public

    BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions

    bigcode-project/bigcodebench-annotation’s past year of commit activity
    Jupyter Notebook 21 Apache-2.0 11 0 0 Updated Aug 8, 2024
  • the-stack-v2 Public

    Code for the curation of The Stack v2 and StarCoder2 training data

    bigcode-project/the-stack-v2’s past year of commit activity
    Jupyter Notebook 105 Apache-2.0 8 6 0 Updated Apr 11, 2024

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…