Skip to content
Change the repository type filter

All

    Repositories list

    • Python
      13331Updated Jan 5, 2025Jan 5, 2025
    • Project provides an API for correcting mistakes in written essays using AI model
      Python
      0100Updated Dec 20, 2024Dec 20, 2024
    • RAG-Eval

      Public
      R
      0000Updated Dec 18, 2024Dec 18, 2024
    • Minimalistic large language model 3D-parallelism training
      Python
      Apache License 2.0
      135000Updated Dec 16, 2024Dec 16, 2024
    • An introduction to LLM Sampling
      Jupyter Notebook
      MIT License
      27400Updated Dec 15, 2024Dec 15, 2024
    • Jupyter Notebook
      Apache License 2.0
      0300Updated Dec 12, 2024Dec 12, 2024
    • Testing language adherence of models, i.e. how much they are able to continue the generation in the desired output language.
      R
      Apache License 2.0
      0000Updated Dec 5, 2024Dec 5, 2024
    • logos

      Public
      0000Updated Dec 4, 2024Dec 4, 2024
    • The official repository for Toxic Commons and Celadon. Toxicity Classification for public domain data.
      Python
      MIT License
      11210Updated Nov 10, 2024Nov 10, 2024
    • Adaptation of nanotron to run on tractoai
      Python
      Apache License 2.0
      135002Updated Oct 28, 2024Oct 28, 2024
    • Set of scripts to generate synthetic correction/rewriting of problematic OCR
      0100Updated Apr 12, 2024Apr 12, 2024
    • Set of scripts to finetune LLMs
      Python
      23600Updated Mar 30, 2024Mar 30, 2024
    • Jupyter Notebook
      MIT License
      56840Updated Mar 4, 2024Mar 4, 2024
    • OCRoscope

      Public
      Small python package to measure OCR quality and other related metrics.
      Python
      MIT License
      22110Updated Feb 19, 2024Feb 19, 2024
    • MIT License
      0000Updated Feb 18, 2024Feb 18, 2024