Skip to content
@IST-DASLab

IST Austria Distributed Algorithms and Systems Lab

Popular repositories Loading

  1. gptq gptq Public

    Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

    Python 2.2k 184

  2. marlin marlin Public

    FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.

    Python 945 78

  3. sparsegpt sparsegpt Public

    Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".

    Python 846 114

  4. PanzaMail PanzaMail Public

    Python 297 19

  5. qmoe qmoe Public

    Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models".

    Python 277 23

  6. llmq llmq Public

    Quantized LLM training in pure CUDA/C++.

    C++ 215 14

Repositories

Showing 10 of 70 repositories
  • CAGE Public
    IST-DASLab/CAGE’s past year of commit activity
    Python 0 MIT 0 0 0 Updated Nov 11, 2025
  • qutlass Public

    QuTLASS: CUTLASS-Powered Quantized BLAS for Deep Learning

    IST-DASLab/qutlass’s past year of commit activity
    C++ 129 Apache-2.0 10 1 0 Updated Nov 11, 2025
  • llmq Public

    Quantized LLM training in pure CUDA/C++.

    IST-DASLab/llmq’s past year of commit activity
    C++ 215 Apache-2.0 14 0 1 Updated Nov 11, 2025
  • torchtitan Public Forked from pytorch/torchtitan

    A PyTorch native platform for training generative AI models

    IST-DASLab/torchtitan’s past year of commit activity
    Python 0 BSD-3-Clause 614 0 0 Updated Nov 11, 2025
  • MoE-Quant Public

    Code for data-aware compression of DeepSeek models

    IST-DASLab/MoE-Quant’s past year of commit activity
    Python 63 10 2 1 Updated Nov 8, 2025
  • FP-Quant Public
    IST-DASLab/FP-Quant’s past year of commit activity
    Python 64 10 5 2 Updated Nov 5, 2025
  • Quartet Public
    IST-DASLab/Quartet’s past year of commit activity
    Jupyter Notebook 106 MIT 10 2 0 Updated Nov 2, 2025
  • nanochat-qat Public Forked from karpathy/nanochat

    The best ChatGPT that $100 can buy.

    IST-DASLab/nanochat-qat’s past year of commit activity
    Python 0 MIT 4,372 0 1 Updated Oct 31, 2025
  • CAGE-ao Public Forked from pytorch/ao

    PyTorch native quantization and sparsity for training and inference

    IST-DASLab/CAGE-ao’s past year of commit activity
    Python 0 363 0 0 Updated Oct 23, 2025
  • unified-sc-laws Public

    This repository contains the code for the "Unified Scaling Laws for Compressed Representations" study.

    IST-DASLab/unified-sc-laws’s past year of commit activity
    Python 0 MIT 0 0 0 Updated Oct 23, 2025

Top languages

Loading…

Most used topics

Loading…