Skip to content
View DandinPower's full-sized avatar
  • Yang Ming Chiao Tung University
  • Hsinchu
  • 13:35 (UTC +08:00)
  • LinkedIn in/yongchengliaw

Highlights

  • Pro

Block or report DandinPower

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
DandinPower/README.md

👋 Hi, I'm Joseph Liaw. I am currently pursuing a PhD's degree in Department of Computer Science at National Yang Ming Chiao Tung University.

Pinned Loading

  1. BitSqueeze BitSqueeze Public

    BitSqueeze is a tiny C library for compressing float32 tensors with GGML-style integer quantization (Q8_0, Q4_0, Q2_K, NF4, NVFP4), compact floating formats (FP4, MXFP4, NF4_DQ, FP8, MXFP8, FP16, B…

    C

  2. numa-gpu-xfer-bench numa-gpu-xfer-bench Public

    This NUMA-aware benchmark provides latency and bandwidth measurements for CPU-GPU data transfers using libnuma for interleaved allocation, cudaMemcpyAsync for async operations, and concurrent multi…

    C++

  3. numa-allocation numa-allocation Public

    This package provides a NUMA-aware CPU tensor allocation function for PyTorch. It allows you to allocate tensors on specific NUMA nodes or interleave them across nodes, with optional page-lock pinn…

    Python

  4. liger-kernel-for-cpu-offloading-study liger-kernel-for-cpu-offloading-study Public

    This repository contains code, experiments, and a report for a study on the Liger Kernel and its application in memory-efficient training of Large Language Models (LLMs). The report analyzes the Li…

    Python 4