nadavrot

Pinned

  1. Efficient matrix multiplication

    # High-Performance Matrix Multiplication

    This is a short post that explains how to write a high-performance matrix
    multiplication program on modern processors. In this tutorial I will use a
    single core of the Skylake-client CPU with AVX2, but the principles in this post…

  2. bistra Public

    Bistra is a domain-specific language designed to generate high-performance kernels (such as GEMMs, convolutions, etc). The program is designed to allow powerful compiler optimizations and code gene…

    C++ 6

  3. arpfloat Public

    An arbitrary-precision floating-point library in Rust

    Rust 43 6

  4. fast_log Public

    A fast implementation of log() and exp()

    C 49 3

  5. memset_benchmark Public

    This repository contains high-performance implementations of memset and memcpy in assembly.

    Assembly 307 13

  6. compressor Public

    An educational implementation of a modern compressor in Rust

    Rust 46 2