ffge-avx512

SIMD implementation of fraction-free Gaussian elimination (FFGE) over the prime field Z_p for the Mersenne prime number p = 2^31 - 1.

Overview

This library implements fraction-free Gaussian elimination (FFGE). You can use its routines, for example, to bring a square matrix with integer coefficients to row echelon form, which is an efficient way to compute the rank of the matrix.
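
To illustrate what "fraction-free" means, here is a minimal scalar sketch of the textbook Bareiss scheme, which computes the determinant of an integer matrix using only exact integer divisions. It is not taken from the library: the function name `sketch_bareiss_det` is hypothetical, and the sketch assumes entries small enough that no intermediate product overflows `int64_t`.

```c
#include <stddef.h>
#include <stdint.h>

/*
 * Illustrative only, not part of ffge.h.
 * Determinant of an n-by-n integer matrix by the fraction-free (Bareiss)
 * scheme.  The matrix m is stored row-major and is overwritten.
 * Assumption: entries are small enough that no product overflows int64_t.
 */
int64_t sketch_bareiss_det(int64_t *m, size_t n)
{
	int64_t prev = 1;	/* pivot of the previous step */
	int sign = 1;		/* flips on every row swap */

	if (n == 0)
		return 1;	/* determinant of the empty matrix */

	for (size_t k = 0; k + 1 < n; k++) {
		/* Find a row at or below k with a nonzero entry in column k. */
		size_t piv = k;
		while (piv < n && m[piv * n + k] == 0)
			piv++;
		if (piv == n)
			return 0;	/* no pivot: the matrix is singular */
		if (piv != k) {
			for (size_t j = 0; j < n; j++) {
				int64_t t = m[k * n + j];
				m[k * n + j] = m[piv * n + j];
				m[piv * n + j] = t;
			}
			sign = -sign;
		}

		for (size_t i = k + 1; i < n; i++) {
			for (size_t j = k + 1; j < n; j++) {
				/* The division by the previous pivot is exact. */
				m[i * n + j] = (m[k * n + k] * m[i * n + j]
						- m[i * n + k] * m[k * n + j]) / prev;
			}
			m[i * n + k] = 0;
		}
		prev = m[k * n + k];
	}
	/* After the last step the bottom-right entry holds +/- det. */
	return sign * m[n * n - 1];
}
```

For the matrix with rows (1, 2, 3), (4, 5, 6), (7, 8, 10) the function returns -3. A nonzero result means the matrix is invertible over the rationals.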

As an improvement over standard FFGE, the functions provided here can perform Gaussian elimination on matrices with elements from the prime field Z_p for p = 2^31 - 1. The set Z_p contains almost all 32-bit-wide signed integers, with the exception of p, -p and -p-1. Because the computation over Z_p avoids integer division altogether, it is suitable for vectorization and can be considerably faster than standard FFGE. For a matrix whose rank is expected to be less than p, deciding whether the matrix is invertible can run up to 8 times faster on a CPU supporting the AVX-512 instruction set extension than a scalar implementation of the same algorithm.

This kind of problem motivated the development of the library and the implementation of the SIMD version of its core routines in x86-64 assembly.
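
The following scalar sketch shows why arithmetic modulo the Mersenne prime p = 2^31 - 1 needs no division: since 2^31 is congruent to 1 modulo p, a 62-bit product can be reduced with shifts and additions only. The elimination step cross-multiplies rows instead of dividing by the pivot. This is an assumption about how such a routine can be written, not the library's actual code; all names (`sketch_mul_p`, `sketch_rank_mod_p`, and so on) are hypothetical, and entries are assumed to be already reduced to the range 0 to p - 1.

```c
#include <stddef.h>
#include <stdint.h>

#define SKETCH_P ((uint64_t)0x7fffffff)	/* p = 2^31 - 1 */

/* Reduce x < 2^62 modulo p by folding the high bits onto the low bits. */
static uint32_t sketch_reduce_p(uint64_t x)
{
	x = (x & SKETCH_P) + (x >> 31);	/* now x < 2^32 */
	x = (x & SKETCH_P) + (x >> 31);	/* now x <= p + 1 */
	if (x >= SKETCH_P)
		x -= SKETCH_P;
	return (uint32_t)x;
}

/* a * b mod p, for a, b in [0, p). */
static uint32_t sketch_mul_p(uint32_t a, uint32_t b)
{
	return sketch_reduce_p((uint64_t)a * b);
}

/* a - b mod p, for a, b in [0, p). */
static uint32_t sketch_sub_p(uint32_t a, uint32_t b)
{
	return a >= b ? a - b : (uint32_t)(a + SKETCH_P - b);
}

/*
 * Illustrative only, not part of ffge.h.
 * Rank over Z_p of an n-by-n matrix m (row-major, entries in [0, p)).
 * The matrix is overwritten with a row echelon form.
 */
size_t sketch_rank_mod_p(uint32_t *m, size_t n)
{
	size_t rank = 0;

	for (size_t col = 0; col < n && rank < n; col++) {
		/* Find a pivot in this column at or below row `rank`. */
		size_t piv = rank;
		while (piv < n && m[piv * n + col] == 0)
			piv++;
		if (piv == n)
			continue;	/* column has no pivot */
		if (piv != rank) {
			for (size_t j = 0; j < n; j++) {
				uint32_t t = m[rank * n + j];
				m[rank * n + j] = m[piv * n + j];
				m[piv * n + j] = t;
			}
		}

		uint32_t a = m[rank * n + col];	/* pivot, nonzero mod p */
		for (size_t i = rank + 1; i < n; i++) {
			uint32_t b = m[i * n + col];
			/* row_i <- a * row_i - b * row_pivot, no division */
			for (size_t j = col; j < n; j++) {
				m[i * n + j] = sketch_sub_p(
					sketch_mul_p(a, m[i * n + j]),
					sketch_mul_p(b, m[rank * n + j]));
			}
		}
		rank++;
	}
	return rank;
}
```

Scaling a row by the nonzero pivot is invertible in Z_p, so the rank is preserved. The matrix is invertible over Z_p exactly when the returned rank equals n. The inner loop touches each element independently, which is what makes this form of elimination a natural fit for SIMD lanes.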

Using the library

The library's main functionality is defined in the header file ffge.h. The file also contains the documentation of the provided interface.

Compiling the source code

To obtain the source code, clone the public git repository:

git clone https://github.com/marek-miller/ffge-avx512
cd ffge-avx512

The toolchain necessary to compile the sources consists of:

the NASM assembler,
a C compiler supporting the C23 standard, e.g. GCC version 14 or later.

If you use GCC version 13, you should be able to compile the sources without issues after modifying the -std flag in the Makefile:

CFLAGS	+= -std=c2x	# etc., instead of -std=c23

To compile the library, run:

make all

To clean the compilation artifacts:

make clean

or to clean everything, including the compiled applications and libraries:

make distclean

Testing

Assuming your CPU supports AVX-512, run the library's test suite by typing:

make check
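
If you are unsure whether your CPU supports AVX-512, a small check along the following lines may help. It is a sketch that relies on the GCC/Clang builtin `__builtin_cpu_supports` and tests only the AVX-512 Foundation subset; which AVX-512 subsets the library actually requires is not stated here, and the function name `sketch_cpu_has_avx512f` is hypothetical.

```c
#include <stdbool.h>

/*
 * Illustrative only, not part of ffge.h.
 * Returns true if the running CPU reports AVX-512 Foundation support.
 * On compilers or architectures without the builtin, it reports false.
 */
bool sketch_cpu_has_avx512f(void)
{
#if (defined(__GNUC__) || defined(__clang__)) && \
	(defined(__x86_64__) || defined(__i386__))
	__builtin_cpu_init();	/* initialise the CPU feature data */
	return __builtin_cpu_supports("avx512f") != 0;
#else
	return false;
#endif
}
```

Call the function from a small program and print the result before running the test suite.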

Installation

No installation mechanism is provided yet. Simply copy the static library file libffge.a or the dynamic library libffge.so, together with the header ffge.h, to a location appropriate for your system.

API documentation

The interface is documented in the header file ffge.h.
