Support more precisions #198

KowerKoint · 2025-01-24T03:58:04Z

IEEE754 float16(F16)とbinary16(F16)をサポートします．

テストとPythonバインドは未対応です．

KowerKoint · 2025-01-24T03:58:41Z

.devcontainer/cpu/Dockerfile

+    apt-get install -y gcc-13 g++-13 && \
+    update-alternatives --install /usr/bin/gcc gcc /usr/bin/gcc-13 100 && \
+    update-alternatives --install /usr/bin/g++ g++ /usr/bin/g++-13 100 && \
+    apt-get install -y --no-install-recommends \


<stdfloat>のためにg++13が必要

KowerKoint · 2025-01-24T04:00:08Z

CMakeLists.txt

+if(NOT DEFINED SCALUQ_FLOAT16)
+    set(SCALUQ_FLOAT16 OFF)
+endif(NOT DEFINED SCALUQ_FLOAT16)
+if(NOT DEFINED SCALUQ_FLOAT32)
+    set(SCALUQ_FLOAT32 ON)
+endif(NOT DEFINED SCALUQ_FLOAT32)
+if(NOT DEFINED SCALUQ_FLOAT64)
+    set(SCALUQ_FLOAT64 ON)
+endif(NOT DEFINED SCALUQ_FLOAT64)
+if(NOT DEFINED SCALUQ_BFLOAT16)
+    set(SCALUQ_BFLOAT16 OFF)
+endif(NOT DEFINED SCALUQ_BFLOAT16)


各精度コンパイルに含めるかを選択できるようになっていて，デフォルトではF32,F64がONになっています．

KowerKoint · 2025-01-24T04:01:14Z

CMakeLists.txt

-if ((${CMAKE_CXX_COMPILER_ID} STREQUAL "GNU") OR (${CMAKE_CXX_COMPILER_ID} STREQUAL "Clang") OR (${CMAKE_CXX_COMPILER_ID} STREQUAL "AppleClang"))
-    # Standard


GCCしかサポートしていないので削除

KowerKoint · 2025-01-24T04:01:32Z

CMakeLists.txt

-if ((${CMAKE_CXX_COMPILER_ID} STREQUAL "GNU") OR (${CMAKE_CXX_COMPILER_ID} STREQUAL "Clang") OR (${CMAKE_CXX_COMPILER_ID} STREQUAL "AppleClang"))
-    # Standard
+# Standard
+if(SCALUQ_USE_CUDA)
    target_compile_features(scaluq PUBLIC cxx_std_20)


NVCCはC++23に対応していない

KowerKoint · 2025-01-24T04:03:12Z

include/scaluq/types.hpp

-using StdComplex = std::complex<Fp>;
-template <std::floating_point Fp>
-using Complex = Kokkos::complex<Fp>;
+using StdComplex = std::complex<double>;


ユーザー向けはdouble std::complex<double>だけ使ってもらうことにしたので，Complexはinternalへ

KowerKoint · 2025-01-24T04:03:31Z

include/scaluq/types.hpp

+using ComplexMatrix = Eigen::Matrix<StdComplex, Eigen::Dynamic, Eigen::Dynamic, Eigen::RowMajor>;
+using SparseComplexMatrix = Eigen::SparseMatrix<StdComplex, Eigen::RowMajor>;


こいつらはユーザー向けなのであとでinternalの外へ移動

KowerKoint · 2025-01-24T04:04:28Z

include/scaluq/type/floating_point.hpp

+
+namespace scaluq {
+enum class Precision { F16, F32, F64, BF16 };
+}


`template std::floating_pointをやめてこのenumをテンプレートで指定することになります

KowerKoint · 2025-01-24T04:06:55Z

include/scaluq/type/floating_point.hpp

+template <typename T>
+struct IsFloatingPoint : public std::false_type {};
+#ifdef SCALUQ_FLOAT16
+#ifdef SCALUQ_USE_CUDA


種類 CPU CUDA

F16 std::float16_t __half

F32 std::float32_t short

F16 std::float64_t double

F16 std::bfloat16_t __nv_bfloat16

KowerKoint · 2025-01-24T04:07:55Z

include/scaluq/type/complex.hpp

Kokkos::Complexはstd::floating_pointコンセプトが条件になっていて，残念ながらCUDA独自の__half __nv_bfloat16は置けないので自作し直した:innocent:

KowerKoint · 2025-02-20T01:32:26Z

テストが通ってREADMEも整備できたので、 #200 の分をここで手動でマージします。

KowerKoint and others added 26 commits December 4, 2024 13:07

[WIP]

2e2d677

[WIP]

b721756

modify code for cuda 16bit

6c2de1e

revert devcontainer config

d07d58a

remove FLOAT128 option

9ac7d51

remove FLOAT128 option

8903c64

add Kokkos LICENSE

7a17571

sinh cosh for CPU

863f0ad

bind scaluq::Complex

48ebc68

bind ant floats

396170b

deal with cpu

423fb05

Merge remote-tracking branch 'origin' into support-more-precisions

554bf1e

merge safely & deal with cuda (except Eigen usage)

7da4463

revert devcontainer config

ec8a171

[WIP] Precision mode

fc3c30f

fix to compile StateVector

b4d4cd7

fix to compile StateVectorBatched

8acc814

[WIP] fix to support CUDA

27af605

fix to compile operator

27edc36

revert CMake change

32d9ecf

fix to compile update_ops

3326bce

fix to compile gate

651958a

fix to compile gate/paramgate

02ca9bf

fix to compile merge_gate

9f43e81

fix to compile circuit

a2f7a01

fix to compile both on cpu/cuda

76dac86

KowerKoint commented Jan 24, 2025

View reviewed changes

KowerKoint added 3 commits January 29, 2025 12:24

reform state_vector_test

29e4e23

reform state_vector_batched_test

1ab2d23

reform operator_test

5688d9b

KowerKoint added 17 commits January 31, 2025 10:22

check F16,BF16 for default developing

abaf8ff

[WIP] gate test is bad

83a5fc9

fix DenseMatrix gate test

99437dc

all test has passed on CPU!

cbcddd2

support python install

f1a2e08

fix installation cmake

13d0e6c

install libomp

5b96079

set OpenMP_ROOT

8a12c81

set LDFLAGS and CPPFLAGS

5e39cd9

RESTORE CMAKE_C_COMPILER option

781a359

Python 3.12

d13e746

update exe/main

f73d77e

fix example_project

681414a

use ubuntu2404

442663a

avoid cudamemoryerror

08cb285

fix: add KOKKOS_INLINE_FUNCTION

37f0a9a

update README

aa1f438

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support more precisions #198

Support more precisions #198

KowerKoint commented Jan 24, 2025

KowerKoint Jan 24, 2025

KowerKoint Jan 24, 2025

KowerKoint Jan 24, 2025

KowerKoint Jan 24, 2025

KowerKoint Jan 24, 2025

KowerKoint Jan 24, 2025

KowerKoint Jan 24, 2025

KowerKoint Jan 24, 2025

KowerKoint Jan 24, 2025

KowerKoint commented Feb 20, 2025

		if ((${CMAKE_CXX_COMPILER_ID} STREQUAL "GNU") OR (${CMAKE_CXX_COMPILER_ID} STREQUAL "Clang") OR (${CMAKE_CXX_COMPILER_ID} STREQUAL "AppleClang"))
		# Standard

		using ComplexMatrix = Eigen::Matrix<StdComplex, Eigen::Dynamic, Eigen::Dynamic, Eigen::RowMajor>;
		using SparseComplexMatrix = Eigen::SparseMatrix<StdComplex, Eigen::RowMajor>;

種類	CPU	CUDA
F16	`std::float16_t`	`__half`
F32	`std::float32_t`	`short`
F16	`std::float64_t`	`double`
F16	`std::bfloat16_t`	`__nv_bfloat16`

Support more precisions #198

Are you sure you want to change the base?

Support more precisions #198

Conversation

KowerKoint commented Jan 24, 2025

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

KowerKoint commented Feb 20, 2025