Skip to content

System requirements

Hüseyin Tuğrul BÜYÜKIŞIK edited this page Feb 7, 2021 · 13 revisions
  • At least one graphics card that supports OpenCL 1.2 and has a dedicated VRAM. Any combo including Nvdiai+Amd should work.

  • C++17 compiler option enabled (C++1y dialect for g++ compiler)

  • for the multi-threaded benchmark in main.cpp, OpenMP is needed ("gomp" library for g++ linker)

  • Some RAM that can hold active pages

    • total RAM consumed by active pages = (number of active pages per gpu instance) * (number of gpus) * (4(or custom num with memMult) instances per gpu) * (page size) * sizeof(your_object)
    • if you have 10 gpus, page size=1024, active pages per instance = 100, object size = 100 bytes, then 409MB of RAM will be used
      • so be careful when using memMult parameter (like {n1,n2,n3,..}), {50,50,50} means 150 gpu instances
    • VRAM usage only changes by number of elements of array (100M elements * 100 bytes per object = 10GB)
      • Equally distributed between graphics cards