GitHub - avinashcpandey/Bolt: Bolt is a C++ template library optimized for GPUs. Bolt provides high-performance library implementations for common algorithms such as scan, reduce, transform, and sort.

Bolt is a C++ template library optimized for heterogeneous computing. Bolt is designed to provide high-performance library implementations for common algorithms such as scan, reduce, transform, and sort. The Bolt interface was modeled on the C++ Standard Template Library (STL). Developers familiar with the STL will recognize many of the Bolt APIs and customization techniques.

The primary goal of Bolt is to make it easier for developers to utilize the inherent performance and power efficiency benefits of heterogeneous computing. It has interfaces that are easy to use, and has comprehensive documentation for the library routines, memory management, control interfaces, and host/device code sharing.

Compared to writing the equivalent functionality in OpenCL™, you’ll find that Bolt requires significantly fewer lines-of-code and less developer effort. Bolt is designed to provide a standard way to develop an application that can execute on either a regular CPU, or use any available OpenCL™ capable accelerated compute unit, with a single code path.

Here's a link to our BOLT wiki page.

Prerequisites

Windows

Visual Studio 2010 onwards (VS2012 onwards for C++ AMP)
Tested with 32/64 bit Windows® 7/8 and Windows® Blue
CMake 2.8.12
TBB (For Multicore CPU path only) (4.1 Update 1 or Above) . See Building Bolt with TBB.
APP SDK 2.8 or onwards.

Note: If the user has installed both Visual Studio 2012 and Visual Studio 2010, the latter should be updated to SP1.

Linux

GCC 4.6.3 and above
Tested with OpenSuse 13.1 64-bit, RHEL 6.4 32/64 bit, Ubuntu 13.10 64-bit
CMake 2.8.12
TBB (For Multicore CPU path only) (4.1 Update 1 or Above) . See Building Bolt with TBB.
APP SDK 2.8 onwards.

Note: Bolt pre-built binaries for Linux are build with GCC 4.7.3, same version should be used for Application building else user has to build Bolt from source with GCC 4.6.3 or higher.

Catalyst™ package

The latest Catalyst driver contains the most recent OpenCL runtime. Recommended Catalyst package is latest 13.11 Beta Driver.

13.4 and higher is supported.

Note: 13.9 in not supported.

Supported Devices

AMD APU Family with AMD Radeon™ HD Graphics

A-Series
C-Series
E-Series
E2-Series
G-Series
R-Series

AMD Radeon™ HD Graphics

7900 Series (7990, 7970, 7950)
7800 Series (7870, 7850)
7700 Series (7770, 7750)
6900 Series (6990, 6970, 6950)
6800 Series (6870, 6850)
6700 Series (6790 , 6770, 6750)
6600 Series (6670)
6500 Series (6570)
6400 Series (6450)
6xxxM Series

AMD Radeon™ Rx 2xx Graphics

R9 2xx Series
R8 2xx Series
R7 2xx Series

Compiled binary windows packages (zip packages) for Bolt may be downloaded from the Bolt landing page hosted on AMD's Developer Central website.

Examples

The simple example below shows how to use Bolt to sort a random array of 8192 integers.

#include <bolt/cl/sort.h>
#include <vector>
#include <algorithm>

int main ()
{
    // generate random data (on host)
    size_t length = 8192
    std::vector<int> a (length);
    std::generate ( a.begin (), a.end(), rand );

    // sort, run on best device in the platform
    bolt::cl::sort(a.begin(), a.end());
    return 0;
}

The code will be familiar to programmers who have used the C++ Standard Template Library; the difference is the include file (bolt/cl/sort.h) and the bolt::cl namespace before the sort call. Bolt developers do not need to learn a new device-specific programming model to leverage the power and performance advantages of heterogeneous computing.

#include <bolt/cl/device_vector.h>
#include <bolt/cl/scan.h>
#include <vector>
#include <numeric>

int main()
{
  size_t length = 1024;
  // Create device_vector and initialize it to 1
  bolt::cl::device_vector< int > boltInput( length, 1 );

  // Calculate the inclusive_scan of the device_vector
  bolt::cl::inclusive_scan(boltInput.begin(),boltInput.end(),boltInput.begin( ) );

  // Create an std vector and initialize it to 1
  std::vector<int> stdInput( length, 1 );
 
  // Calculate the inclusive_scan of the std vector
  bolt::cl::inclusive_scan(stdInput.begin( ),stdInput.end( ),stdInput.begin( ) );
  return 0;
}

This example shows how Bolt simplifies management of heterogeneous memory. The creation and destruction of device resident memory is abstracted inside of the bolt::cl::device_vector <> class, which provides an interface familiar to nearly all C++ programmers. All of Bolt’s provided algorithms can take either the normal std::vector or the bolt::cl::device_vector<> class, which allows the user to control when and where memory is transferred between host and device to optimize performance.

Copyright and Licensing information

Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.

Name		Name	Last commit message	Last commit date
Latest commit History 703 Commits
automation		automation
bench		bench
benchmarkPackage		benchmarkPackage
bin		bin
bolt		bolt
doxy		doxy
examples		examples
include/bolt		include/bolt
superbuild		superbuild
test		test
tools		tools
.gitignore		.gitignore
CMakeLists.txt		CMakeLists.txt
FindOpenCL.cmake		FindOpenCL.cmake
FindTBB.cmake		FindTBB.cmake
LICENSE.txt		LICENSE.txt
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Prerequisites

Windows

Linux

Catalyst™ package

Supported Devices

Examples

Copyright and Licensing information

About

Releases

Packages

Languages

License

avinashcpandey/Bolt

Folders and files

Latest commit

History

Repository files navigation

Prerequisites

Windows

Linux

Catalyst™ package

Supported Devices

Examples

Copyright and Licensing information

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages