Add vectorization to the par_vec (aka par_unseq) implementations of the parallel algorithms #2271

brycelelbach · 2016-07-29T19:50:29Z

The par_vec (aka par_unseq) policy allows interleaving of element access functions, e.g. it is safe to the iterations of the algorithm.

Explicit engagement of compiler vectorizers through pragmas is probably the best way to ensure this occurs (e.g. #pragma simd, #pragma omp simd).

I will probably take a look into doing this myself while preparing my CppCon talk on parallel algorithms.

The text was updated successfully, but these errors were encountered:

diehlpk · 2017-01-24T09:50:27Z

@brycelelbach @hkaiser Could you please add a project description here https://github.com/STEllAR-GROUP/hpx/wiki/GSoC-2017-Project-Ideas

Johan511 · 2023-03-01T06:30:23Z

I am interested in working on this project. I have seen that in the previous PRs we have added openMP pragmas for vectorization and parallelisation of a loop. Can someone guide me on how I can start out with working on this issue?

hkaiser · 2023-03-01T12:39:27Z

I am interested in working on this project. I have seen that in the previous PRs we have added openMP pragmas for vectorization and parallelisation of a loop. Can someone guide me on how I can start out with working on this issue?

Yes, we have implemented this for the first batch of algorithms. There are still algorithms left that have not been touched, though. Also, we would need a thorough performance analysis of the existing implementation, combined with improvements, if needed.

Johan511 · 2023-03-12T07:06:19Z

trkk28097402 · 2024-03-06T16:58:53Z

Hello @hkaiser , I am interest in this topic on gsoc24 ,I have a qeustion.
Is this restricted to only use the #pragma omp simd to vectorize or using something like __m128d, __m256d, some SIMD instructions are unreadable.

hkaiser · 2024-03-09T00:25:07Z

Hello @hkaiser , I am interest in this topic on gsoc24 ,I have a qeustion. Is this restricted to only use the #pragma omp simd to vectorize or using something like __m128d, __m256d, some SIMD instructions are unreadable.

Everything is possible, I guess - as long as it is portable across architectures (beyond x86), at least in the long run.

hkaiser added type: compatibility issue category: algorithms labels Jul 29, 2016

hkaiser added this to the 1.0.0 milestone Jul 29, 2016

hkaiser added the project: GSoC label Jan 17, 2017

hkaiser modified the milestones: 1.0.0, 1.1.0 Apr 23, 2017

msimberg removed this from the 1.1.0 milestone Nov 21, 2017

hkaiser mentioned this issue Dec 8, 2017

First steps towards implementing execution par_unseq #3063

Closed

hkaiser added the tag: pinned Never close as stale label Jun 30, 2019

hkaiser mentioned this issue Feb 9, 2021

Separate the datapar algorithms #5157

Closed

akcube mentioned this issue Sep 18, 2022

Config and structural updates to support unseq implementation #6016

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add vectorization to the par_vec (aka par_unseq) implementations of the parallel algorithms #2271

Add vectorization to the par_vec (aka par_unseq) implementations of the parallel algorithms #2271

brycelelbach commented Jul 29, 2016 •

edited by hkaiser

Loading

diehlpk commented Jan 24, 2017

Johan511 commented Mar 1, 2023

hkaiser commented Mar 1, 2023

Johan511 commented Mar 12, 2023 •

edited

Loading

trkk28097402 commented Mar 6, 2024

hkaiser commented Mar 9, 2024

Add vectorization to the par_vec (aka par_unseq) implementations of the parallel algorithms #2271

Add vectorization to the par_vec (aka par_unseq) implementations of the parallel algorithms #2271

Comments

brycelelbach commented Jul 29, 2016 • edited by hkaiser Loading

diehlpk commented Jan 24, 2017

Johan511 commented Mar 1, 2023

hkaiser commented Mar 1, 2023

Johan511 commented Mar 12, 2023 • edited Loading

trkk28097402 commented Mar 6, 2024

hkaiser commented Mar 9, 2024

brycelelbach commented Jul 29, 2016 •

edited by hkaiser

Loading

Johan511 commented Mar 12, 2023 •

edited

Loading