Speed up coh_tmm in tmm_core_vec; introduced versions with cpu and gpu parallelization #271

arsonwong · 2024-03-24T07:57:56Z

s and p polarization, 10000 wavelength x angles, 6 layers, calculate coh_tmm:
before speed increase: 4.178s
after speed increase: 2.399s
after speed increase, non-detailed mode: 1.948s
CPU parallelization with 24 cores
CPU parallelization: 0.844s
CPU parallelization, non-detailed mode: 0.719s
GPU parallelization with NVIDIA GeForce RTX 4060
GPU parallelization: 0.296s
GPU parallelization, non-detailed mode: 0.118s

…u parallelization

phoebe-p · 2024-05-07T05:01:54Z

sorry for my lack of input on this. I was wondering, for the parallelisation, what do you think the best way to incorporate this would be? I guess there are now three options: no parallelisation (i.e. just the old implementation), GPU parallelisation, or CPU parallelisation. Then there's the detailed vs. non-detailed mode. The user should be able to choose which one they want to use, but it would be better not to have three different files tmm_core_vec files, since most of the content is the same anyway.

johnsonkcwong added 7 commits March 24, 2024 00:37

Speed up coh_tmm in tmm_core_vec; introduced versions with cpu and gp…

0bf78f1

…u parallelization

debug

4b89be0

make the various coh_tmm implementations vectorized over angles

d665991

introduced differential

27fc690

nk parameterized

a470ed2

deleted prints

7d393d4

finished nk differential

a11ff04

johnsonkcwong added 5 commits May 9, 2024 10:27

slight debug

7096da3

debugged differential

3c6239d

allow nk_parameter to be set

c92f8bd

debugged the case where there is no nk paramter

905d9d6

make load nk data

b62f7be

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Speed up coh_tmm in tmm_core_vec; introduced versions with cpu and gpu parallelization #271

Speed up coh_tmm in tmm_core_vec; introduced versions with cpu and gpu parallelization #271

arsonwong commented Mar 24, 2024

phoebe-p commented May 7, 2024

Speed up coh_tmm in tmm_core_vec; introduced versions with cpu and gpu parallelization #271

Are you sure you want to change the base?

Speed up coh_tmm in tmm_core_vec; introduced versions with cpu and gpu parallelization #271

Conversation

arsonwong commented Mar 24, 2024

phoebe-p commented May 7, 2024