`copy_v_cs_from_GPU(Utils::Range<ParticleIterator<Cell**, Particle> >)` seems to make trouble with hip #2413

KaiSzuttor · 2018-12-12T19:47:32Z

see https://gitlab.icp.uni-stuttgart.de/espressomd/espresso/-/jobs/56529

mkuron · 2018-12-12T22:05:23Z

What pull request did this start with? It worked fine last week.

mkuron · 2018-12-13T08:43:11Z

I think #2390 fixes it. https://gitlab.icp.uni-stuttgart.de/espressomd/espresso/-/jobs/57294 did not crash.

KaiSzuttor · 2018-12-13T14:03:23Z

no, still have issues in #2406:
https://gitlab.icp.uni-stuttgart.de/espressomd/espresso/-/jobs/57513

mkuron · 2018-12-13T14:11:20Z

Now it only occurs in engine_lb, which makes some sense. It didn't occur in #2390, which means that it was broken by some pull request in the past two weeks.

hmenke · 2018-12-13T20:42:15Z

It might be that HIP does not like the strided memcpy we are performing in copy_v_cs_from_GPU.

espresso/src/core/cuda_common_cuda.cu

Lines 462 to 477 in 7a485cc

    
    #if defined(ENGINE) && defined(LB_GPU)
    // setup and call kernel to copy v_cs to host
    void copy_v_cs_from_GPU(ParticleRange particles) {
      if (global_part_vars_host.communication_enabled == 1 &&
          global_part_vars_host.number_of_particles) {
        // Copy result from device memory to host memory
        if (this_node == 0) {
          cuda_safe_mem(cudaMemcpy2D(
              host_v_cs, sizeof(CUDA_v_cs), particle_data_device,
              sizeof(CUDA_particle_data), sizeof(CUDA_v_cs),
              global_part_vars_host.number_of_particles, cudaMemcpyDeviceToHost));
        }
        cuda_mpi_send_v_cs(particles, host_v_cs);
      }
    }
    #endif

What we are doing here is this: instead of calling memcpy for each individual particle, we call memcpy once for all particles, but strided so that only the first few bytes of each particle are copied. That is also why v_cs must remain the first member of the CUDA_ParticleParametersSwimming structure.

espresso/src/core/cuda_interface.hpp

Lines 46 to 58 in 7a485cc

    
    // Parameters for swimmers
    #ifdef ENGINE
    typedef struct {
      // v_cs has to stay in the front for memmove reasons
      float v_cs[6];
      float v_swim;
      float f_swim;
      float director[3];
      int push_pull;
      float dipole_length;
      bool swimming;
    } CUDA_ParticleParametersSwimming;
    #endif
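
To make the strided copy concrete, here is a minimal host-only sketch that emulates what a pitched 2D copy does, using plain `std::memcpy` instead of the GPU runtime. The names `VCs`, `ParticleData`, `strided_copy` and `gather_v_cs` are hypothetical stand-ins for illustration, not the actual ESPResSo types or functions, and the particle record is shortened to a few members.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>
#include <vector>

// Hypothetical stand-in for the packed host-side record (one per particle).
struct VCs {
  float v_cs[6];
};

// Hypothetical, shortened stand-in for the per-particle device record.
struct ParticleData {
  float v_cs[6]; // must stay first: the copy starts at the beginning of each record
  float v_swim;
  float f_swim;
};

// The "first member" requirement from the thread, checked at compile time.
static_assert(offsetof(ParticleData, v_cs) == 0,
              "v_cs must be the first member");

// Host-only emulation of a pitched 2D copy: copy `width` bytes from each of
// `height` rows, where consecutive rows start `src_pitch` (source) and
// `dst_pitch` (destination) bytes apart.
void strided_copy(void *dst, std::size_t dst_pitch, const void *src,
                  std::size_t src_pitch, std::size_t width,
                  std::size_t height) {
  assert(width <= dst_pitch && width <= src_pitch);
  auto *d = static_cast<unsigned char *>(dst);
  auto const *s = static_cast<const unsigned char *>(src);
  for (std::size_t row = 0; row < height; ++row)
    std::memcpy(d + row * dst_pitch, s + row * src_pitch, width);
}

// Gather the leading v_cs bytes of every particle into a dense array.
std::vector<VCs> gather_v_cs(const std::vector<ParticleData> &particles) {
  std::vector<VCs> out(particles.size());
  strided_copy(out.data(), sizeof(VCs), particles.data(),
               sizeof(ParticleData), sizeof(VCs), particles.size());
  return out;
}
```

Each particle is treated as one "row": the row width is the size of `v_cs`, the source pitch is the size of the whole particle record, and the destination pitch equals the width, so the output is densely packed. If `v_cs` were moved away from offset 0, the copy would silently pick up the wrong bytes, which is what the `static_assert` guards against in this sketch.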

mkuron · 2018-12-13T20:44:08Z

That should be fine. Also, it used to work, so some recent change must have broken it. I won't have time to debug it before next week though.

mkuron · 2018-12-17T11:36:25Z

It's mysteriously fixed on the current master: https://gitlab.icp.uni-stuttgart.de/espressomd/espresso/-/jobs/57998. I also couldn't reproduce it on my own machine. Please close this issue for now.

RudolfWeeber closed this as completed Dec 17, 2018
