tandem version v1.0-66-gabde5ef

stack size limit = unlimited

Worker affinity
    0---------|----------|----------|----------|----------|----------|
    ----

DOFs: 184320
Mesh size: 0.0348181
Multigrid P-levels: 1 2
Assembly: 0.647582 s
Solver warmup: 0.00396788 s
  0 KSP unpreconditioned resid norm 7.740999135093e+02 true resid norm 7.740999135093e+02 ||r(i)||/||b|| 1.000000000000e+00
    Residual norms for mg_levels_1_ solve.
    0 KSP Residual norm 1.000000000000e+00
    1 KSP Residual norm 2.125766523215e+01
    2 KSP Residual norm 4.918679493969e+02
    3 KSP Residual norm 1.431131535909e+04
    4 KSP Residual norm 6.744524619233e+05
    5 KSP Residual norm 4.482909473315e+07
    6 KSP Residual norm 3.294745190064e+09
    7 KSP Residual norm 2.488244728310e+11
    8 KSP Residual norm 1.901337270528e+13
    9 KSP Residual norm 1.464324870949e+15
   10 KSP Residual norm 1.134988695804e+17
    Residual norms for mg_coarse_ solve.
    0 KSP preconditioned resid norm 1.124192366962e+17 true resid norm 1.124192366962e+17 ||r(i)||/||b|| 1.000000000000e+00
    1 KSP preconditioned resid norm 2.337205575660e+17 true resid norm 2.337205575660e+17 ||r(i)||/||b|| 2.079008579267e+00
    2 KSP preconditioned resid norm 4.983016809232e+17 true resid norm 4.983016809232e+17 ||r(i)||/||b|| 4.432530370847e+00
    3 KSP preconditioned resid norm 1.072815406872e+18 true resid norm 1.072815406872e+18 ||r(i)||/||b|| 9.542987823085e+00
    4 KSP preconditioned resid norm 2.347643024292e+18 true resid norm 2.347643024292e+18 ||r(i)||/||b|| 2.088292976617e+01
    5 KSP preconditioned resid norm 5.364982452927e+18 true resid norm 5.364982452927e+18 ||r(i)||/||b|| 4.772299306236e+01
    6 KSP preconditioned resid norm 1.366148444326e+19 true resid norm 1.366148444326e+19 ||r(i)||/||b|| 1.215226579076e+02
    7 KSP preconditioned resid norm 4.167197685366e+19 true resid norm 4.167197685366e+19 ||r(i)||/||b|| 3.706836843795e+02
    8 KSP preconditioned resid norm 1.496708466632e+20 true resid norm 1.496708466632e+20 ||r(i)||/||b|| 1.331363306333e+03
    9 KSP preconditioned resid norm 5.842028502026e+20 true resid norm 5.842028502026e+20 ||r(i)||/||b|| 5.196644874769e+03
   10 KSP preconditioned resid norm 2.350142261663e+21 true resid norm 2.350142261663e+21 ||r(i)||/||b|| 2.090516116930e+04
    Linear mg_coarse_ solve did not converge due to DIVERGED_DTOL iterations 10
    Residual norms for mg_levels_1_ solve.
    0 KSP Residual norm 1.134988695804e+17
    1 KSP Residual norm 8.845712201334e+18
    2 KSP Residual norm 6.927269425188e+20
    3 KSP Residual norm 5.448430156694e+22
    4 KSP Residual norm 4.303624514021e+24
    5 KSP Residual norm 3.417083258819e+26
    6 KSP Residual norm 2.736572463071e+28
    7 KSP Residual norm 2.231147858108e+30
    8 KSP Residual norm 1.892347213587e+32
    9 KSP Residual norm 1.734509591198e+34
   10 KSP Residual norm 1.783038890577e+36
  1 KSP unpreconditioned resid norm 7.740797104470e+02 true resid norm 7.740797104470e+02 ||r(i)||/||b|| 9.999739012213e-01
    Residual norms for mg_levels_1_ solve.
    0 KSP Residual norm 1.000000000000e+00
    1 KSP Residual norm 1.151003516572e+02
    2 KSP Residual norm 1.427296536654e+04
    3 KSP Residual norm 1.836561736579e+06
    4 KSP Residual norm 2.401274286602e+08
    5 KSP Residual norm 3.160943242702e+10
    6 KSP Residual norm 4.173702929458e+12
    7 KSP Residual norm 5.519763120503e+14
    8 KSP Residual norm 7.307239403673e+16
    9 KSP Residual norm 9.680646040851e+18
   10 KSP Residual norm 1.283258631142e+21
    Residual norms for mg_coarse_ solve.
    0 KSP preconditioned resid norm 1.124192366962e+17 true resid norm 1.124192366962e+17 ||r(i)||/||b|| 1.000000000000e+00
    1 KSP preconditioned resid norm 2.337205575660e+17 true resid norm 2.337205575660e+17 ||r(i)||/||b|| 2.079008579267e+00
    2 KSP preconditioned resid norm 4.983016809232e+17 true resid norm 4.983016809232e+17 ||r(i)||/||b|| 4.432530370847e+00
    3 KSP preconditioned resid norm 1.072815406872e+18 true resid norm 1.072815406872e+18 ||r(i)||/||b|| 9.542987823085e+00
    4 KSP preconditioned resid norm 2.347643024292e+18 true resid norm 2.347643024292e+18 ||r(i)||/||b|| 2.088292976617e+01
    5 KSP preconditioned resid norm 5.364982452927e+18 true resid norm 5.364982452927e+18 ||r(i)||/||b|| 4.772299306236e+01
    6 KSP preconditioned resid norm 1.366148444326e+19 true resid norm 1.366148444326e+19 ||r(i)||/||b|| 1.215226579076e+02
    7 KSP preconditioned resid norm 4.167197685366e+19 true resid norm 4.167197685366e+19 ||r(i)||/||b|| 3.706836843795e+02
    8 KSP preconditioned resid norm 1.496708466632e+20 true resid norm 1.496708466632e+20 ||r(i)||/||b|| 1.331363306333e+03
    9 KSP preconditioned resid norm 5.842028502026e+20 true resid norm 5.842028502026e+20 ||r(i)||/||b|| 5.196644874769e+03
   10 KSP preconditioned resid norm 2.350142261663e+21 true resid norm 2.350142261663e+21 ||r(i)||/||b|| 2.090516116930e+04
    Linear mg_coarse_ solve did not converge due to DIVERGED_DTOL iterations 10
    Residual norms for mg_levels_1_ solve.
    0 KSP Residual norm 1.283258631142e+21
    1 KSP Residual norm 1.701951390036e+23
    2 KSP Residual norm 2.258282430084e+25
    3 KSP Residual norm 2.997700373313e+27
    4 KSP Residual norm 3.980716341650e+29
    5 KSP Residual norm 5.287907539888e+31
    6 KSP Residual norm 7.026587167601e+33
    7 KSP Residual norm 9.339696007986e+35
    8 KSP Residual norm 1.241765656324e+38
    9 KSP Residual norm 1.651417804158e+40
   10 KSP Residual norm 2.196733058603e+42
  2 KSP unpreconditioned resid norm 7.740538025268e+02 true resid norm 7.740538025268e+02 ||r(i)||/||b|| 9.999404327766e-01
    Residual norms for mg_levels_1_ solve.
    0 KSP Residual norm 1.000000000000e+00
    1 KSP Residual norm 1.107996122160e+02
    2 KSP Residual norm 1.350745154613e+04
    3 KSP Residual norm 1.733702045889e+06
    4 KSP Residual norm 2.276047547192e+08
    5 KSP Residual norm 3.015484060469e+10
    6 KSP Residual norm 4.009989312046e+12
    7 KSP Residual norm 5.341028612959e+14
    8 KSP Residual norm 7.119382334090e+16
    9 KSP Residual norm 9.493923371653e+18
   10 KSP Residual norm 1.266386436302e+21
    Residual norms for mg_coarse_ solve.
    0 KSP preconditioned resid norm 1.124192366962e+17 true resid norm 1.124192366962e+17 ||r(i)||/||b|| 1.000000000000e+00
    1 KSP preconditioned resid norm 2.337205575660e+17 true resid norm 2.337205575660e+17 ||r(i)||/||b|| 2.079008579267e+00
    2 KSP preconditioned resid norm 4.983016809232e+17 true resid norm 4.983016809232e+17 ||r(i)||/||b|| 4.432530370847e+00
    3 KSP preconditioned resid norm 1.072815406872e+18 true resid norm 1.072815406872e+18 ||r(i)||/||b|| 9.542987823085e+00
    4 KSP preconditioned resid norm 2.347643024292e+18 true resid norm 2.347643024292e+18 ||r(i)||/||b|| 2.088292976617e+01
    5 KSP preconditioned resid norm 5.364982452927e+18 true resid norm 5.364982452927e+18 ||r(i)||/||b|| 4.772299306236e+01
    6 KSP preconditioned resid norm 1.366148444326e+19 true resid norm 1.366148444326e+19 ||r(i)||/||b|| 1.215226579076e+02
    7 KSP preconditioned resid norm 4.167197685366e+19 true resid norm 4.167197685366e+19 ||r(i)||/||b|| 3.706836843795e+02
    8 KSP preconditioned resid norm 1.496708466632e+20 true resid norm 1.496708466632e+20 ||r(i)||/||b|| 1.331363306333e+03
    9 KSP preconditioned resid norm 5.842028502026e+20 true resid norm 5.842028502026e+20 ||r(i)||/||b|| 5.196644874769e+03
   10 KSP preconditioned resid norm 2.350142261663e+21 true resid norm 2.350142261663e+21 ||r(i)||/||b|| 2.090516116930e+04
    Linear mg_coarse_ solve did not converge due to DIVERGED_DTOL iterations 10
    Residual norms for mg_levels_1_ solve.
    0 KSP Residual norm 1.266386436302e+21
    1 KSP Residual norm 1.689537073423e+23
    2 KSP Residual norm 2.254386651558e+25
    3 KSP Residual norm 3.008387329606e+27
    4 KSP Residual norm 4.014887335851e+29
    5 KSP Residual norm 5.358453904941e+31
    6 KSP Residual norm 7.151979787446e+33
    7 KSP Residual norm 9.546170370823e+35
    8 KSP Residual norm 1.274220861122e+38
    9 KSP Residual norm 1.700866208223e+40
   10 KSP Residual norm 2.270405297272e+42
  3 KSP unpreconditioned resid norm 7.740481489631e+02 true resid norm 7.740481489631e+02 ||r(i)||/||b|| 9.999331293736e-01
    Residual norms for mg_levels_1_ solve.
    0 KSP Residual norm 1.000000000000e+00
    1 KSP Residual norm 1.252436755936e+02
    2 KSP Residual norm 1.600512470503e+04
    3 KSP Residual norm 2.062448168712e+06
    4 KSP Residual norm 2.666859709080e+08
    5 KSP Residual norm 3.453616820751e+10
    6 KSP Residual norm 4.475938888796e+12
    7 KSP Residual norm 5.803759877522e+14
    8 KSP Residual norm 7.528463154034e+16
    9 KSP Residual norm 9.769241474870e+18
   10 KSP Residual norm 1.268149801465e+21
    Residual norms for mg_coarse_ solve.
    0 KSP preconditioned resid norm 1.124192366962e+17 true resid norm 1.124192366962e+17 ||r(i)||/||b|| 1.000000000000e+00
    1 KSP preconditioned resid norm 2.337205575660e+17 true resid norm 2.337205575660e+17 ||r(i)||/||b|| 2.079008579267e+00
    2 KSP preconditioned resid norm 4.983016809232e+17 true resid norm 4.983016809232e+17 ||r(i)||/||b|| 4.432530370847e+00
    3 KSP preconditioned resid norm 1.072815406872e+18 true resid norm 1.072815406872e+18 ||r(i)||/||b|| 9.542987823085e+00
    4 KSP preconditioned resid norm 2.347643024292e+18 true resid norm 2.347643024292e+18 ||r(i)||/||b|| 2.088292976617e+01
    5 KSP preconditioned resid norm 5.364982452927e+18 true resid norm 5.364982452927e+18 ||r(i)||/||b|| 4.772299306236e+01
    6 KSP preconditioned resid norm 1.366148444326e+19 true resid norm 1.366148444326e+19 ||r(i)||/||b|| 1.215226579076e+02
    7 KSP preconditioned resid norm 4.167197685366e+19 true resid norm 4.167197685366e+19 ||r(i)||/||b|| 3.706836843795e+02
    8 KSP preconditioned resid norm 1.496708466632e+20 true resid norm 1.496708466632e+20 ||r(i)||/||b|| 1.331363306333e+03
    9 KSP preconditioned resid norm 5.842028502026e+20 true resid norm 5.842028502026e+20 ||r(i)||/||b|| 5.196644874769e+03
   10 KSP preconditioned resid norm 2.350142261663e+21 true resid norm 2.350142261663e+21 ||r(i)||/||b|| 2.090516116930e+04
    Linear mg_coarse_ solve did not converge due to DIVERGED_DTOL iterations 10
    Residual norms for mg_levels_1_ solve.
    0 KSP Residual norm -nan
    1 KSP Residual norm -nan
    2 KSP Residual norm -nan
    3 KSP Residual norm -nan
    4 KSP Residual norm -nan
    5 KSP Residual norm -nan
    6 KSP Residual norm -nan
    7 KSP Residual norm -nan
    8 KSP Residual norm -nan
    9 KSP Residual norm -nan
KSP Object: 1 MPI process
  type: fgmres
    restart=30, using Classical (unmodified) Gram-Schmidt Orthogonalization with no iterative refinement
    happy breakdown tolerance 1e-30
  maximum iterations=10, initial guess is zero
  tolerances: relative=1e-09, absolute=1e-50, divergence=10000.
  right preconditioning
  using UNPRECONDITIONED norm type for convergence test
PC Object: 1 MPI process
  type: mg
    type is MULTIPLICATIVE, levels=2 cycles=v
      Cycles per PCApply=1
      Not using Galerkin computed coarse grid matrices
  Coarse grid solver -- level 0 -------------------------------
    KSP Object: (mg_coarse_) 1 MPI process
      type: richardson
        damping factor=0.01
      maximum iterations=10, initial guess is zero
      tolerances: relative=0.5, absolute=1e-50, divergence=10000.
      left preconditioning
      using PRECONDITIONED norm type for convergence test
    PC Object: (mg_coarse_) 1 MPI process
      type: none
      linear system matrix = precond matrix:
      Mat Object: 1 MPI process
        type: seqaijcusparse
        rows=92160, cols=92160
        total: nonzeros=2194560, allocated nonzeros=2194560
        total number of mallocs used during MatSetValues calls=0
          not using I-node routines
  Down solver (pre-smoother) on level 1 -------------------------------
    KSP Object: (mg_levels_1_) 1 MPI process
      type: richardson
        damping factor=0.3
      maximum iterations=10, nonzero initial guess
      tolerances: relative=1e-05, absolute=1e-50, divergence=10000.
      left preconditioning
      using PRECONDITIONED norm type for convergence test
    PC Object: (mg_levels_1_) 1 MPI process
      type: none
      linear system matrix followed by preconditioner matrix:
      Mat Object: 1 MPI process
        type: shell
        rows=184320, cols=184320, bs=12
      Mat Object: 1 MPI process
        type: seqaijcusparse
        rows=184320, cols=184320, bs=12
        total: nonzeros=8778240, allocated nonzeros=8778240
        total number of mallocs used during MatSetValues calls=0
          not using I-node routines
  Up solver (post-smoother) same as down solver (pre-smoother)
  linear system matrix followed by preconditioner matrix:
  Mat Object: 1 MPI process
    type: shell
    rows=184320, cols=184320, bs=12
  Mat Object: 1 MPI process
    type: seqaijcusparse
    rows=184320, cols=184320, bs=12
    total: nonzeros=8778240, allocated nonzeros=8778240
    total number of mallocs used during MatSetValues calls=0
      not using I-node routines
Solve: 2.05049 s
Solver did not converge.
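Note on the residual histories above: both the `mg_levels_1_` smoother and the `mg_coarse_` solve are unpreconditioned Richardson iterations (`pc_type none`), and their residual norms grow by a roughly constant factor per step instead of shrinking. That is the behaviour one would expect from Richardson iteration, x_{k+1} = x_k + ω (b − A x_k), when the damping factor ω is too large for the spectrum of A: the residual obeys r_{k+1} = (I − ω A) r_k, and for a symmetric positive definite A it only contracts if 0 < ω < 2/λ_max. Whether that is the actual cause in this run is an assumption, not something the log proves. The sketch below is not tandem or PETSc code; it uses a made-up 3×3 diagonal matrix and made-up damping factors purely to illustrate the threshold.

```python
import math


def matvec(A, x):
    """Dense matrix-vector product for a matrix stored as a list of rows."""
    return [sum(a_ij * x_j for a_ij, x_j in zip(row, x)) for row in A]


def norm(v):
    """Euclidean norm of a vector."""
    return math.sqrt(sum(v_i * v_i for v_i in v))


def richardson_residual_norms(A, b, omega, iterations):
    """Run unpreconditioned Richardson from a zero initial guess.

    Returns the residual norms ||b - A x_k|| for k = 0..iterations.
    """
    x = [0.0] * len(b)
    norms = []
    for _ in range(iterations + 1):
        r = [b_i - ax_i for b_i, ax_i in zip(b, matvec(A, x))]
        norms.append(norm(r))
        # Richardson update: x <- x + omega * r
        x = [x_i + omega * r_i for x_i, r_i in zip(x, r)]
    return norms


# Hypothetical toy operator with eigenvalues 1, 10, 100, so 2/lambda_max = 0.02.
A = [[1.0, 0.0, 0.0],
     [0.0, 10.0, 0.0],
     [0.0, 0.0, 100.0]]
b = [1.0, 1.0, 1.0]

# omega = 0.3 is far above 0.02: the lambda = 100 component is amplified
# by |1 - 0.3 * 100| = 29 in every step.
diverging = richardson_residual_norms(A, b, omega=0.3, iterations=10)

# omega = 0.015 is below 0.02: every component is damped.
converging = richardson_residual_norms(A, b, omega=0.015, iterations=10)

for k, (d, c) in enumerate(zip(diverging, converging)):
    print(f"{k:3d}  omega=0.3: {d:.6e}   omega=0.015: {c:.6e}")
```

For the toy matrix the first column grows geometrically while the second decays monotonically, which mirrors the qualitative pattern of the `mg_levels_1_` and `mg_coarse_` monitors in this log.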
****************************************************************************************************************************************************************
***                                WIDEN YOUR WINDOW TO 160 CHARACTERS.  Use 'enscript -r -fCourier9' to print this document                                 ***
****************************************************************************************************************************************************************

------------------------------------------------------------------ PETSc Performance Summary: ------------------------------------------------------------------

--petsc on a named cachemiss with 1 process and CUDA architecture 86, by ulrich on Tue Sep 17 18:24:43 2024
Using Petsc Release Version 3.21.2, May 29, 2024

                         Max       Max/Min     Avg       Total
Time (sec):           2.915e+00     1.000   2.915e+00
Objects:              0.000e+00     0.000   0.000e+00
Flops:                5.296e+08     1.000   5.296e+08  5.296e+08
Flops/sec:            1.817e+08     1.000   1.817e+08  1.817e+08
MPI Msg Count:        0.000e+00     0.000   0.000e+00  0.000e+00
MPI Msg Len (bytes):  0.000e+00     0.000   0.000e+00  0.000e+00
MPI Reductions:       0.000e+00     0.000

Flop counting convention: 1 flop = 1 real number operation of type (multiply/divide/add/subtract)
                            e.g., VecAXPY() for real vectors of length N --> 2N flops
                            and VecAXPY() for complex vectors of length N --> 8N flops

Summary of Stages:   ----- Time ------  ----- Flop ------  --- Messages ---  -- Message Lengths --  -- Reductions --
                        Avg     %Total     Avg     %Total    Count   %Total     Avg         %Total    Count   %Total
 0:      Main Stage: 8.6428e-01  29.7%  2.1728e+08  41.0%  0.000e+00   0.0%  0.000e+00        0.0%  0.000e+00   0.0%
 1:           solve: 1.9765e+00  67.8%  3.1233e+08  59.0%  0.000e+00   0.0%  0.000e+00        0.0%  0.000e+00   0.0%

------------------------------------------------------------------------------------------------------------------------
See the 'Profiling' chapter of the users' manual for details on interpreting output.
Phase summary info:
   Count: number of times phase was executed
   Time and Flop: Max - maximum over all processors
                  Ratio - ratio of maximum to minimum over all processors
   Mess: number of messages sent
   AvgLen: average message length (bytes)
   Reduct: number of global reductions
   Global: entire computation
   Stage: stages of a computation. Set stages with PetscLogStagePush() and PetscLogStagePop().
      %T - percent time in this phase         %F - percent flop in this phase
      %M - percent messages in this phase     %L - percent message lengths in this phase
      %R - percent reductions in this phase
   Total Mflop/s: 10e-6 * (sum of flop over all processors)/(max time over all processors)
   GPU Mflop/s: 10e-6 * (sum of flop on GPU over all processors)/(max GPU time over all processors)
   CpuToGpu Count: total number of CPU to GPU copies per processor
   CpuToGpu Size (Mbytes): 10e-6 * (total size of CPU to GPU copies per processor)
   GpuToCpu Count: total number of GPU to CPU copies per processor
   GpuToCpu Size (Mbytes): 10e-6 * (total size of GPU to CPU copies per processor)
   GPU %F: percent flops on GPU in this event
------------------------------------------------------------------------------------------------------------------------
Event            Count     Time (sec)     Flop                                 --- Global ---  --- Stage ----      Total   GPU      - CpuToGpu -    - GpuToCpu -   GPU
                   Max Ratio      Max Ratio    Max Ratio  Mess  AvgLen  Reduct  %T %F %M %L %R   %T  %F  %M  %L  %R Mflop/s Mflop/s Count      Size Count      Size    %F
---------------------------------------------------------------------------------------------------------------------------------------------------------------

--- Event Stage 0: Main Stage

PCSetUp              1 1.0        n/a n/a 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0  0  0  0  0    0   0   0   0   0     n/a     n/a     0  0.00e+00     0  0.00e+00     0
MatConvert           1 1.0        n/a n/a 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0  0  0  0  0    0   0   0   0   0     n/a     n/a     0  0.00e+00     0  0.00e+00     0
MatAssemblyBegin     3 1.0        n/a n/a 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0  0  0  0  0    0   0   0   0   0     n/a     n/a     0  0.00e+00     0  0.00e+00     0
MatAssemblyEnd       3 1.0        n/a n/a 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0  0  0  0  0    1   0   0   0   0     n/a     n/a     0  0.00e+00     0  0.00e+00     0
MatTranspose         1 1.0        n/a n/a 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0  0  0  0  0    2   0   0   0   0     n/a     n/a     0  0.00e+00     0  0.00e+00     0
MatMatMultSym        2 1.0        n/a n/a 1.58e+08 1.0 0.0e+00 0.0e+00 0.0e+00   7 30  0  0  0   24  73   0   0   0     n/a     n/a     3  1.34e+02     2  5.49e+01   100
MatMatMultNum        2 1.0        n/a n/a 5.27e+07 1.0 0.0e+00 0.0e+00 0.0e+00   0 10  0  0  0    0  24   0   0   0     n/a     n/a     0  0.00e+00     0  0.00e+00   100
MatCUSPARSCopyTo     3 1.0        n/a n/a 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   1  0  0  0  0    5   0   0   0   0     n/a     n/a     3  1.34e+02     0  0.00e+00     0
cuBLAS Init          1 1.0        n/a n/a 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   3  0  0  0  0    9   0   0   0   0     n/a     n/a     0  0.00e+00     0  0.00e+00     0
DCtxCreate           1 1.0        n/a n/a 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0  0  0  0  0    0   0   0   0   0     n/a     n/a     0  0.00e+00     0  0.00e+00     0
DCtxDestroy          2 1.0        n/a n/a 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0  0  0  0  0    0   0   0   0   0     n/a     n/a     0  0.00e+00     0  0.00e+00     0
DCtxSetUp            1 1.0        n/a n/a 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0  0  0  0  0    0   0   0   0   0     n/a     n/a     0  0.00e+00     0  0.00e+00     0
DCtxSetDevice        1 1.0        n/a n/a 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0  0  0  0  0    0   0   0   0   0     n/a     n/a     0  0.00e+00     0  0.00e+00     0
KSPSetUp             3 1.0        n/a n/a 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0  0  0  0  0    0   0   0   0   0     n/a     n/a     0  0.00e+00     0  0.00e+00     0

--- Event Stage 1: solve

PCApply              4 1.0        n/a n/a 2.97e+08 1.0 0.0e+00 0.0e+00 0.0e+00  61 56  0  0  0   90  95   0   0   0     n/a     n/a     5  7.37e+00     4  5.90e+00    74
MatMult            136 1.0        n/a n/a 1.72e+08 1.0 0.0e+00 0.0e+00 0.0e+00  61 32  0  0  0   90  55   0   0   0     n/a     n/a     0  0.00e+00     6  8.85e+00   100
MatMultAdd           4 1.0        n/a n/a 8.85e+06 1.0 0.0e+00 0.0e+00 0.0e+00   0  2  0  0  0    0   3   0   0   0     n/a     n/a     4  5.90e+00     0  0.00e+00   100
MatMultTranspose     4 1.0        n/a n/a 8.11e+06 1.0 0.0e+00 0.0e+00 0.0e+00   1  2  0  0  0    2   3   0   0   0     n/a     n/a     1  1.47e+00     0  0.00e+00   100
MatResidual          4 1.0        n/a n/a 7.37e+05 1.0 0.0e+00 0.0e+00 0.0e+00   3  0  0  0  0    4   0   0   0   0     n/a     n/a     0  0.00e+00     0  0.00e+00     0
MatView              5 1.0        n/a n/a 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0  0  0  0  0    0   0   0   0   0     n/a     n/a     0  0.00e+00     0  0.00e+00     0
DCtxCreate           1 1.0        n/a n/a 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0  0  0  0  0    0   0   0   0   0     n/a     n/a     0  0.00e+00     0  0.00e+00     0
DCtxSetUp            1 1.0        n/a n/a 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0  0  0  0  0    0   0   0   0   0     n/a     n/a     0  0.00e+00     0  0.00e+00     0
DCtxSetDevice        1 1.0        n/a n/a 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0  0  0  0  0    0   0   0   0   0     n/a     n/a     0  0.00e+00     0  0.00e+00     0
DCtxFork             2 1.0        n/a n/a 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0  0  0  0  0    0   0   0   0   0     n/a     n/a     0  0.00e+00     0  0.00e+00     0
DCtxJoin             2 1.0        n/a n/a 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0  0  0  0  0    0   0   0   0   0     n/a     n/a     0  0.00e+00     0  0.00e+00     0
DCtxSync            15 1.0        n/a n/a 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0  0  0  0  0    0   0   0   0   0     n/a     n/a     0  0.00e+00     0  0.00e+00     0
DCtxMark             6 1.0        n/a n/a 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0  0  0  0  0    0   0   0   0   0     n/a     n/a     0  0.00e+00     0  0.00e+00     0
VecMDot              4 1.0        n/a n/a 3.69e+06 1.0 0.0e+00 0.0e+00 0.0e+00   0  1  0  0  0    0   1   0   0   0     n/a     n/a     5  7.37e+00     2  5.60e-05    70
VecNorm            186 1.0        n/a n/a 5.16e+07 1.0 0.0e+00 0.0e+00 0.0e+00   1 10  0  0  0    2  17   0   0   0     n/a     n/a     0  0.00e+00     0  0.00e+00    38
VecScale             4 1.0        n/a n/a 7.37e+05 1.0 0.0e+00 0.0e+00 0.0e+00   0  0  0  0  0    0   0   0   0   0     n/a     n/a     0  0.00e+00     0  0.00e+00    25
VecCopy            189 1.0        n/a n/a 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0  0  0  0  0    1   0   0   0   0     n/a     n/a     0  0.00e+00     1  1.47e+00     0
VecSet              12 1.0        n/a n/a 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0  0  0  0  0    0   0   0   0   0     n/a     n/a     0  0.00e+00     0  0.00e+00     0
VecAXPY            123 1.0        n/a n/a 3.80e+07 1.0 0.0e+00 0.0e+00 0.0e+00   0  7  0  0  0    0  12   0   0   0     n/a     n/a     3  4.42e+00     1  1.47e+00    22
VecAYPX            132 1.0        n/a n/a 2.51e+07 1.0 0.0e+00 0.0e+00 0.0e+00   1  5  0  0  0    1   8   0   0   0     n/a     n/a     5  7.37e+00     1  1.47e+00    35
VecMAXPY             6 1.0        n/a n/a 4.42e+06 1.0 0.0e+00 0.0e+00 0.0e+00   0  1  0  0  0    0   1   0   0   0     n/a     n/a     1  2.40e-05     0  0.00e+00    25
VecCUDACopyTo       18 1.0        n/a n/a 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0  0  0  0  0    0   0   0   0   0     n/a     n/a    18  2.65e+01     0  0.00e+00     0
VecCUDACopyFrom      8 1.0        n/a n/a 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0  0  0  0  0    0   0   0   0   0     n/a     n/a     0  0.00e+00     8  1.18e+01     0
KSPSolve             1 1.0 1.9761e+00 1.0 3.12e+08 1.0 0.0e+00 0.0e+00 0.0e+00  68 59  0  0  0  100 100   0   0   0     158     n/a    19  2.65e+01    11  1.33e+01    74
KSPGMRESOrthog       4 1.0        n/a n/a 4.42e+06 1.0 0.0e+00 0.0e+00 0.0e+00   0  1  0  0  0    0   1   0   0   0     n/a     n/a   -16 -1.53e+02   -11 -6.82e+01 -9860
---------------------------------------------------------------------------------------------------------------------------------------------------------------

Object Type          Creations   Destructions. Reports information only for process 0.

--- Event Stage 0: Main Stage

           Container     6             12
      Preconditioner     4              4
              Matrix     6              6
  PetscDeviceContext     1              0
   IS L to G Mapping     4              4
              Vector    23             43
       Krylov Solver     4              4
              Viewer     1              1

--- Event Stage 1: solve

           Container     6              0
  PetscDeviceContext     1              0
              Vector   116             96
========================================================================================================================
Average time to get PetscTime(): 3.5e-08
#PETSc Option Table entries:
-ksp_max_it 10 # (source: file)
-ksp_monitor_true_residual # (source: file)
-ksp_rtol 1.0e-9 # (source: file)
-ksp_type fgmres # (source: file)
-ksp_view # (source: file)
-log_view # (source: file)
-mat_type aijcusparse # (source: file)
-mg_coarse_ksp_converged_reason # (source: file)
-mg_coarse_ksp_max_it 10 # (source: file)
-mg_coarse_ksp_monitor_true_residual # (source: file)
-mg_coarse_ksp_richardson_scale 1e-2 # (source: file)
-mg_coarse_ksp_rtol 0.5 # (source: file)
-mg_coarse_ksp_type richardson # (source: file)
-mg_coarse_pc_type none # (source: file)
-mg_levels_ksp_max_it 10 # (source: file)
-mg_levels_ksp_monitor # (source: file)
-mg_levels_ksp_norm_type preconditioned # (source: file)
-mg_levels_ksp_richardson_scale 0.3 # (source: file)
-mg_levels_ksp_type richardson # (source: file)
-mg_levels_pc_type none # (source: file)
-options_left # (source: file)
-pc_type mg # (source: file)
-vec_type cuda # (source: file)
#End of PETSc Option Table entries
Compiled without FORTRAN kernels
Compiled with 64-bit PetscInt
Compiled with full precision matrices (default)
sizeof(short) 2 sizeof(int) 4 sizeof(long) 8 sizeof(void*) 8 sizeof(PetscScalar)
8 sizeof(PetscInt) 8 Configure options: --prefix=/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/petsc-3.21.2-v27tvdpxpyj5fep27tv3dtldx5lfa5gr --with-ssl=0 --download-c2html=0 --download-sowing=0 --download-hwloc=0 --with-make-exec=make --with-cc=/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/openmpi-5.0.3-lovxp7hefm62cdqtuqaffypaxcaes7xh/bin/mpicc --with-cxx=/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/openmpi-5.0.3-lovxp7hefm62cdqtuqaffypaxcaes7xh/bin/mpic++ --with-fc=/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/openmpi-5.0.3-lovxp7hefm62cdqtuqaffypaxcaes7xh/bin/mpif90 --with-precision=double --with-scalar-type=real --with-shared-libraries=1 --with-debugging=0 --with-openmp=0 --with-64-bit-indices=1 --with-blaslapack-lib=/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/openblas-0.3.27-jd3ju7eilgxfc54httc6mjomfgtogplt/lib/libopenblas.so --with-memalign=32 --with-x=0 --with-sycl=0 --with-clanguage=C --with-cuda=1 --with-cuda-dir=/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/cuda-12.5.0-vc25sc6unsgx2phn7nftp3zapqcg3f7x --with-hip=0 --with-metis=1 --with-metis-include=/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/metis-5.1.0-a37lt4tqieagkaxgjj5pbum67kkqsc4f/include --with-metis-lib=/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/metis-5.1.0-a37lt4tqieagkaxgjj5pbum67kkqsc4f/lib/libmetis.so --with-hypre=1 --with-hypre-include=/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/hypre-2.31.0-usaf7rfi6poyfs5tdn6hjobd6i6kjn2i/include --with-hypre-lib=/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/hypre-2.31.0-usaf7rfi6poyfs5tdn6hjobd6i6kjn2i/lib/libHYPRE.so --with-parmetis=1 
--with-parmetis-include=/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/parmetis-4.0.3-zgalvrqlzooma73h6hsroylg6i7hykxh/include --with-parmetis-lib=/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/parmetis-4.0.3-zgalvrqlzooma73h6hsroylg6i7hykxh/lib/libparmetis.so --with-kokkos=0 --with-kokkos-kernels=0 --with-superlu_dist=1 --with-superlu_dist-include=/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/superlu-dist-8.2.1-i7z6gn62acaonqz62afnoxuuier6xtva/include --with-superlu_dist-lib=/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/superlu-dist-8.2.1-i7z6gn62acaonqz62afnoxuuier6xtva/lib/libsuperlu_dist.so --with-ptscotch=0 --with-suitesparse=0 --with-hdf5=1 --with-hdf5-include=/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/hdf5-1.14.3-pcsiqx5igle5bhlc6j4bsh7mpmzanlwt/include --with-hdf5-lib=/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/hdf5-1.14.3-pcsiqx5igle5bhlc6j4bsh7mpmzanlwt/lib/libhdf5.so --with-zlib=0 --with-mumps=1 --with-mumps-include=/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/mumps-5.7.2-hnotadxq2764rlisqk4tpad3sjhawjjk/include --with-mumps-lib="/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/mumps-5.7.2-hnotadxq2764rlisqk4tpad3sjhawjjk/lib/libsmumps.so /import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/mumps-5.7.2-hnotadxq2764rlisqk4tpad3sjhawjjk/lib/libzmumps.so /import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/mumps-5.7.2-hnotadxq2764rlisqk4tpad3sjhawjjk/lib/libcmumps.so /import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/mumps-5.7.2-hnotadxq2764rlisqk4tpad3sjhawjjk/lib/libdmumps.so 
/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/mumps-5.7.2-hnotadxq2764rlisqk4tpad3sjhawjjk/lib/libmumps_common.so /import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/mumps-5.7.2-hnotadxq2764rlisqk4tpad3sjhawjjk/lib/libpord.so" --with-trilinos=0 --with-fftw=0 --with-valgrind=0 --with-gmp=0 --with-libpng=0 --with-giflib=0 --with-mpfr=0 --with-netcdf=0 --with-pnetcdf=0 --with-moab=0 --with-random123=0 --with-exodusii=0 --with-cgns=0 --with-memkind=0 --with-p4est=0 --with-saws=0 --with-yaml=0 --with-hwloc=0 --with-libjpeg=0 --with-scalapack=1 --with-scalapack-lib=/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/netlib-scalapack-2.2.0-bflx52o6f456pqrrigpog6nsx55co735/lib/libscalapack.so --with-strumpack=0 --with-mmg=0 --with-parmmg=0 --with-tetgen=0 --with-zoltan=0 ----------------------------------------- Libraries compiled on 2024-09-16 22:09:03 on cachemiss Machine characteristics: Linux-6.10.6+bpo-amd64-x86_64-with-glibc2.36 Using PETSc directory: /import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/petsc-3.21.2-v27tvdpxpyj5fep27tv3dtldx5lfa5gr Using PETSc arch: ----------------------------------------- Using C compiler: /import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/openmpi-5.0.3-lovxp7hefm62cdqtuqaffypaxcaes7xh/bin/mpicc -fPIC -Wall -Wwrite-strings -Wno-unknown-pragmas -Wno-lto-type-mismatch -Wno-stringop-overflow -fstack-protector -fvisibility=hidden -g -O Using Fortran compiler: /import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/openmpi-5.0.3-lovxp7hefm62cdqtuqaffypaxcaes7xh/bin/mpif90 -fPIC -Wall -ffree-line-length-none -ffree-line-length-0 -Wno-lto-type-mismatch -Wno-unused-dummy-argument -g -O ----------------------------------------- Using include paths: 
-I/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/petsc-3.21.2-v27tvdpxpyj5fep27tv3dtldx5lfa5gr/include -I/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/hypre-2.31.0-usaf7rfi6poyfs5tdn6hjobd6i6kjn2i/include -I/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/mumps-5.7.2-hnotadxq2764rlisqk4tpad3sjhawjjk/include -I/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/superlu-dist-8.2.1-i7z6gn62acaonqz62afnoxuuier6xtva/include -I/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/parmetis-4.0.3-zgalvrqlzooma73h6hsroylg6i7hykxh/include -I/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/metis-5.1.0-a37lt4tqieagkaxgjj5pbum67kkqsc4f/include -I/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/hdf5-1.14.3-pcsiqx5igle5bhlc6j4bsh7mpmzanlwt/include -I/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/cuda-12.5.0-vc25sc6unsgx2phn7nftp3zapqcg3f7x/include ----------------------------------------- Using C linker: /import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/openmpi-5.0.3-lovxp7hefm62cdqtuqaffypaxcaes7xh/bin/mpicc Using Fortran linker: /import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/openmpi-5.0.3-lovxp7hefm62cdqtuqaffypaxcaes7xh/bin/mpif90 Using libraries: -Wl,-rpath,/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/petsc-3.21.2-v27tvdpxpyj5fep27tv3dtldx5lfa5gr/lib -L/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/petsc-3.21.2-v27tvdpxpyj5fep27tv3dtldx5lfa5gr/lib -lpetsc -Wl,-rpath,/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/hypre-2.31.0-usaf7rfi6poyfs5tdn6hjobd6i6kjn2i/lib 
-L/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/hypre-2.31.0-usaf7rfi6poyfs5tdn6hjobd6i6kjn2i/lib -Wl,-rpath,/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/mumps-5.7.2-hnotadxq2764rlisqk4tpad3sjhawjjk/lib -L/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/mumps-5.7.2-hnotadxq2764rlisqk4tpad3sjhawjjk/lib -Wl,-rpath,/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/netlib-scalapack-2.2.0-bflx52o6f456pqrrigpog6nsx55co735/lib -L/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/netlib-scalapack-2.2.0-bflx52o6f456pqrrigpog6nsx55co735/lib -Wl,-rpath,/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/superlu-dist-8.2.1-i7z6gn62acaonqz62afnoxuuier6xtva/lib -L/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/superlu-dist-8.2.1-i7z6gn62acaonqz62afnoxuuier6xtva/lib -Wl,-rpath,/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/openblas-0.3.27-jd3ju7eilgxfc54httc6mjomfgtogplt/lib -L/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/openblas-0.3.27-jd3ju7eilgxfc54httc6mjomfgtogplt/lib -Wl,-rpath,/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/parmetis-4.0.3-zgalvrqlzooma73h6hsroylg6i7hykxh/lib -L/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/parmetis-4.0.3-zgalvrqlzooma73h6hsroylg6i7hykxh/lib -Wl,-rpath,/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/metis-5.1.0-a37lt4tqieagkaxgjj5pbum67kkqsc4f/lib -L/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/metis-5.1.0-a37lt4tqieagkaxgjj5pbum67kkqsc4f/lib -Wl,-rpath,/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/hdf5-1.14.3-pcsiqx5igle5bhlc6j4bsh7mpmzanlwt/lib 
-L/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/hdf5-1.14.3-pcsiqx5igle5bhlc6j4bsh7mpmzanlwt/lib -Wl,-rpath,/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/cuda-12.5.0-vc25sc6unsgx2phn7nftp3zapqcg3f7x/lib64 -L/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/cuda-12.5.0-vc25sc6unsgx2phn7nftp3zapqcg3f7x/lib64 -L/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/cuda-12.5.0-vc25sc6unsgx2phn7nftp3zapqcg3f7x/lib64/stubs -Wl,-rpath,/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/openmpi-5.0.3-lovxp7hefm62cdqtuqaffypaxcaes7xh/lib -L/import/exception-dump/ulrich/myLibs/spack-packages/linux-debian12-zen2/gcc-12.2.0/openmpi-5.0.3-lovxp7hefm62cdqtuqaffypaxcaes7xh/lib -Wl,-rpath,/usr/lib/gcc/x86_64-linux-gnu/12 -L/usr/lib/gcc/x86_64-linux-gnu/12 -lHYPRE -lsmumps -lzmumps -lcmumps -ldmumps -lmumps_common -lpord -lscalapack -lsuperlu_dist -lopenblas -lparmetis -lmetis -lhdf5 -lm -lcudart -lnvToolsExt -lcufft -lcublas -lcusparse -lcusolver -lcurand -lcuda -lmpi_usempif08 -lmpi_usempi_ignore_tkr -lmpi_mpifh -lmpi -lgfortran -lm -lgfortran -lm -lgcc_s -lquadmath -lstdc++ -lquadmath ----------------------------------------- #PETSc Option Table entries: -ksp_max_it 10 # (source: file) -ksp_monitor_true_residual # (source: file) -ksp_rtol 1.0e-9 # (source: file) -ksp_type fgmres # (source: file) -ksp_view # (source: file) -log_view # (source: file) -mat_type aijcusparse # (source: file) -mg_coarse_ksp_converged_reason # (source: file) -mg_coarse_ksp_max_it 10 # (source: file) -mg_coarse_ksp_monitor_true_residual # (source: file) -mg_coarse_ksp_richardson_scale 1e-2 # (source: file) -mg_coarse_ksp_rtol 0.5 # (source: file) -mg_coarse_ksp_type richardson # (source: file) -mg_coarse_pc_type none # (source: file) -mg_levels_ksp_max_it 10 # (source: file) -mg_levels_ksp_monitor # (source: file) -mg_levels_ksp_norm_type 
preconditioned # (source: file)
-mg_levels_ksp_richardson_scale 0.3 # (source: file)
-mg_levels_ksp_type richardson # (source: file)
-mg_levels_pc_type none # (source: file)
-options_left # (source: file)
-pc_type mg # (source: file)
-vec_type cuda # (source: file)
#End of PETSc Option Table entries
There are no unused options.
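Note on the rate columns of the performance summary: the legend defines Total Mflop/s as a scaled ratio of flop to time, so the only event with a reported time, `KSPSolve`, can be cross-checked by hand. The sketch below redoes that arithmetic with the rounded figures printed in this log. The helper name `mflops` is made up for illustration. The legend prints the scale factor as `10e-6`, but the reported value of 158 is reproduced with a factor of 1e-6, which is what the sketch assumes.

```python
def mflops(flop, seconds):
    """Rate in Mflop/s: flop count divided by time, scaled by 1e-6."""
    return 1e-6 * flop / seconds


# KSPSolve row: 3.12e+08 flop in 1.9761e+00 s, reported as 158 Mflop/s.
ksp_solve_rate = mflops(3.12e8, 1.9761)

# Global header: 5.296e+08 flop in 2.915e+00 s, reported as 1.817e+08 flop/s.
total_flops_per_sec = 5.296e8 / 2.915

print(f"KSPSolve: {ksp_solve_rate:.1f} Mflop/s")
print(f"Whole run: {total_flops_per_sec:.3e} flop/s")
```

Both values agree with the log to the printed precision; small differences in the last digit would be expected because the inputs are themselves rounded.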