Task04 Никита Вербин SPbSU #134
base: task04
// Make sure the PR title matches the template:
// Paste here (between the pre and /pre tags) the test output from your machine:
Local output
/mnt/data/Programming/ProgrammingSP/VideoCardCalc/GPGPUTasks2024/build/matrix_transpose 1
OpenCL devices:
  Device #0: CPU. Intel(R) Core(TM) i5-7400 CPU @ 3.00GHz. Intel(R) Corporation. Total memory: 7900 Mb
  Device #1: GPU. NVIDIA GeForce GTX 750 Ti. Total memory: 4033 Mb
Using device #1: GPU. NVIDIA GeForce GTX 750 Ti. Total memory: 4033 Mb
Data generated for M=4096, K=4096
[matrix_transpose_naive] GPU: 0.00665927+-0.000303213 s GPU: 2519.38 millions/s
[matrix_transpose_local_bad_banks] GPU: 0.00288885+-5.97285e-05 s GPU: 5807.58 millions/s
[matrix_transpose_local_good_banks] GPU: 0.0025979+-4.05519e-05 s GPU: 6457.99 millions/s

/mnt/data/Programming/ProgrammingSP/VideoCardCalc/GPGPUTasks2024/build/matrix_multiplication 1
OpenCL devices:
  Device #0: CPU. Intel(R) Core(TM) i5-7400 CPU @ 3.00GHz. Intel(R) Corporation. Total memory: 7900 Mb
  Device #1: GPU. NVIDIA GeForce GTX 750 Ti. Total memory: 4033 Mb
Using device #1: GPU. NVIDIA GeForce GTX 750 Ti. Total memory: 4033 Mb
Data generated for M=1024, K=1024, N=1024
CPU: 7.72198+-0 s CPU: 0.259001 GFlops
[naive, ts=4] GPU: 0.0774387+-0.000761379 s GPU: 25.8269 GFlops Average difference: 0.000149043%
[naive, ts=8] GPU: 0.0419122+-1.43691e-05 s GPU: 47.7188 GFlops Average difference: 0.000149043%
[naive, ts=16] GPU: 0.0256253+-0.000573867 s GPU: 78.0478 GFlops Average difference: 0.000149043%
[local, ts=4] GPU: 0.0689645+-1.45574e-05 s GPU: 29.0004 GFlops Average difference: 0.000149043%
[local, ts=8] GPU: 0.0212967+-8.84433e-06 s GPU: 93.9114 GFlops Average difference: 0.000149043%
[local, ts=16] GPU: 0.0147798+-9.13631e-06 s GPU: 135.32 GFlops Average difference: 0.000149043%
[local wpt, ts=4, wpt=2] GPU: 0.107218+-3.4946e-05 s GPU: 18.6536 GFlops Average difference: 0.000149043%
[local wpt, ts=4, wpt=4] GPU: 0.204786+-6.3748e-05 s GPU: 9.7663 GFlops Average difference: 0.000149043%
[local wpt, ts=8, wpt=2] GPU: 0.0209273+-2.14761e-05 s GPU: 95.5688 GFlops Average difference: 0.000149043%
[local wpt, ts=8, wpt=4] GPU: 0.0288332+-3.00578e-05 s GPU: 69.3646 GFlops Average difference: 0.000149043%
[local wpt, ts=8, wpt=8] GPU: 0.0661975+-1.15578e-05 s GPU: 30.2126 GFlops Average difference: 0.000149043%
[local wpt, ts=16, wpt=2] GPU: 0.0109615+-3.5473e-06 s GPU: 182.457 GFlops Average difference: 0.000149043%
[local wpt, ts=16, wpt=4] GPU: 0.0101667+-1.69967e-06 s GPU: 196.721 GFlops Average difference: 0.000149043%
[local wpt, ts=16, wpt=8] GPU: 0.0129238+-9.99027e-06 s GPU: 154.753 GFlops Average difference: 0.000149043%
[local wpt, ts=16, wpt=16] GPU: 0.0367468+-1.52689e-05 s GPU: 54.4265 GFlops Average difference: 0.000149043%

// Then create the PR; an automatic build should start on GitHub CI (GitHub Actions) - an orange dot (build in progress) will appear next to the commit in the PR,
GitHub CI output
./matrix_transpose
shell: /usr/bin/bash -e {0}
env:
  BUILD_TYPE: RelWithDebInfo
OpenCL devices:
  Device #0: CPU. AMD EPYC 7763 64-Core Processor . Intel(R) Corporation. Total memory: 15991 Mb
Using device #0: CPU. AMD EPYC 7763 64-Core Processor . Intel(R) Corporation. Total memory: 15991 Mb
Data generated for M=4096, K=4096
[matrix_transpose_naive] GPU: 0.0155612+-0.00013746 s GPU: 1078.15 millions/s
[matrix_transpose_local_bad_banks] GPU: 0.0263048+-4.98146e-05 s GPU: 637.799 millions/s
[matrix_transpose_local_good_banks] GPU: 0.0273989+-7.18847e-05 s GPU: 612.332 millions/s

./matrix_multiplication
shell: /usr/bin/bash -e {0}
env:
  BUILD_TYPE: RelWithDebInfo
OpenCL devices:
  Device #0: CPU. AMD EPYC 7763 64-Core Processor . Intel(R) Corporation. Total memory: 15991 Mb
Using device #0: CPU. AMD EPYC 7763 64-Core Processor . Intel(R) Corporation. Total memory: 15991 Mb
Data generated for M=1024, K=1024, N=1024
CPU: 6.313+-0 s CPU: 0.316806 GFlops
[naive, ts=4] GPU: 0.26489+-0.00227359 s GPU: 7.5503 GFlops Average difference: 0.000149043%
[naive, ts=8] GPU: 0.260856+-0.00302101 s GPU: 7.66708 GFlops Average difference: 0.000149043%
[naive, ts=16] GPU: 0.264585+-0.00630681 s GPU: 7.55901 GFlops Average difference: 0.000149043%
[local, ts=4] GPU: 0.92003+-0.00340236 s GPU: 2.17384 GFlops Average difference: 0.000149043%
[local, ts=8] GPU: 0.18696+-0.000637197 s GPU: 10.6975 GFlops Average difference: 0.000149043%
[local, ts=16] GPU: 0.124166+-0.000385947 s GPU: 16.1074 GFlops Average difference: 0.000149043%
[local wpt, ts=4, wpt=2] GPU: 0.947565+-0.001652 s GPU: 2.11067 GFlops Average difference: 0.000149043%
[local wpt, ts=4, wpt=4] GPU: 0.782714+-0.00185724 s GPU: 2.55521 GFlops Average difference: 0.000149043%
[local wpt, ts=8, wpt=2] GPU: 0.212608+-0.00029748 s GPU: 9.40697 GFlops Average difference: 0.000149043%
[local wpt, ts=8, wpt=4] GPU: 0.22813+-0.000416355 s GPU: 8.76692 GFlops Average difference: 0.000149043%
[local wpt, ts=8, wpt=8] GPU: 0.209914+-0.00052147 s GPU: 9.52771 GFlops Average difference: 0.000149043%
[local wpt, ts=16, wpt=2] GPU: 0.138075+-0.000485082 s GPU: 14.4849 GFlops Average difference: 0.000149043%
[local wpt, ts=16, wpt=4] GPU: 0.144341+-0.00479876 s GPU: 13.8561 GFlops Average difference: 0.000149043%
[local wpt, ts=16, wpt=8] GPU: 0.13521+-0.000230756 s GPU: 14.7918 GFlops Average difference: 0.000149043%
[local wpt, ts=16, wpt=16] GPU: 0.146567+-0.000239671 s GPU: 13.6457 GFlops Average difference: 0.000149043%
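For readers who want to sanity-check the throughput columns, here is a minimal Python sketch that recomputes them from the mean timings. It rests on assumptions inferred from the numbers, not from the benchmark source: the multiplication figures look consistent with 2 G operations divided by the mean time, and the transpose figures with M*K elements (in units of 1e6) divided by the mean time. The function names are hypothetical.

```python
# Sketch only: recompute throughput from the timings quoted in the logs.
# Assumption (inferred from the logged numbers, not from the harness code):
#   - matrix multiplication throughput = 2 G operations / mean time
#   - matrix transpose throughput = M * K / 1e6 elements / mean time

def transpose_millions_per_s(m: int, k: int, seconds: float) -> float:
    """Elements transposed per second, in millions."""
    return m * k / 1e6 / seconds

def matmul_gflops(seconds: float, giga_ops: float = 2.0) -> float:
    """GFlops for M=K=N=1024, assuming the harness counts 2 G operations."""
    return giga_ops / seconds

def speedup(baseline_seconds: float, candidate_seconds: float) -> float:
    """How many times faster the candidate is than the baseline."""
    return baseline_seconds / candidate_seconds

if __name__ == "__main__":
    # Mean timings copied from the local (GTX 750 Ti) run.
    naive_ts16 = 0.0256253
    local_ts16 = 0.0147798
    wpt_ts16_wpt4 = 0.0101667

    print("naive ts=16 GFlops:", round(matmul_gflops(naive_ts16), 3))
    print("local wpt ts=16 wpt=4 GFlops:", round(matmul_gflops(wpt_ts16_wpt4), 3))
    print("local vs naive speedup:", round(speedup(naive_ts16, local_ts16), 2))
    print("wpt=4 vs naive speedup:", round(speedup(naive_ts16, wpt_ts16_wpt4), 2))
    print("transpose naive millions/s:",
          round(transpose_millions_per_s(4096, 4096, 0.00665927), 2))
```

The same helpers can be applied to the CI timings, where the OpenCL device is a CPU and the local-memory kernels show little or no gain over the naive ones.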