Skip to content

Commit

Permalink
Merge pull request roced-scheduler#17 from uschnoor/ben_benchmark
Browse files Browse the repository at this point in the history
update of Benchmarks section (5.1)
  • Loading branch information
benoitroland authored Sep 11, 2018
2 parents 34e0c20 + 9cf00c7 commit 3f4e890
Show file tree
Hide file tree
Showing 6 changed files with 86 additions and 3 deletions.
Binary file not shown.
Binary file not shown.
Binary file not shown.
44 changes: 44 additions & 0 deletions publication/Draft_CompSoftwBigScience/include/Benchmarks.tex
Original file line number Diff line number Diff line change
@@ -0,0 +1,44 @@
Besides the use of the legacy HEP-SPEC06 (HS06) benchmark \cite{Hepspec}, the performance of the compute resources is furthermore evaluated with two fast benchmarks
developed to provide real-time information of the WLCG performance and available in the CERN benchmark suite \cite{Alef:2017jyx}; the Dirac Benchmark 2012 DB12 \cite{DB12}
and the ATLAS Kit Validation KV \cite{DeSalvo:2010zza}. The DB12 program is evaluating the performance of CPUs through floating-point arithmetic operations, while the KV benchmark
is making use of the simulation toolkit GEANT4 \cite{Agostinelli:2002hh} to simulate the interactions of single muon events in the detector of the ATLAS experiment. As our primary
target is to measure performances of CPUs in the context of High Energy Physics applications, the KV benchmark constitutes a realistic workload. The DB12 benchmark is using
the HS06 units, and the KV output provides the number of events produced per second. \\

To assess the impact of the virtualisation, the performance of the same hardware configuration (20 cores Intel Xeon E5-2630 CPUs) has been determined either deployed via
the standard bare metal operation on the NEMO cluster (NEMO bare metal) and on the local ATLAS Tier-3 centre in Freiburg (ATLAS-BFG bare metal), or as virtual machines on the
NEMO cluster (NEMO VM). On the ATLAS-BFG bare metal and on the virtual machines running on the NEMO cluster, hyper-threading (HT) technology is activated. Both are using Scientific
Linux 6 \cite{SL6} as operating system. The NEMO bare metal has no HT activated due to the more general use case of the system, and uses CentOS7 as operating system \cite{CentOS7}. % \\
The scores of the HEP-SPEC06, DB12, and KV benchmarks have been determined for these three configurations as a function of the number of cores actually used by the benchmarking processes.
This number ranges from 2 to 40 for the ATLAS-BFG bare metal and for the NEMO VM, for which HT is enabled, and from 2 to 20 for the NEMO bare metal,
for which HT is not implemented. The results have been determined by step of two core units. The benchmarks have been run 20 times for each core multiplicity value, and the means
and standard deviations of the corresponding distributions have been extracted. \\

\begin{figure}[htbp]
\centering
\includegraphics[width=0.44\textwidth]{figures/benchmark-hepspec06-total.pdf}
\includegraphics[width=0.44\textwidth]{figures/DB12-BFG-BM-NEMO-VM-NEMO-BM-total.pdf}
\includegraphics[width=0.44\textwidth]{figures/kv-BFG-BM-NEMO-VM-NEMO-BM-total.pdf}
\caption{Total score as a function of the core multiplicity for the HEP-SPEC06 (top), DB12 (middle) and KV (bottom) benchmarks
for the ATLAS-BFG bare metal (blue open circles), the NEMO VMs (red full circles) and the NEMO bare metal (black open squares). The data points represent
the average values of the benchmarks for each core multiplicity, and the vertical bars show the associated standard deviations.}
\label{bmk-total}
\end{figure}

The total scores are presented in figure \ref{bmk-total} for the three benchmarks and the three configurations considered, except for the KV software for which the NEMO bare metal
results are not yet available. The total scores observed for the different benchmarks are increasing until the maximum number of physical cores has been reached, and are characterized by a flattening increase
or a constant behaviour afterwards. The benchmark scores of the virtual machines running on the NEMO cluster are only slightly lower than those obtained for the NEMO bare metal operation, and the lost of performance
due to the virtualisation does not exceed 10$\%$. For the ATLAS-BFG bare metal and the VMs running on the NEMO cluster, the interplay between the virtualisation and the different operating systems leads
to similar benchmark behaviours for the two configurations.

% References for HEPSPEC
%%https://indico.cern.ch/event/49388/contributions/2014772/attachments/960838/1363966/20090127_NewCPUbenchmark_final.pdf
%%http://w3.hepix.org/benchmarking.html
%%https://spec.org/benchmarks.html
%%

%
%realistic load: as user
%machine reserved: root
%Hier sieht man, dass die Bare-Metal-User-Jobs wesentlich weniger performant sind, und die User-Jobs auf den VMs beinahe an die ROOT-Tests auf bare-metal herankommen. Interessant ist auch, dass bei den User-Jobs der Trend umgekehrt ist: Dort ist die HEPSPEC pro Core(!) etwas besser fuer 8 Cores. Es koennte aber auch sein, dass das bei den ROOT-Jobs auch so waere, dort haben wir die Zahlen in dem Bereich ja nicht.
%
Binary file modified publication/Draft_CompSoftwBigScience/nemo-virtualization.pdf
Binary file not shown.
45 changes: 42 additions & 3 deletions publication/Draft_CompSoftwBigScience/nemo-virtualization.tex
Original file line number Diff line number Diff line change
Expand Up @@ -178,11 +178,12 @@ \subsection{Using SLURM as front-end scheduler}

\section{Analysis of performance and usage}

The apporach described above has been implemented and put into production in the research groups at University of Freiburg (Physikalisches Institut) and the Karlsruhe Institute of Technology (Institute of Experimental Particle Physics). The following chapter presents the statistical analysis of the performance of the virtualized setup both in terms of job performance as well as usage statistics.
The approach described above has been implemented and put into production in the research groups at the University of Freiburg (Physikalisches Institut) and the Karlsruhe Institute of Technology (Institute of Experimental
Particle Physics). The following section presents the statistical analysis of the performance of the virtualized setup both in terms of CPU performance and usage statistics.

\subsection{HEPSPEC benchmarks}
\subsection{Benchmarks}
%$\to$ ATLAS Freiburg\\
\input{include/Hepspec.tex}
\input{include/Benchmarks.tex}



Expand Down Expand Up @@ -396,6 +397,44 @@ \section{Conclusions and Outlook}
\bibitem{Hepspec} HEPiX Benchmarking Working Group:
\url{https://twiki.cern.ch/twiki/bin/view/FIOgroup/TsiBenchHEPSPEC}, accessed 2018-01-29


\bibitem{Alef:2017jyx}
M.~Alef {\it et al.},
``Benchmarking cloud resources for HEP'',
J.\ Phys.\ Conf.\ Ser.\ {\bf 898} (2017) no.9, 092056.
doi:10.1088/1742-6596/898/9/092056
%%CITATION = doi:10.1088/1742-6596/898/9/092056;%%

\bibitem{DB12}
Graciani, Ricardo and Andrew McNab, Dirac benchmark 2012,
\url{https://gitlab.cern.ch/mcnab/dirac-benchmark/tree/master}

\bibitem{DeSalvo:2010zza}
A.~De Salvo and F.~Brasolin,
``Benchmarking the ATLAS software through the kit validation engine'',
J.\ Phys.\ Conf.\ Ser.\ {\bf 219} (2010) 042037.
doi:10.1088/1742-6596/219/4/042037
%%CITATION = doi:10.1088/1742-6596/219/4/042037;%%
%4 citations counted in INSPIRE as of 11 Sep 2018

\bibitem{Agostinelli:2002hh}
S.~Agostinelli {\it et al.} [GEANT4 Collaboration],
``GEANT4: A Simulation toolkit'',
Nucl.\ Instrum.\ Meth.\ A {\bf 506} (2003) 250.
doi:10.1016/S0168-9002(03)01368-8
%%CITATION = doi:10.1016/S0168-9002(03)01368-8;%%
%9602 citations counted in INSPIRE as of 11 Sep 2018

\bibitem{SL6}
Fermilab and CERN,
``Scientific Linux 6'',
\url{http://www.scientificlinux.org/}

\bibitem{CentOS7}
The CentOS Project,
``CentOS Linux 7'',
\url{https://www.centos.org/}

\bibitem{PowhegBox}
P. Nason, JHEP 0411 (2004) 040, hep-ph/0409146;
S. Frixione, P. Nason and C. Oleari, JHEP 0711 (2007) 070, arXiv:0709.2092;
Expand Down

0 comments on commit 3f4e890

Please sign in to comment.