-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathintro.tex
100 lines (77 loc) · 4.33 KB
/
intro.tex
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
% !TeX root = sr.tex
\section{Introduction}
\subsection{Measuring Storage Resource Consumption}
% storeage resource intro
Measuring consumption of storage resources is distinctively different from
the measurement of the consumption of computing resources in batch jobs.
On a computing resource it is quite easy to continuously collect CPU
usage information with a high time resolution - the operating system itself does
it and an accounting system can access these data. Additionally,
the resource usage data are usually strictly increasing, e.g.
CPU time or wall clock time.
A collection of storage resource consumption is much harder
to achieve. The usage of each participant will vary over time and only few
systems do permanently record the storage use of the single users or
groups with a fine granularity in time.
Usage snapshot are taken from time to time only
resulting in a rough consumption estimation especially in environments
with a high fluctuation rate.
Nevertheless, there is a need to do storage accounting both for the provider
of storage resources and the users. It is the base for billing and for
the development of a storage infrastructure.
% While jobs typically have exclusive access
% to a set of resources (CPU cores) for a given amount of time, a storage
% resource is almost always shared
% among several parties, with each of these participants using a part of the
% resource. Furthermore, the usage of each participant will vary over time, and as
% data tend to be long-lived there is no fixed start and end time of the resource
% consumption.
% integrals and spikes
%To exactly measure the amount of consumed resource\ldots
\subsection{About this document}
In the following the StAR format
(\textbf{St}orage \textbf{A}ccounting \textbf{R}ecord) is described
that allows to report the usage of
storage resources. StAR tries to define a standardized way to exchange storage
consumption data. Though it was developed in the context of distributed storage
in a Grid environment it is not specific to such a storage infrastructure.
Due to StAR's generic approach it can be used in any storage environment.
This document does not address in detail how the StAR records should be used,
how the accounting data are aggregated
nor does it attempt to dictate the format in which the accounting records are
stored at a local site.
Furthermore, nothing is said regarding the communication mechanisms
employed to exchange the records, i.e. transport layer, framing,
authentication, integrity, etc.
\subsection{Record Structure \& Content}
The structure of the format described in this document can be split into
logical parts, each describing an aspect of the resource consumption.
The parts are
\begin{description}
\item[Resource:] Fields describing the system the resource was consumed on. They
can specify a certain subsystem of the storage system.
\item[Consumption Details:] Fields describing what the data is consuming, e.g.
storage classes, directory path, etc.
\item[Identity:] Describes the person or group accountable for the resource
consumption.
\item[Resource Consumption:] Fields describing how much of the described
resource has been used.
\end{description}
Please note, that these logical sections are not necessarily directly reflected
in record format. They are merely a good mental model to have in mind.
Most of the information in the record is common to all files, e.g. resource or
identity description. Certain fields are aggregates over the consumed
resources. This includes the consumption itself, but can also be consumption
details, such as number of files.
The record is not intended to be used for describing intricate information
about the consumption. E.g., file names, per-file data or application
metadata should not be included in the record. Such details are out of scope
for the record and are not important for accounting of resource consumption.
To see examples of records, see section~\ref{sec:record-examples}.
% this might be a seperate section sometime, for now it is here
\subsection{Including Additional Information}
It is allowed to add additional fields with information in the record, e.g.
for a more accurate description of the data that has consumed the resources.
If any user or group information is added it must be added under the
SubjectIdentity block. This makes it possible to automatically remove user
and group information.