Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improved documentation about time stamped data #719

Merged
merged 11 commits into from
Jan 23, 2020
3 changes: 3 additions & 0 deletions manual/source/design.rst
Original file line number Diff line number Diff line change
Expand Up @@ -121,6 +121,9 @@ an array of several dimensions.
``data`` (*NX_NUMBER*)
Data values from the detector, ``units="NX_ANY"``

In the case of streaming data acquisition, when time stamped values of data are collected, fields can be replaced with :ref:`NXlog` structures of
the same name. For example, if time stamped data for wavelength is being streamed, wavelength would not be an array but a :ref:`NXlog` structure.


.. index::
! single: field attribute
Expand Down
13 changes: 8 additions & 5 deletions manual/source/introduction.rst
Original file line number Diff line number Diff line change
Expand Up @@ -14,18 +14,21 @@ to define a common data exchange format for neutron, X-ray, and muon experiments
NeXus is built on top of the scientific data format HDF5 and adds
domain-specific rules for organizing data within HDF5 files in addition
to a dictionary of well-defined domain-specific field names. The NeXus
data format has two purposes:
data format has three purposes:

#. *raw data*:
NeXus defines a format that can
serve as a container for all relevant data associated with a scientific
instrument or beamline.
This is a very important use case.
instrument or beamline. This is a very important use case. This includes
the case of streaming data acquisition, where time stamped data are logged.
#. *processed data*:
NeXus also defines standards for processed data. This is data which has underwent some form of data
reduction or data analysis. NeXus allows to store the results of such processing together with
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

allows to use can

documentation about how the processed data was generated.
#. *standards*:
NeXus defines standards in
the form of *application definitions* for the exchange of data
between applications. NeXus provides structures for raw experimental
data as well as for processed data.
between applications. NeXus provides standards for both raw and processed data.

A community of scientists and computer programmers working in neutron
and synchrotron facilities around the world came to the conclusion that a
Expand Down
16 changes: 16 additions & 0 deletions manual/source/rules.rst
Original file line number Diff line number Diff line change
Expand Up @@ -368,6 +368,22 @@ dimensions ``xraster, yraster, xsize, ysize``.
.. warning:: Be warned: if you use the 2D rasterisation method with ``xraster, yraster`` you may end up with
invalid data if the scan is aborted prematurely. This cannot happen if the first method is used.


Streaming Data Acquisition And Logging
======================================

More and more data is collected in streaming mode. This means that time stamped data is logged for one or more inputs,
possibly together with detector data. Another use case is the logging of parameters, for example temperature, while a long
running data collection is in progress. NeXus covers this case too. There is one simple rule for structuring such files:

Just use the standard NeXus raw data file structure for such data. But replace the streamed or logged data element
benajamin marked this conversation as resolved.
Show resolved Hide resolved
in its place in the hierarchy through a :ref:`NXlog` or a :ref:`NXevent_data` structure of the same name.
benajamin marked this conversation as resolved.
Show resolved Hide resolved

For example, consider your instrument is streaming detector images against a magnetic_field on the sample. In this case both
NXsample/magnetic_field and NXdetector/data would become NXlog structures instead of simple arrays. The NXlog structure will
have the same name as the NeXus field name.


NXcollection
============

Expand Down
21 changes: 16 additions & 5 deletions manual/source/strategies.rst
Original file line number Diff line number Diff line change
Expand Up @@ -149,10 +149,17 @@ which also allows "energy" to be specified if one is so inclined.
Strategies: Time-stamped data
#############################
*How should I store time-stamped data?*
Time-stamped data can stored in both :ref:`NXlog` and :ref:`NXevent_data`.
:ref:`NXevent_data` is used for storing neutron event data and :ref:`NXlog` would be used
for storing any other time-stamped data, e.g. sample temperature, chopper
top-dead-centre, motor position etc.

Time-stamped data can be stored in both :ref:`NXlog` and :ref:`NXevent_data` structures.
benajamin marked this conversation as resolved.
Show resolved Hide resolved
Of the two, :ref:`NXlog` is the most important one, :ref:`NXevent_data` is used only for storing neutron event data
and :ref:`NXlog` would be used for storing any other time-stamped data, e.g. sample temperature, chopper top-dead-centre,
motor position, detector data etc.

Regarding the NeXus file structure to use, there is one simple rule: just use the standard NeXus file structure but replace
benajamin marked this conversation as resolved.
Show resolved Hide resolved
the fields for streamed data elements through :ref:`NXlog` or :ref:`NXevent_data` structures. For example, consider the
collection of detector images against a change in the magnetic field on the sample. Then, both NXsample/magnetic_field and
NXdetector/data would be :ref:`NXlog` structures containing the time stamped data.


Both :ref:`NXlog` and :ref:`NXevent_data` have additional support for storing time-stamped
data in the form of cues; cues can be used to place markers in the data that allow one to
Expand All @@ -174,7 +181,11 @@ to grab the data between 20 seconds and 40 seconds, and then trim that data to g
want.
Obviously in this simple example this does not gain us a lot, but it is easy to see that in a
large dataset having appropriately placed cues can save significant computational time when looking
up values in a certain time-stamp range.
up values in a certain time-stamp range. NeXus has actually borrowed the cueing table concept
from video file formats where it allows viewing software to quickly access your most favourite scene.
benajamin marked this conversation as resolved.
Show resolved Hide resolved
Correspondingly, cueing in NeXus allows you to quickly access your most favourate morsel of time stamped
benajamin marked this conversation as resolved.
Show resolved Hide resolved
data.


In the NeXus Features repository, the feature `ECB064453EDB096D <https://github.com/nexusformat/features/tree/b0f4862f267844a3f66efa701953e684978b0959/src/recipes/ECB064453EDB096D>`_
shows example code that uses cues to select time-stamped data.
Expand Down