LHC Programme Coordination Home Page

Rules and definitions for the "Massi" summary files

Drafted by
Colin.Barschel@cern.ch
Massimiliano.Ferro-Luzzi@cern.ch

updated by
Christoph.Schwick@cern.ch
Jamie.Boyd@cern.ch
Federico.Alessio@cern.ch

Modalities:

Summary files: the experiments provide data for each physics fills which had at least some time spent in physics, i.e. in the beam mode 'Stable Beams' (or 'Unstable Beams', if ever used). The data are requested only for the time spent in physics, but data during non-physics are also welcome. A flag in each file can identify if the data are collected in 'Stable Beams' mode or not.

The data will be put on experiment-owned directories (before AFS, after EOS) as quickly as possible at the end of a fill (e.g. within 24h, but ideally a few minutes after the change of a fill number).

The data files can be updated, if require: e.g. if better calibrations constants become available, if a more precise analysis technique leads to better results or to cope with occasional problems with DAQ, fill numbers, etc..

Data directories

ALICE:   alicedir = /eos/project/a/alice-lhcif/massiFiles/<year>/<lumi|lumiregion|beamgas>
ATLAS: atlasdir = /eos/project/a/atlas-datasummary/public/lumifiles/<year>/<lumi|lumiregion|beamgas>
CMS:     cmsdir   = /eos/project/c/cms-runcoordination/LHCFILES/<year>/<lumi|lumiregion|beamgas>
LHCb:    lhcbdir   = /eos/project/l/lhcbwebsites/www/online/TFC/LPCfiles/<year>/<lumi|lumiregion|beamgas>

where

<year> is the 4 digit year in which the data is produced, followed by one of the subdirectories <lumi|lumiregion|beamgas> which contain the data for various "topics" organized with the specfications of the 'Massi Files' (see below).

Data in each of these subdirectories are provided in one compressed file per fill. The name of the file is the fill number. The extension can be either tgz or zip, and it should be agreed beforehand to apply the correct decompressing algorithm at the level of the unpacking script. When the archive file is expanded, one directory carrying the fill-number as a name is created and all data files are contained in this directory.

Example: ATLAS lumi data for fill 1234 in year 2006 will be provided in the directory /eos/project/a/atlas-datasummary/public/lumifiles/2006/lumi/1234.tgz
Expanding the tgz file will result in one subdirectory called 1234/ under .../2006 in which all data files are contained.

The LPC tools copies these data-files into an EOS area which is readable for everybody at CERN and from where users are supposed to get these files. This area is under the following path:

/eos/project/l/lpc/public/MassiFiles/<year>/measurements/<experiment>/<lumi|lumiregion|beamgas>

Filenames:

This section defines the names of the files which are contained in the archives described above. Tha changing parameters to compose a filename are the following:

<fillnr> denotes the fill-number (4 or 5 digits)

<expt> denotes the experiments name: ALICE, ATLAS, CMS or LHCb

<rfbktbeam1> and <rfbktbeam2> denote the RF bucket of a specific bunch in beam 1 or 2. The valid range it 1 ... 35631 (i.e. bucket 1 denotes the first bucket after the abort gap where protons might be contained. Ideally bucket 1 is also bunch id 1.)

Luminosity data:

Files listed in this section are contained in the archive under the .../<year>/lumi/ subdirectory.

<fillnr>_lumi_<expt>.txt
- Contains multiple entries of luminosity integrated over a short time interval defined by the experiment.
<fillnr>_lumi_<rfbktbeam1>_<expt>.txt
- Contains multiple entries of luminosity per bunch integrated over a short time interval defined by the experiment.
<fillnr>_summary_<expt>.txt
- Contains the integrated luminosity during stable beams for the given fill. Usually one entry, but can be more if an ADJUST mode happens in between two STABLE BEAMS mode in the same fill.

Luminosity-region data:

Files listed in this section are contained in the archive under the .../<year>/lumiregion/ subdirectory.

<fillnr>_lumireg_<expt>.txt
- Contains mutiple entries with data about the luminous region in the experiment averaged over all colliding bunches. Data is averaged over short time intervals defined by the experiment.
<fillnr>_lumireg_<rfbktbeam1>_<expt>.txt
- Contains multiple entries with data about the luminous region in the experiment for a specific colliding bunch. Data is averaged over a time interval defined by the experiment. This file is optional.

Beam gas data for beam profile measurements:

Files listed in this section are contained in the archive under the .../<year>/beamgas/ subdirectory. These files have only been provided by LHCb, however other experiments may re-evaluate if they can also provide this data.

<fillnr>_beam1_<expt>.txt and <fillnr>_beam2_<expt>.txt
- Contains multiple entries of beam profile data extracted from beam gas interactions averaged over all non-colliding bunches for each beam. Data is averaged over time intervals defined by the experiment.
<fillnr>_beam1_<rfbktbeam1>_<expt>.txt and <fillnr>_beam2_<rfbktbeam2>_<expt>.txt
- Contains multiple entries of beam profile data extracted from beam gas interactions for a specific non-colliding bunch of each beam. Data is averaged over time intervals defined by the experiment. This file is optional.

Meta data:

Data in the Massi files should reflect the best possible data for a given fill in any moment. This implies that experiments will updated the data when more precise offline analysis results become available, or if the calibration for an ongoing (or past) year had been updated. In order to let users know about these updates so that they can judge the quality of the data for their analysis in each compressed archive, files with meta information is provided:

version.txt
- This files contains a single version number identifying unambiguously the data set in the archive.
version_annotation.txt
- This file contains a textual description of all versions existing for the archive (also those which have become obsolete). With this annotation a user can track the history and processing steps for the data in an archive.

File contents:

Definitions and rules for the following sections:

Coordinates: Please use the following coordinate frame:

Z>0 : lhc-clock-wise (seen from above)
Y>0 : toward sky (approximately vertical)
X>0 : toward outside of lhc ring
(X,Y,Z)=(0,0,0) : at your nominal IP. X and Z should lie in the LHC plane (-X should point to the center of LHC).
All lengths in mm.
All angles in rad.
Time granularity: to be chosen by data provider such that the data have a reasonable scatter (approximaly 5% point-to-point errors).
Data must be time-ordered.
Lumi is the instantaneous lumi delivered (no dead time, etc.).
Definition of specific lumi:
- L_spec = < L_spec_i > = < L_bi/ (N_1i * N_2i) >
  L_bi = lumi for colliding pair i
  N_ji = elementary charges in bunch i in beam j
  <*> = average over all colliding pairs at your IP, i=1,...n_c e.g. <Q> = (1/n_c) * sum_over_i{Q_i}
For ions: still use elementary charges (not Pb particles)

Meta data files:

The file version.txt

This file contains a version number for the data in the relevant archive. Version numbers have the following format:

a[.b[.c[.d...]]][._u[.v...]]

where the letters denote integer numbers and everything in [ ] is optional. Numbers without a leading underscore (in the following named "plain" version numbers) have fixed meanings over the year with an experiment. For example an experiment could use for 'a' either 1 or 2 for "raw online values" or "values from offline processing". 'b' could be used to denote the calibration which might change some times over the year e.g. 1 for initial calibration, 2 for the calibration available 2 weeks after the first VdM scan and 3 for a new calibration available in September.

Version numbers with a leading '_' denote special features for a fill or a series of fills. For example if a lumi detector has a technical problem for some fills in a year this could be marked with a version _1. For a given fill an underscore integer may be defined only once, i.e. a version number like 1.2._1._2 is ok whereas 1.2._1._1 is illegal. The reason is that underscore numbers may be combined in the version number in arbitrary sequence.

The file version_annotation.txt

This file contains a textual description of the meaning of the version numbers. The file contains multiple lines formatted in one of the following ways:

n : description of the meaning of the nth integer in the version number
n u : description of the meaning for the nth integer having the value u
_k : meaning of the _k version number

An example could look like this:

1 : denotes the algorithm used to derive the luminosity data
2 : denotes the calibration used to obtain the luminosity data
1 1 : raw online luminometer data; available some hours after the fill
1 2 : offline processing of the silicon tracker data; highest precision; available after approx. 2 weeks
_1 : the main luminometer was down for 50% of the time. Online data is derived from calorimeters as a backup.

Additional Rules:

Annotations for version numbers with a leading '_' should only be given in the files of fills where they are used. On the contrary for the plain version numbers all annotations used up to that point in the year should be listed. This means for the plain version numbers it should be sufficient to look into the annotation file of the last fill of the year to get a full description of all plain numbers used.

If during the year new values for a plain version numberrs become necessary it is allowed to add these values from the first fill onwards which uses the new value. It is only allowed to add information to the annotation file, not to change or to delete information from it (if the latter becomes necessary for some reason, the LPC should be informed since all Massi files need to be re-scanned)

If during the ongoing year it turns out that an experiments needs an additional plain version number it is allowed to add this number from the fill onwards where it is needed without updating all the previous fills. It would be good to add in the description of the plain version number a comment which explains what the situation before the introduction of the version number was.

An example could look like this:

In June experiment X re-aligns all lumi detectors in a technical stop and wants to communicate this fact to users of the Massi Files. The first physics fill with the new alignment is 8765. They introduce a new plain version number (e.g. the third since they use already 2 numbers for other things). Fills with the newly aligned lumi detectors will get the version number 1 for this new number. In the file version_annotation.txt two new lines are added. The first one describing the new version number and the second line describing the meaning of this version number to be 1, i.e.

3 : This number describes changes in the alignment of the lumi detectors. Fills with this numbers missing (before Fill 8765) have the initial alignment of the year.
3 1 : A new alignment of the lumi detector was introduced during TS2. The first fill with this alignment is fill 8765.

An overview table for all versions on a web page accessible from the LPC homepage (Massi File versioning information) is automatically generated and updated when new Massi Files or updated Massi files are coming in.

Further comments on this design can be read here .

The additional workload to maintain these version files should be kept small and the procedure is simple: the annotation for the plain version numbers only requires to add lines to the file when a new version number is used (or a new version number is introduced). The system allows for a very simple versioning but also allows for more detailed information to be communicated to the users of the files.

The system allows users to write simple filters with scripts in order to process only data of their choice. This should be particularly simple when filtering on plain versions. The cases annotated with the version numbers preceded by an underscore need to be handled case by case.

Each lumi file contains:

time stab l dl sl dsl

where

time = UNIX time, i.e. in seconds since UTC Jan 1, 1970, 00:00:00, not counting leap seconds.
stab = stable beam flag: a float value between 0 and 1 which corresponds to the fraction of time spent in stable beams for this time bin.
l = luminosity in Hz/ub
dl = point-to-point error on luminosity in Hz/ub
sl = specific luminosity in Hz/ub (see above)
dsl = point-to-point error on specific luminosity in Hz/ub

Each lumireg file contains:

time stab x dx y dy z dz xsu dxsu ysu dysu zsu dzsu ax dax ay day

where

time   = UNIX time, i.e. in seconds since UTC Jan 1, 1970, 00:00:00, not counting leap seconds.
stab    = stable beam flag: a float value between 0 and 1 which corresponds to the fraction of time spent in stable beams for this time bin.
x, y, z = x-centroid, y-centroid, z-centroid in mm
dx, dy, dz   = "one sigma" errors on x-centroid, y-centroid, z-centroid in mm
xsu, ysu, zsu = x-sigma, y-sigma, z-sigma, after unfolding the vtx detector resolution in mm
dxsu, dysu, dzsu = "one sigma" errors on xsu, ysu, zsu in mm
ax, ay   = angle in x and y when projecting the lumi region on the xz and yz planes in rad
dax, day   = "one sigma" errors on ax, ay in rad

The coordinate system is that of the LHC beams. Local system must be converted into the LHC system.

Beam gas based profile measurement files for beam 1 and beam 2 contain:

time stab x dx y dy ax dax ay day xsu dxsu ysu dysu

where

time = UNIX time, i.e. in seconds since UTC Jan 1, 1970, 00:00:00, not counting leap seconds.
stab   = stable beam flag: a float value between 0 and 1 which corresponds to the fraction of time spent in stable beams for this time bin.
x, y   = x-centroid, y-centroid at z=0 in mm
dx, dy   = "one sigma" errors on x-centroid, y-centroid in mm
ax, ay   = angle in x and y when projecting the beam axis on the xz and yz planes in rad
dax, day   = "one sigma" errors on ax, ay in rad
xsu, ysu   = x-sigma, y-sigma, after unfolding the vtx detector resolution in mm
dxsu, dysu = "one sigma" errors on xsu, ysu in mm

The summary file contains:

a number of lines, one per "segment" of physics periods (normally, there is one segment per fill, but we have to cope with the possibility of Stable Beams interruptions during the fill). Each line contains:

timestart, timestop, plu, lui

where

timestart = start of stable beams of this segment in seconds UNIX time.
timestop = stop of stable beams of this segment in seconds UNIX time.
plu = peak lumi in Hz/ub in this segment
lui = integrated lumi in 1/ub in this segment

Comments, headers, footers, etc can be added to all files. The number sign '#' is to be used to signify this. Any extra information is to be put behind a '#' and finish with an end of line. This could be used for example to put the date/time/version of production of your file, your internal run numbers, or anything else needed.