swh:1:snp:af87cd67498ef4fe47c76ed3e7caffe5b61facaf
Raw File
Tip revision: f7d9d495e18c7dcdaf910f882cdedf62d7d17594 authored by Pere Mato on 06 July 2016, 16:36:49 UTC
Update ROOT version files to v6.06/06.
Tip revision: f7d9d49
pq2-ana-dist.1
.\"
.\" $Id:$
.\"
.TH PQ2-ANA-DIST 1 "Version 5" "ROOT"
.\" NAME should be all caps, SECTION should be 1-8, maybe w/ subsection
.\" other parms are allowed: see man(7), man(1)
.SH NAME
pq2-ana-dist \- Analyse the file distribution of a dataset (or a set of datasets) from a dataset meta-repository based on ROOT files
.SH SYNOPSIS
.B pq2-ana-dist
[options] datasets
.SH "DESCRIPTION"
This manual page documents briefly the
.BR pq2-ana-dist
program.
.PP
.B pq2-ana-dist
is a script invoking the
.B pq2
ROOT application to analyse the file distribution of a dataset (or a set of datasets) over the file
servers, eiher in terms of files or of file sizes. The output is a text file with the the file movements
needed to make the file distribution even in the chosen metrics to be used, for example, as in input
to \fIpq2-redistribute\fR(1). Optionally the internal objects can be saved so that they can be used as starting
point for a subsequent run. An histogram and a plot can also be saved to visualize the file distribution.
The repository with the dataset information can be accessed via the local file
system or a remote file server daemon or a PROOF facility.
.PP
More details about the underlying 'pq2' application can be found in the man page \fIpq2\fR(1).
.SH ARGUMENTS
.TP
\fIdatasets\fR
Comma-separated list of datasets to be analysed; the '*' wild card in the items (in such a case the
full string - as shown by pq2-ls - should be given in quotes, e.g. '/default/ganis/h1-set5*'.
.SH OPTIONS
.TP
\fB-h\fR, \fB--help\fR
Display help information.
.TP
\fB-k\fR, \fB--keep\fR
Keep the temporary files created during the analysis under $TMPDIR
.TP
\fB-v\fR
Verbose mode
.TP
\fB--dataset\fR=<\fIdatasets\fR>
Alternative way to define the datasets to be analysed.
.TP
\fB-s\fR <\fIservers\fR>, \fB--servers\fR=<\fIservers\fR>
Comma-separated list of servers to be used (-s) in the analysis; a '+' in front of the list adds the
specified servers to the existing ones: this can be useful when determining file movements to empty or
new servers
.TP
\fB-e\fR <\fIexcsrvs\fR>, \fB--exclude\fR=<\fIexcsrvs\fR>
Comma-separated list of servers to be excluded from the target servers; this can be used, for example,
to determine the files movements to drain a server.
.TP
\fB-i\fR <\fIignsrvs\fR>, \fB--ignore\fR=<\fIignsrvs\fR>
Comma-separated list of servers to be ignored in the analysis; this can be used, for example, to skip
the redirector.
.TP
\fB-m\fR <\fImetrics\fR>, \fB--metrics\fR=<\fImetrics\fR>
Metrics to be used to calculate the degree of evenness:
.nf
    F          use the number of files (default)
    S          use the file size
.fi
.TP
\fB-f\fR <\fIfilemv\fR>, \fB--filemv\fR=<\fIfilemv\fR>
Defines the file where to save the result of the analysis; by the default the result is send to the screen.
The output contains one line per each file that needs to be moved with the format 'file source destination'
where 'file' is the file name, 'source' is the source server URL and 'destination' is the destination server URL.
The file can be used as input, for example, to \fIpq2-redistribute\fR(1).
.TP
\fB--plot\fR[=<\fIfileplot.fmt\fR>]
Defines the file with the output plot with the original distribution with the server names and the +-10% limits; the extension (if known) defines the format; the default format is 'png' and the default name 'plot.png'.  The plot can  also  be  obtained  directly from a binary output file (saved with '--fout=<outfile>.root') but just specifying '--fin=<outfile>.root --plot'. The available formats are those known by ROOT: png (default), eps, ps, pdf, svg, gif, xpm, jpg, tiff.
.TP
\fB--fout\fR[=<\fIoutfile\fR>]
Defines the file where to save the output of the analysis in binary form (ROOT file); this output can be
used as starting point for a next run, allowing to run over many datasets in separate steps.
.TP
\fB--fin\fR[=<\fIinfile\fR>]
Defines the ROOT file from where to fetch the output of a previous run (saved with --fout=infile).
.TP
\fB-u\fR <\fIserverurl\fR>, \fB--url\fR=<\fIserverurl\fR>
URL of the PROOF master or data server providing the information; for data servers, it must include the directory.
Can also be specified via the environment variables PQ2PROOFURL or PQ2DSSRVURL (see ENVIRONMENT VARIABLES)."
.TP
\fB-t\fR <\fIdir\fR>, \fB--tmpdir\fR=<\fIdir\fR>
Directory for temporary files; defualt is /tmp/<username>.
.SH FORMAT OF THE FILE WITH MOVE DIRECTIVES
The out file constist of one line per file to be moved with the following format:
.nf

file-name       source-server-URL       destination-server-URL
.fi
.SH "ENVIRONMENT VARIABLES"
See \fIsetup-pq2\fR(1).
.SH "SEE ALSO"
\fIpq2\fR(1), \fIsetup-pq2\fR(1), \fIpq2-ls\fR(1), \fIpq2-ls-files\fR(1),
\fIpq2-ls-files-server\fR(1), \fIpq2-info-server\fR(1),
\fIpq2-redistribute\fR(1), \fIpq2-verify\fR(1), \fIpq2-rm\fR(1), \fIpq2-cache\fR(1)
.PP
For more information on the \fBROOT\fR system, please refer to
.UR http://root.cern.ch/
.I http://root.cern.ch
.UE
.SH "ORIGINAL AUTHORS"
Gerardo Ganis for the ROOT team.
.SH "COPYRIGHT"
This library is free software; you can redistribute it and/or modify
it under the terms of the GNU Lesser General Public License as
published by the Free Software Foundation; either version 2.1 of the
License, or (at your option) any later version.
.P
This library is distributed in the hope that it will be useful, but
WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
Lesser General Public License for more details.
.P
You should have received a copy of the GNU Lesser General Public
License along with this library; if not, write to the Free Software
Foundation, Inc., 51 Franklin St, Fifth Floor, Boston, MA  02110-1301  USA
.SH AUTHOR
This manual page was originally written by Gerardo Ganis <gerardo.ganis@cern.ch>, for ROOT version 5.
back to top