lunar (1) collectl.1.gz

Provided by: collectl_4.3.1-1_all bug

NAME

       collectl - Collects data that describes the current system status.

SYNOPSIS

       Record Mode - read data from live system and write to file or display on terminal

       collectl [-f file] [options]

       Playback Mode - read data from one or more raw data files and display on terminal

       collectl -p file1 [file2 ...] [options]

OPTIONS

       Record Mode

       In  this  mode  data  is  taken from a live system and either displayed on the terminal or
       written to one or more files or a socket.

       --align
              If the HiRes modules is present, collectl sample monitoring will  be  aligned  such
              that  a  sample will always be taken at the top of a minute (this does NOT mean the
              first sample will occur then) so that all instances  of  collectl  running  on  any
              systems  which  have  their  clocks  synchronized will all take samples at the same
              time.  Furthermore, if one is doing process monitoring, those samples will also  be
              taken  at the top of the minute and so can delay the start of sampling up to 2 full
              process monitoring intervals.

       --all
              Collect summary data  for  ALL  subsystems  except  slabs,  since  slab  monitoring
              requires a different monitoring interval.  This also means you won't get any detail
              data which also includes processes and environmementals.  You can use  this  switch
              anywhere  -s  can  be  used  but  not both together.  If the system supports lustre
              and/or interconnect monitoring those statistics will be provided but  the  warnings
              produced  when  they  are  not available you try to select them with -s will not be
              displayed.

       --ALL
              This is actually a superset of --all by adding detail statistics as well  with  the
              exception  of  TCP  details  when  displaying  to  a  terminal since those are only
              available with -P or -f.

       -A, --address address[:port[:timeout]] | server[:port]
              In the first form, one specifies an address, optional port and timeout  (the  first
              colon  is  required to specify timeout for default port).  All data is then written
              to that socket prefaced with the current host name at the named  address  and  port
              until the socket is closed, at which time collectl will exit.

              In  the  second form one enters the text "server" and optional port.  In this form,
              collectl runs as a server, waiting for a connection  and  once  established  writes
              data  on  that  socket.   The  key difference here is if the client exists collectl
              keeps running and will again look for a new  connection,  allowing  it  to  survive
              client restarts or crashes.

              The default port is set at 2655 but can be changed - see collectl.conf.

              In  both  forms,  one  can  additionally request local data logging by specifying a
              combination of -P and -f.  See man collectl-logging for more details.

       --comment string
              Add the specified string to the end of the  headers  in  the  data  files.  If  any
              embedded  spaces  be  sure  to  quote  it.   This  can  be  very  useful when doing
              characterizations or benchmarking and you're frequently changing system/application
              parameters and restarting collectl between tests.

       -C, --config filename
              Name/location  of  the  collectl  configuration  file.   If not specified, collectl
              searches for collectl.conf first in /etc (the default), then in the same  directory
              the collectl executable is in, and finally the current working directory.

       -c, --count Samples
              The  number  of samples to record. This is one way of 3 ways of describing how long
              collectl should run (see -r and -R ).  Note that  these  3  switches  are  mutually
              exclusive.

       -D, --daemon
              Run  collectl  as  a daemon, primarily used when starting as a service.  One caveat
              about this mode is you can only run one copy.

       --export file[,options]
              This requests that collectl does not print anything on the terminal (or send it  to
              a  socket)  using  the  standard brief/verbose/plot formats.  Instead it executes a
              perl "require" on the named file, using an extension of ph if  not  specified.   It
              first  looks in the current directory and if not there the directory the executable
              is in.  It then calls the function "file"Init(options)  towards  the  beginning  of
              collectl  and  again as simply  "file"(@options) to generate the exported formatted
              output.  See the online documentation on Exporting Custom Output  and  Logging  for
              more details.

       -f, --filename Filename
              This  is  the name of a file to write the output to.  For details on how the output
              files  are  named,  see  the  File  Naming  section   of   the   documentation   on
              collectl.sourceforge.net OR /usr/share/doc/collectl/FileNaming.html

       -F, --flush seconds
              Flush  output  buffers after this number of seconds.  This is equivalent to issuing
              kill -s USR1 at the same frequency (but a lot easier!).  If 0, a flush  will  occur
              every data collection interval.

       --grep pattern
              The  main  purpose  of  this switch is for those users who have discovered there is
              some data in the raw files that never appears in any  display  and  have  taken  to
              displaying  it  themselves  with  grep.  Unfortunately this method does not include
              timestamps and so makes it difficult to interpret the results.  Even if you include
              the timestamp from the file it is in UTC and so needs to be translated to be of any
              real value.  This switch does just that and then some.

              Specifically, it allows you to  playback  a  file  and  instead  of  processing  it
              normally it simply searches for any entries that match the perl pattern and reports
              those lines prefaced with time stamps.  You can optionally change the  time  format
              with the usual -o options and can even select the timeframe with --from and --thru.

       --home
              Always  start  the  display  for the current interval at the top of the screen also
              known as the home position (non-plot format only).   This  generates  a  real-time,
              continously refreshing display when the data fits on a single screen.

       --import file1[,options][:file2[,options]...]
              This  loads  the  named  files  and  executes  callbacks  to them, which is the API
              mechanism for importing additional metrics into collectl.  See the webpage  on  the
              API for further detail.

              Since these files also include instructions for how to report the output in all the
              various forms, you will also need to include --import  during  playback.   Finally,
              since  the  default  is  to  seamlessly  include imported data with everything else
              collectl reports, if you ONLY want to display imported  data  you  much  explicitly
              deselect  all  other  subsystems  either  by including -s- (note the trailing minus
              sign) followed by all the subsystems were recorded OR simply say -s-all.

       -i, --interval interval[:interval2[:interval3]]
              This is the sampling interval in seconds.  The default is 10 seconds when run as  a
              daemon  and  1 second otherwise.  The process subsystem and slabs (-sY and -sZ) are
              sampled at the lower rate of interval2.  Environmentals (-sE), which only apply  to
              a  subset  of hardware, are sampled at interval3.  Both interval2 and interval3, if
              specified,  must  be  an  even  multiple  of  interval1.   The  daemon  default  is
              -i10:60:300  and  all  other  modes  are -i1:60:300.  To sample only processes once
              every 10 seconds use -i:10.

       --nohup
              Whenever collectl finishes a data collection interval, it  checks  to  see  if  the
              starting  parent  has  exited.   This is to prevent the case in which someone might
              start a copy of collectl and then the process dies and collectl keeps running.   If
              that  is  the  behavior  someone  actually intends, they should start collectl with
              --nohup.

              NOTE - when running as a daemon, --nohup is implied.

       --quiet
              Whenever collectl wants to tell the user something, it assigns  a  category  to  it
              such as Informational, Warning, Error or Fatal.  When run with -m, all messages are
              displayed for the user and if logging data to a file with -f,  these  messages  are
              also  sent  to  a  log  file  which  is in the data collection directory and has an
              extenion of "log".  However, if -m is not specified Informational messages (such as
              collectl  starting  or  stopping)  are not reported on the terminal but the other 3
              are.  Sometimes the warnings can be  annoying  and  one  can  suppress  these  with
              --quiet  though  they  will  still be written to the message log in -f.  You cannot
              suppress Error or Fatal errors.

       -r, --rolllogs time[[,days[:months]][,minutes]]
              When selected, collectl runs indefinately (or at least until the  system  reboots).
              The  maximum  number of raw and/or plot files that will be retained (older ones are
              automatically deleted) is controlled by the days field, the default is 7.  When  -m
              is also specified to direct collectl to write messages to a log file in the logging
              directory, the number of months to retain those logs is controlled  by  the  months
              field  and  its  default is 12.  The increment field which is also optional (but is
              position dependent) specifies the duration of  an  individual  collection  file  in
              minutes the default of which is 1440 or 1 day.

       --rawdskfilt
              This  switch  overrides  the  DiskFilter  setting  in  collectl.conf and explicitly
              defines a perl regx expression  against  which  records  from  /prod/diskstats  are
              selected  for  processing.  When there are a lot of disks to process, this can be a
              handy way to reduce the amount of data collected and actually  improve  performance
              since  there  are  less patterns to match each input record against.  Just remember
              that unlike --dskfilt which only filters during display, records filtered with this
              switch are never even recorded and so lost forever.

              You  can  optionally  specify  your  filter  with  a  leading plus-sign which tells
              collectl to just add your filter to the  default  specification.   Care  should  be
              taken  here  as  longer  filters  will slightly increase overhead and with a lot of
              disks and/or shorter monitoring intervals can add up.

              As a side benefit of this switch, if you really want to  look  at  partition  level
              stats you can do so by leaving off the trailing space in the default pattern.

              One must be also be careful in selecting the correct pattern since it's easy to get
              it wrong and you may  end  up  collecting  the  WRONG  data!   To  verify  you  are
              collecting  what  you  think you are, make a test run using -d4 to see the raw data
              being recorded in real-time.

       --rawdskignore
              This is the opposite of the rawdskfilt switch.  When specified any disks listed are
              completely  ignored  and will not appear in the raw file.  Typically this switch is
              useful when you're only interested in recording a subset of disk statistics.

       --rawnetfilt
              This works just like --rawdskfilt except  it  applies  to  networks.   Unlike  disk
              filtering  which has an explicit default pattern, the default for network filtering
              is to simply record all network data from /proc/net/dev.

              The -d4 switch also works here, as well as everywhere, to see the raw data as it is
              being collected.

       --rawnetignore
              This  is the opposite of the rawnetfilt switch and works just like the rawdskignore
              switch.  When specified any networks listed are ignored and will not appear in  the
              raw file.  Typically this switch is useful when you're only interested in recording
              a subset of network statistics.

       --rawtoo
              Only available in conjunction with -P, this switch causes the  creation/logging  of
              raw  data  in  addition  to plottable data.  While this may seem excessive, keep in
              mind that unlike plottable data,  raw  data  can  be  played  back  with  different
              switches  potentially  providing  more  details.   The  overhead  to write out this
              additional data is minimal, the only real cost being that of extra disk space.

       -R, --runas uid[:gid]
              This switch only works when running in daemon mode and so must be specified in  the
              DaemonCommands  line.   Its  presence will cause collectl to write the collectl.pid
              file into the same directory as its other output files as specified  by  -f,  since
              /var/run  does  not normally grant non-privileged users write access.  Furthermore,
              the ownership of that directory must match the specified ownership  since  collectl
              needs  to  write  ALL  it's files to that directory and can no longer assume global
              permissions when run as root.

              This WILL also  require  manually  modifying  /etc/init.d/collectl  to  change  the
              PIDFILE  variable  to  point  to  the  same  directory  which  the -f switch in the
              DaemonCommands line of collectl.conf points to.

              As  a  final  note  of  caution,  since  this  mechanism  changes  where   collectl
              reads/writes  its pid file, once you start using --runas, all calls to run collectl
              as a daemon must use it or it may be confused and exhibit unpredictable behavior.

       -R, --runtime duration
              Specify the duration of data collection where the duration is a number followed  by
              one  of  wdhms,  indicating  how  many  weeks,  days, hours, minutes or seconds the
              collection is to be taken for.

       --sep separator
              Specify the plot format separator - default is a space.  If this is a numeric field
              it  is  interpretted  as  the decimal value of the associated ASCII character code.
              Otherwise it is interpretted as the character itself.  In other  words,  "--sep  :"
              sets  the separator character to a colon and "--sep 9" sets it to a horizontal tab.
              "--sep 58" would also set it to a colon.

       --tworaw
              The switches -G  and  --group  have  been  replaced  by  --rawtoo,  which  is  more
              rescriptive  of  its  function.  When specified, it tells collectl to treat process
              and slab data as an entirely separate group of raw files, named with the  extention
              "rawp".   These separate files can be played back and processed just like any other
              collectl raw files and in fact one can even play back both at the same time if that
              is  what  is desired.  The only real purpose of this switch is that on some systems
              with many processes, it is possible to generate huge  raw  files  (some  have  been
              observerd  to  be  >250MB!) and while collectl will happily play back/process these
              files it can take a long time.  By using the --tworaw switch one still gets a  huge
              rawp  file,  but the normal raw file is a much more manageable size and as a result
              will faster to process then when all data is combined into the same file.

       Playback Mode

       In this mode, data is read from one or more data files that were generated in Record Mode

       --export Filename
              When playing back a file, use this switch to create an identical raw file differing
              only  in  the  timeframe being convered, so naturally one must also include --from,
              --thru or both.  Further, since the resultant file will contain the exact same  raw
              data  you  cannot select a subset using -s.  This switch is actually intended for a
              support function for situations where somone is having problems playing back a file
              and  a  subset  of  the  original  raw  file  that covers the problem time has been
              requested, hopefully allowing a significantly file to be posted or emailed.

       --extract filename
              If specified, rather than actually play back the file specified with  -p,  ALL  raw
              data  between  the  date  ranges is selected and a subset of that raw file created.
              The rules for how to interpret the filename are the same as used for -f.

       -f, --filename filename
              If specified, this is the name of a file  or  directory  to  write  the  output  to
              (rather  than the terminal).  See the description for details on the format of this
              field.  This requires the -P flag as well.

       --from time range
              Play back data starting with this time, which may  optionally  include  the  ending
              time  as  well, which is of the format of [date:]time[-[date:]time].  The leading 0
              of the hour is optional and if the seconds field is not specified is assumed to  be
              0.   If  no  dates  specified  the  time(s)  apply  to  each  file specified by -P.
              Otherwise the time(s) only apply to the first/last  dates  and  any  files  between
              those dates will have all their data reported.

       --full
              Full mode is actually a superset of --verbose and if selected will force --verbose.
              It will also force the RECORD separator to be printed for every  interval  even  if
              only  a  single  subsystem  was requested and to include the actual subsystems that
              follow following the utc timestamp as a parsing aid for those who may wish to parse
              the text output rather than the plot data.

       --offsettime seconds
              This  field  originally  was used before collectl reported the timezone in the file
              headers and allowed one to compensate.  Since then it is rarely  needed  except  in
              two  possible  cases,  one  in which data on two systems is to be compared and they
              weren't synchonized with ntp.  This allows all the times to be reported as  shifted
              by  some number of seconds.  The other case (and this is very rare) is when a clock
              had changed in the middle of a sample and will not be  converted  correctly.   When
              this  happens  one may have to play back the samples in pieces and manually set the
              time offset.

       --passwd filename
              When reporting usernames associated with a UID, use  this  file  for  the  mapping.
              This  is particularly important on systems running NIS where this are no user names
              in /etc/passwd.

       -p, --playback Filename
              Read data from the specified playback file(s), noting that one can use wildcards in
              the filename if quoted (if playing back multiple files to the terminal you probably
              want to include -m to see the filenames as they are processed).  The filename  must
              either  end in raw or raw.gz.  As an added feature, since people sometimes automate
              the running of this option and don't want to hard code a date, you can specify  the
              string  YESTERDAY  or TODAY and they will be replaced in the filename string by the
              appropriate date.

       --pname name
              By default, collectl uses the file /var/run/collectl.pid to indicate the pid of the
              running instance of collectl and prevent multiple copies from being run.  If you DO
              want to run a second copy, this switch will cause collectl to  change  its  process
              name to collectl-name and use that name as the associated pid file as well.

       --procanalyze
              When  specified  and  there is process data in the raw file, a summary file will be
              generated with one entry unique process containing such things  as  the  total  cpu
              consumed  for  both  user  and system, min/max utilization of various memory types,
              total page faults and several others.

       --slabanalyze
              When specified and there is slab data in the raw  file,  a  summary  file  will  be
              generated  with  one  entry unique slab containing data on physical memory usage by
              that slab.

       --thru time
              Time thru which to play back a raw file.  See --from for more

       Common Switches - both record and playback modes

       -d, --debug debug
              Control the level of debugging information, not typically used.   For  details  see
              the source code.

       -h, --help, -x, --helpext, -X, --helpall
              Display  standard,  extended  help  message  (which  doesn't  include  the optional
              displays such as  --showoptions,  --showsubsys,  --showsubopts,  --showtopopts)  or
              everything.

       --hr, --headerrepeat num
              Sets  the  number  of intervals to display data for before repeating the header.  A
              value -1 will prevent any headers from being displayed and a value of 0 will  cause
              only a single header to be displayed and never repeated.

       --iosize
              In brief mode, include iosize with disk, infiniband and network data.

       -l, --limits limit
              Override one or more default exception limits.  If more than one limit they must be
              separated by hyphens.  Current values are:

              SVC:value
                     Report partition activity with Service times >= 30 msec

              IOS:value
                     Report device activity with 10 or more reads or writes per second

              LusKBS:value
                     Report client or OSS activity greater than limit.  Only  applies  to  Client
                     Summary or OSS Detail reporting.  [default=100000]

              LusReints:value
                     Report  MDS  activity  with  Reint  greater than limit.  Only applies to MDS
                     Summary reporting.  [default=1000]

              AND
                     Both the IOS and SCV limits must be reached before  a  device  is  reported.
                     This is the default value and is only included for completeness.

              OR
                     Report device activity if either IOS or SVC thresholds are reached.

              -L, --lustsvcs [c|m|o][:seconds]
                     This  switch  limits  which  servics  lustre checks for and the frequency of
                     those checks.  For more information see the man page collectl-lustre.

       -m, --messages
              Write status to a monthly log file  in  the  same  directory  as  the  output  file
              (requires   -f   to  be  specified  as  well).   The  name  of  the  file  will  be
              collectl-yyyymm.log and will track various messages that may get  generated  during
              every run of collectl.

       -N, --nice
              Set priority to a nicer one of 10.

       -o, --options Options
              These  apply to the way output is displayed OR written to a plot file.  They do not
              effect the way data is selected for recording.  Most of these switches work in both
              record as well as playback mode.  If you're not sure, just try it.

              1
                     Data  in  plotting  format  should  use  1  decimal  point  of  precision as
                     appropriate.

              2
                     Data in plotting  format  should  use  2  decimal  points  of  precision  as
                     appropriate.

              a
                     Always  append  data  to  an  existing plot file.  By default if a plot file
                     exists, the playback file will be  skipped  as  a  way  of  assuring  it  is
                     associated  with  a  single  recorded  file.   This  switch  overrides  that
                     mechanism allowing muliple recorded files to be processed and written  to  a
                     single plot file.

              c
                     Always  open  newly named plot fies in create mode, overwriting any old ones
                     that may already exists.  If one processes multiple files for the  same  day
                     in  append  mode  multiple times, the same data will be appended to the same
                     file mulitple times.  This assures a new file is created at the start of the
                     processing.

              d
                     For  use  with  terminal  output  and  brief mode.  Preceed each line with a
                     date/time stamp, the date being in mm/dd format.  This option  can  also  be
                     applied  to  plot  formatit  which  will  cause  the date portion to also be
                     displayed in this format as opposed to D format.

              D
                     For use with terminal output and brief  mode.   Preceed  each  line  with  a
                     date/time stamp, the date being in yyyymmdd format.

              g
                     For  use with terminal output and brief mode.   When displaying values of 1G
                     or greater there is limited precision for  1  digit  values.   This  options
                     provides  a  way  to  display  additional  digits  for  more  granularity by
                     substituting a "g" for the decimal point rather than the trailing "G".

              G
                     For use with terminal output and brief mode.  This is  similar  to  "g"  but
                     preserves the trailing "G" by sacrificing a digit of granularity.

              m
                     Whenever times are reported in plot format, in the normal terminal reporting
                     format at the bginning of each  interval  or  when  when  one  of  the  time
                     reporting options (d, D, T or U is selected), append the milliseconds to the
                     time.

              n
                     Where appropriate, data such as disk KBs  or  transfers  are  normalized  to
                     units  per  second  by  taking  the  change in a counter and dividing by the
                     number of seconds in that  interval.   In  the  case  of  CPUs,  utilization
                     (calculated in jiffies) is normalized as a percentage of the interval.

                     Normalization can be disabled via this option, the result being the reported
                     values are not divided by  the  duration  of  the  interval.   This  can  be
                     particulary  useful  for reporting values that are < 1/2 the sampling, which
                     will be rounded to 0.

              T
                     For use with terminal output and brief mode, preceeds each line with a  time
                     stamp.

              u
                     Create  plot  files  with  unique  names  by  include the starting time of a
                     colletion in the name.  This forces multiple collections taken the same  day
                     to be written to multiple files.

              -U or --utc
                     In  plot  format only, report timestamps in Coordinated Universal time which
                     is more commonly know as UTC.

              x
                     Report only exception records for selected subsystems.  Exception  reporting
                     also  requires  --verbose.   Currently  this only applies to disk detail and
                     Lustre server information so one must select at least -s D, l or L for  this
                     to  apply.   If  writing to a detail file, this data will go into a separate
                     file with the extension X appended to the regular detail file name.

              X
                     Report both exceptions as well as all details for selected  subsystems,  for
                     -s D, l or L only.

              z
                     If  the  compression  library  has  been installed, all output files will be
                     compressed by default.  This switch  tells  collectl  not  to  compress  any
                     plottable  files.   If  collectl  tries  to  compress but cannot because the
                     library hasn't been installed, it will  generate  a  warning  which  can  be
                     suppressed with this switch.

       -P, --plot
              Generate output in plot format.  This format is space separated data which consists
              of a header (prefaced with a # for easy identification by an  analysis  program  as
              well as identifying it as a comment for programs, such as gnuplot, which honor that
              convention).  When written to disk, which is the typical way this option  is  used,
              summary  data  elements are written to the tab file and the detail elements written
              to one or more files, one per detail subsystem.  If -f is not specified, all output
              is sent to the terminal.  Output is always one line per sampling interval.

       --stats
              This  switch will cause brief data to be reported as both totals and averages after
              processing one or more files for the same day or in playback mode.

       --statopts option(s)
              This switch controls the way brief stats are reported, the default is to report the
              totals once, at the end of a day's worth of raw files, if more than one.

              a - include averages along with totals
              i - include the interval data itself, which is the equivalent of -oA
              s - print summary stats at the end of each file processed even if more than one per
              day

       -s, --subsys subsystem
              This field controls which subsystem data is to be collected or  played  back.   The
              default  for  collecting  data  is  "cdn",  which  stands for CPU, Disk and Network
              summary data and the default for playback is everthing that was collected.

              The rules for displaying results vary depending on the type of data  selected.   If
              you write data for CPUs and DISKs to a raw file and play it back with -sc, you will
              only see CPU data.  If you play it back with -scm you will still only see CPU  data
              since  memory  data  was  not collected.  However, when used with -P, collectl will
              always honor the subsystems specified with this switch so in the  previous  example
              you  will  see  CPU  data  plus  memory  data of all 0s.  To see the current set of
              default subsystems, which are a subset of this full list, use -h.

              You can also use + or - to add or subtract subsystems to/from the  default  values.
              For  example,  "-s-cdn+N"<  will  remove  cpu, disk and network monitoring from the
              defaults while adding network detail.

              Refer   to   data    definitions    on    the    sourceforge    website    OR    in
              /usr/share/collectl/doc/collectl-xxx  to  see  complete  descriptions  of  the data
              returned.

              SUMMARY SUBSYSTEMS

              b - buddy info (memory fragmentation)
              c - CPU
              d - Disk
              f - NFS V3 Data
              i - Inode and File System
              j - Interrupts
              l - Lustre
              m - Memory
              n - Networks
              s - Sockets
              t - TCP
              x - Interconnect
              y - Slabs (system object caches)

              DETAIL SUBSYSTEMS

              This is the set of detail data from which in most cases the  corresponding  summary
              data  is  derived.   There  are  currently  2  types that do not have corresponding
              summary data and those are "Environmental" and "Process".  So, if one has  3  disks
              and chooses -sd, one will only see a single total taken across all 3 disks.  If one
              chooses -sD, individual disk totals will be reported but no totals.  Choosing  -sdD
              will get you both.

              C - CPU
              D - Disk
              E - Environmental data (fan, power, temp),  via ipmitool
              F - NFS Data
              J - Interrupts
              L - Lustre OST detail OR client Filesystem detail
              M - Memory node data, which is also known as numa data
              N - Networks
              T - 65 TCP counters only available in plot format
              X - Interconnect
              Y - Slabs (system object caches)
              Z - Processes

       --showheader
              In  collectl  mode this command will cause the header that is normally written to a
              data file to be displayed on the terminal and collectl then exists.  This can be  a
              handy way to get a brief overview of the system configuration.

       --showoptions
              This  command  shows  only  the  portion  of the help text that desribes the -o and
              --options switches to save the time of wading through the entire help screen.

       --showcolheaders
              This command shows the first set of headers that will be printed  by  collectl  and
              exits.   Doesn't  really  make  sense for multi-section output like several sets of
              verbose or detail data.  Also note that since it requires one  monitoring  interval
              to build up some headers which may be dynamic, it also forces the interval to 0.

       --showsubopts
              List all the subsystem specifice options

       --showtopopts
              Show  all the different values for the --top type field, which specify the field(s)
              by to sort the data

       --showrootslabs
              This command only works on systems using the new slab allocator and will  list  the
              root  name  (these  are  those entries in /sys/slab which are not soft links) along
              with all its alias names.  If a name doesn't have an alias, it will not  appear  in
              this report.

       --showslabaliases
              This   command   only  works  on  systems  using  the  new  slab  allocator.   Like
              --showrootslabs, it will name a slab and all its aliases but rather than  show  the
              root  slab  name it will show one of the aliases to provide a more meaningful name.
              If there are any slabs that only have a single (or  no)  alias  they  will  not  be
              included in this report.

       --showsubopts
              Similar  to  --showoptions,  this  command summaries just the paramaters associated
              with -O and --subopts.

       --showsubsys
              Yet another way to summare a portion of the help  text,  this  command  only  shows
              valid subsystems.

       --top [type][,num[,v]]
              Include  the top "num" consumers by resource for this interval.  The default number
              is the height of the window if it can be determined otherwise 24, and  the  default
              resource  is  the  total  cpu time which is taken as the sum of SysT and UsrT.  See
              --showtopopts for a list of other types of data you can sort on.

              This switch can also be used with -s in which case  a  portion  of  the  window  is
              reserved  at  the  top to fill in the subsystem data, which is currently in verbose
              mode though a brief format is contemplated for some time in the future.

              In interactive mode and if not specified, the process monitoring interval  will  be
              set  to  that  for  other subsystems.  The screen will be cleared for each interval
              resulting in a display similar to the "top" utility.  In playback more  the  screen
              will NOT be cleared.  You cannot use this switch in "record" mode.

              Finally,  if  v  is  specified  as the 3rd parameter, the output scrolls vertically
              (like playbak mode) rather than clearing the screen between intervals.

       --umask mask
              Sets collectl's umask to control output file permissions.  Only root  can  set  the
              umask.  See "man umask" for details.

       --utime mask
              Write  periodic micro-timestamps into raw file at different points in time for fine
              grained measurements of operation times.
              1 - write timestamps when entering major sections
              2 - write timestamps for all /proc accesses except for process data
              4 - write timestamps for /proc data for all processes including threads

       -v
              Show version and whether or not Compression  and/or  HiResTime  modules  have  been
              installed and exit.

       -V
              Show  default  parmeter  and  control  settings,  all  of  which  can be changed in
              /etc/collectl.conf

       --verbose
              Display output in verbose mode.  This often displays more data than in the  default
              mode.   When  displaying  detail  data,  verbose  mode  is forced.  Furthermore, if
              summary data for a single subsystem is to be displayed in verbose mode, the headers
              are  only  repeated  occasionally  whereas if multiple subsystems are involved each
              needs their own header.

       -w
              Disply data in wide mode.  When displaying data  on  the  terminal,  some  data  is
              formatted followed by a K, M or G as appropriate.  Selecting this switch will cause
              the full field to be displayed.  Note that there is no attempt to align  data  with
              the column headings in this mode.

SUBSYSTEM OPTIONS

       The  following  options  are  subsystem  specific and typically filter data for collection
       and/or display as well as affect the output format:

       --cpufilt[^]perl-regx[,perl-regx...]
              Works the same as dskfilt and netfilt, allows one  to  select  a  subset  of  CPUs.
              These filters are also honored by interrupt reporting as well.

       --cpuopts
              z  -  only  applies  to cpu details, do not report any CPUs with no load.  In other
              words all entries are zero except for IDLE.

       --dskfilt [^]perl-regx[,perl-regx...]
              NOTE - this does NOT effect data collection  and  ALL  disk  data  will  always  be
              collected, unless --rawdskfilt is specified too.  However, only data for disk names
              that match the pattern(s) will be included in the summary totals and displayed when
              details  are  requested.  Alternatively, if you preface the first expression with a
              caret, all names that match all strings will be excluded from  the  summary  totals
              and detail displays rather then included.  If you don't know perl, a partial string
              will usually work too.

              Just remember, this only applies to collected  data  and  so  if  for  example  you
              specify  a  parition, such as sda1, you'll never see the data since it was filtered
              out at the time of data collection.  To see those  stats  you  would  need  to  say
              --rawdskfilt sda1.

       --dskopts
              f - report some columns as fractions for more precision on detail output
              i - display the i/o sizes in brief mode just like with --iosize
              o - exclude unused disks from new file headers and plot data
              z - only applies to disk details, do not report any lines with values of all zeros.

       --dskremap aaa:bbb,ccc:ddd...
              This  will  cause  disk names matching the perl pattern aaa to be replaced with the
              string bbb.  In some cases, you may simply want to  remove  the  entire  string  in
              which  case the second string should be left empty.  If you want to remove a string
              container a /, be sure to escape it with a backslash.

       --envopts Environmental Options
              The default is to display ALL data but the following will  cause  a  subset  to  be
              displayed

              f - display fan data
              p - display current (power) data
              t - display temperature data
              C - convert temperature to Celcius if in Farenheit
              F - convert temperature to Farenheit if in Celcius
              M - display each type of data on separate line
              T  -  display  data truncated to whole integers (some implemenations displayed them
              with fractional components)
              9 - any number, will tell ipmitool to read on this device number

       --envfilt regx If specified, this regx is evaluated against each line of data returned  by
       ipmitool and only those that match are retained.  All other data is lost.

       --envremap perl-regx,...
              If  specified  as  a  comma separated list of perl regular substitution expressions
              without the =~s portion, each expression is applied  to  each  environmental  field
              name,  thereby  allowing one to rename the column headers.  This can be most useful
              when running on heterogeneuos systems and you want consistent column names.

       --intfilt [^]perl-regx[,perl-regx...]
              NOTE - this does NOT effect data collection,  ALL interrupt  data  will  always  be
              collected.   However,  only  data  for interrupts that match the pattern(s) will be
              included  in  the  summary  totals  and  displayed  when  details  are   requested.
              Alternatively,  if  you  preface  the first expression with a caret, all names that
              match all strings will be excluded from the  summary  totals  and  detail  displays
              rather  then  included.  If you don't know perl, a partial string will usually work
              too.

              NOTE  -  these  expressions  are  applied  to  the  entire   line   one   sees   in
              /proc/interrupts,  including the interrupt number, name and even counters so if you
              do want to include an interrupt number in  the  pattern  be  sure  to  include  the
              trailing colon as well.

       --lustopts Lustre Options
              B - For clients and servers, show buffer stats
              D  -  For  MDSs  and OSTs AND running earlier versions of HPSFS, collect disk block
              iostats
              M - For clients, collect metadata
              O - For OSTs, show detail level stats
              R - For client, collect readahead stats

       --memopts Memory Options
              R - show memory values (including swap space) as rates  of  change  as  opposed  to
              absolute values.  One can also show absolute changes between intervals by including
              -on.

       --netfilt [^]perl-regx[,perl-regx...]
              NOTE - this does NOT effect data collection and ALL network  data  will  always  be
              collected,  unless  --rawnetfilt  is specified too.  Also note that by default only
              eth, ib, em and p1p networks when present are included in the summary.   When  this
              switch  is specified, only data for network names that match the pattern(s) will be
              included in the summary and displayed when  details  are  requested.   This  switch
              therefore also gives you the ability to add other, possibly new, network devices to
              the summary totals.

              Alternatively, if you preface the first expression with a  caret,  all  names  that
              match  all  strings  will  be  excluded from the summary totals and detail displays
              rather then included.  If you don't know perl, a partial string will  usually  work
              too.

       --netopts
              e - include network error counts in brief and explicit error types elsewhere
              E - only include lines with network errors in them
              i - include i/o sizes in brief mode
              o - exclude unused networks from new file headers and plot data
              w - set width of network device name

       --nfsfilt NFS Filters
              Specify  one  or  more  comma separated filters as a C/S followed by an nfs version
              number and only those will have data reported on.  For example, C2 says  to  report
              data  on V2 Clients.  As a data collection performance optimization, if one or more
              client filters are specified, data will actually be collected for all clients as is
              also done for servers.

       --nfsopts NFS Options q.RS z - only display detail lines which have data

       --procfilt Process Filters
              These  filters restrict which processes are selected for collection/display.  Using
              this filter will significanly reduce the load  on  process  data  collection  since
              collectl  creates  a  blacklist  of  those  existing processes that do not pass the
              filter and so are permanently excluded from any future processing.

              The format of a filter is a one charter type followed by a match string.   Multiple
              filters may be specified if separated by commas.

              c - substring of the command being executed as explicitly read from /proc/pid/stat.
              Note that this can actually be a perl expression, so if you  want  a  command  that
              ends  in  a  particular  string  all  you  need to is append a \$ to the end of the
              string.  Otherwise it would match any commands containing that string.
              C - any command that starts with the specified string
              f - full path of the command, including arguments, as read from  /proc/pid/cmdline.
              Like the c modifier this too can be a perl expression.
              p - pid
              P - parent pid
              u - any process ownerd by this user's UID or in the range specifide by uxxx-yyy
              U - any process owned by this username

              caution: the process names collectl tries to match with c and C is the second field
              in /proc/pid/stat which may not necessarily be what you think!  eg the name  for  X
              emacs is actually emacs-x

       --procopts options
              These  options  control  the  way  data  is  displayed  and  can  also improve data
              collection  performance

              c - include CPU time of children who have exited (same as ps -S)
              f - use cumulative totals for page faults in process data instead of rates
              i - show process I/O counters in display instead of default format
              I - disable collection of I/O counters, see note below
              k - remove known shells from process  names,  making  it  possible  to  see  actual
              command
              m - show breakdown of memory utilization instead of default format
              p - never look for new pids or threads during data collection
              r - show root command name only (no directory) for narrower display. Note that this
              is applied AFTER 'k' so if arg1 becomes the new command it will be  truncated  now,
              which is very handy when running in a virtual python environment
              R - show ALL process priorities ('RT' currently displayed if realtime)
              s - show process start time in hh:mm:ss format
              S - show process start time in mmmdd-hh:mm:ss format
              t - include ALL process threads (increases collection overhead)
              u - report username as 12 chars instead of 8, noting uxx will cause column width to
              be xx but cannot be less than 8
              w - widen display by including whole argument string, with optional max width
              x - include extended process attributes (currently only for context switches)
              z - exclude any processes with 0 in sort field (in --top mode)

              Process data is the most expensive type of data collected, costing  as  much  as  3
              times  the  CPU  load  as all other types of data combined.  Collecting thread data
              makes this even more expensive.  One can significantly reduce this load by over  25
              percent  by disabling the collection of I/O stats.  However, keep in mind that even
              if you don't try to optimize process data collection, the overall  system  load  by
              collectl  can  still  be  on  the order of about 0.2% when running as a daemon with
              default collection rates.  See the online documentation  on  measuring  performance
              for more information.

              A  security  hole  was  identified  that  allowed  non-priviledged  users  to  read
              /proc/pid/io and guess password lengths and noe many distros retrict access to  the
              owner  or  root.   As a result, non-priviledged users will see all 0 I/O counts for
              processes that are not theirs when specifying --procopt i.

       --slabfilt Slab Filters
              One can specify a list of slab names separated by commas and only those slabs whose
              names start with those strings will be listed or summaried.

       --slabopts Slab Options
              s - exclude any slabs with an allocation of 0
              S - only show those slabs whose allocations changed since last display

       --tcpfilt
              These filters actually control both what is collected as well as displayed.  If one
              selects non-collected filters, 0s will be reported.  There is one special case  and
              that  is  if one includes T (tcp extended stats) in the filter string, there are no
              brief ones and therefore --verbose will be forced.
              i - ip stats
              t - tcp stats
              u - udp stats
              c - icmp stats
              I - ip extended stats
              T - tcp excented stats

       --xopts
              i - include i/o sizes in brief mode

DESCRIPTION

       The collectl utility is a  system  monitoring  tool  that  records  or  displays  specific
       operating  system data for one or more sets of subsystems. Any set of the subsystems, such
       as CPU, Disks, Memory or Sockets can be included in  or  excluded  from  data  collection.
       Data  can  either  be  displayed back to the terminal, or stored in either a compressed or
       uncompressed data file. The data files themselves can either be in raw format (essentially
       a  direct  copy  from  the  associated /proc structures) or in a space separated plottable
       format such that it can be easily plotted using tools such  as  gnuplot  or  excel.   Data
       files  can  be  read  and  manipulated  from  the  command line, or through use of command
       scripts.

       Upon startup, collectl.conf is read, which sets a number of default parameters and  switch
       values.  Collectl searches for this file first in /etc, then in the directory the collectl
       execuable lives in  (typically  /usr/sbin)  and  finally  the  current  directory.   These
       locations  can  be  overriden  with  the  -C switch.  Unless you're doing something really
       special, this file need never be touched, the only exception perhaps being  when  choosing
       to  run collectl as a service and you wish to change it's default behavior which is set by
       the DaemonCommand entry.

RESTRICTIONS/PROBLEMS

       Thread reporting currently only works with 2.6 kernels.

       The pagesize has been hardcoded for perl 5.6 systems to 4096 for IA32 and  16384  for  all
       others.   If  you  are  running  5.6  on  a  system with a different pagesize you will see
       incorrect SLAB allocation  sizes  and  will  need  to  scale  the  numbers  you're  seeing
       accordingly.

       I  have  recently discovered there is a bug in /proc in that an extra line is occasionally
       read with the end of the previous buffer!  When this occurs a message is  written  (if  -m
       enabled)  and  always written to the terminal.  Since this happens with a higher frequency
       with process data I silently ignore those as the output can get pretty noisey.  If for any
       reason this is a problem, be sure to let me know.

       Since  collectl has no control over the frequency at which data gets written to /proc, one
       can get anomolous statistics as collectl is only reporting a snapshot  of  what  is  being
       recorded.  For more information see http://collectl.sourceforge.net/TheMath.html.

       At  least  one  network  card occasionally generates erroneous network stats and to try to
       keep the data rational, collectl tries to detect this and when it does generates a message
       that bogus data has been detected.

FILES, EXAMPLES AND MORE INFORMATION

       http://collectl.sourceforge.net OR /opt/hp/collectl/docs

ACKNOWLEDGEMENTS

       I  would  like  to  thank Rob Urban for his creation of the Tru64 Unix collect tool, which
       collectl is based on.

AUTHOR

       This program was written by Mark Seger (mjseger@gmail.com).
       Copyright 2003-2016 Hewlett-Packard Development Company, LP
       collectl may be copied only under the terms of either the  Artistic  License  or  the  GNU
       General Public License, which may be found in the source kit