Provided by: slurmdbd_17.11.2-1build1_amd64 bug

NAME

       slurmdbd.conf - Slurm Database Daemon (SlurmDBD) configuration file

DESCRIPTION

       slurmdb.conf is an ASCII file which describes Slurm Database Daemon (SlurmDBD) configuration information.
       The  file  location  can  be  modified  at system build time using the DEFAULT_SLURM_CONF parameter or at
       execution time by setting the SLURM_CONF environment variable.

       The contents of the file are case insensitive except for the names of nodes and files. Any text following
       a "#" in the configuration file is treated as a comment through the end of that  line.   Changes  to  the
       configuration  file  take  effect  upon restart of SlurmDbd or daemon receipt of the SIGHUP signal unless
       otherwise noted.

       This file should be only on the computer where SlurmDBD executes and should only be readable by the  user
       which  executes  SlurmDBD  (e.g. "slurm").  If the slurmdbd daemon is started as user root and changes to
       another user ID, the configuration file will initially be read as user root, but  will  be  read  as  the
       other  user  ID  in  response to a SIGHUP signal.  This file should be protected from unauthorized access
       since it contains a database password.  The overall configuration parameters available include:

       ArchiveDir
              If ArchiveScript is not set the slurmdbd will generate a file that can be  read  in  anytime  with
              sacctmgr  load  filename.  This directory is where the file will be placed after a purge event has
              happened and archive for that element is set to true.  Default is /tmp.  The format for this files
              name is $ArchiveDir/$ClusterName_$ArchiveObject_archive_$BeginTimeStamp_$endTimeStamp

       ArchiveEvents
              When purging events also archive them.  Boolean, yes to archive event data, no otherwise.  Default
              is no.

       ArchiveJobs
              When purging jobs also archive them.  Boolean, yes to archive job data, no otherwise.  Default  is
              no.

       ArchiveResvs
              When  purging  reservations  also  archive  them.   Boolean,  yes  to archive reservation data, no
              otherwise.  Default is no.

       ArchiveScript
              This script can be executed every time a rollup happens (every hour, day and month), depending  on
              the  Purge*After  options.  This script is used to transfer accounting records out of the database
              into an archive.  It is used in place of the internal process used to archive objects.  The script
              is executed with a no arguments, The following environment variables are set.

              SLURM_ARCHIVE_EVENTS
                     1 for archive events 0 otherwise.

              SLURM_ARCHIVE_LAST_EVENT
                     Time of last event start to archive.

              SLURM_ARCHIVE_JOBS
                     1 for archive jobs 0 otherwise.

              SLURM_ARCHIVE_LAST_JOB
                     Time of last job submit to archive.

              SLURM_ARCHIVE_STEPS
                     1 for archive steps 0 otherwise.

              SLURM_ARCHIVE_LAST_STEP
                     Time of last step start to archive.

              SLURM_ARCHIVE_SUSPEND
                     1 for archive suspend data 0 otherwise.

              SLURM_ARCHIVE_TXN
                     1 for archive transaction data 0 otherwise.

              SLURM_ARCHIVE_USAGE
                     1 for archive usage data 0 otherwise.

              SLURM_ARCHIVE_LAST_SUSPEND
                     Time of last suspend start to archive.

       ArchiveSteps
              When purging steps also archive them.  Boolean, yes to archive step data, no  otherwise.   Default
              is no.

       ArchiveSuspend
              When  purging  suspend  data also archive it.  Boolean, yes to archive suspend data, no otherwise.
              Default is no.

       ArchiveTXN
              When purging transaction data also archive it.  Boolean,  yes  to  archive  transaction  data,  no
              otherwise.  Default is no.

       ArchiveUsage
              When purging usage data (Cluster, Association and WCKey) also archive it.  Boolean, yes to archive
              transaction data, no otherwise.  Default is no.

       AuthInfo
              Additional  information  to  be  used  for authentication of communications with the Slurm control
              daemon (slurmctld) on each cluster.   The  interpretation  of  this  option  is  specific  to  the
              configured  AuthType.   In  the  case  of auth/munge, this can be configured to use a Munge daemon
              specifically configured to provide authentication between clusters while the default Munge  daemon
              provides  authentication  within  a  cluster.  In that case, this will specify the pathname of the
              socket to use. Per  default  this  value  is  left  unspecified,  which  results  in  the  default
              authentication mechanism being used.

       AuthType
              Define  the  authentication method for communications between Slurm components.  Acceptable values
              at present include "auth/none" and "auth/munge".  The default value is  "auth/none",  which  means
              the  UID  included  in  communication  messages  is  not  verified.   This may be fine for testing
              purposes, but do not use "auth/none" if you desire  any  security.   "auth/munge"  indicates  that
              LLNL's  Munge system is to be used (this is the best supported authentication mechanism for Slurm,
              see "https://code.google.com/p/munge/" for more information).  SlurmDBD must be  terminated  prior
              to changing the value of AuthType and later restarted.

       CommitDelay
              How  many  seconds  between commits on a connection from a Slurmctld.  This speeds up inserts into
              the database dramatically.  If you are running a very high throughput of jobs you should  consider
              setting  this.   In  testing,  1 second improves the slurmdbd performance dramatically and reduces
              overhead.  There is a small probability of data loss though since this creates a window  in  which
              if  the  slurmdbd  seg  faults  or exits abnormally for any reason the data not committed could be
              lost.  While this situation should be very rare, it does present an extremely small risk, but  may
              be  the  only  way to run in extremely heavy environments.  In all honesty, the risk is quite low,
              but still present.

       DbdBackupHost
              The short, or long, name of the machine where the backup Slurm Database Daemon is  executed  (i.e.
              the  name  returned  by  the  command  "hostname  -s").   This  host  must have access to the same
              underlying database specified by the 'Storage' options mentioned below.

       DbdAddr
              Name that DbdHost should be referred to in establishing a communications path. This name  will  be
              used  as  an  argument  to the gethostbyname() function for identification. For example, "elx0000"
              might be used to designate the Ethernet address for node "lx0000".  By default the DbdAddr will be
              identical in value to DbdHost.

       DbdHost
              The short, or long, name of the machine where the Slurm Database Daemon is executed (i.e. the name
              returned by the command "hostname -s").  This value must be specified.

       DbdPort
              The port number that the Slurm Database Daemon (slurmdbd) listens to for work. The  default  value
              is  SLURMDBD_PORT as established at system build time. If none is explicitly specified, it will be
              set to 6819.  This value must be equal to the AccountingStoragePort parameter  in  the  slurm.conf
              file.

       DebugFlags
              Defines specific subsystems which should provide more detailed event logging.  Multiple subsystems
              can  be  specified  with comma separators.  Most DebugFlags will result in verbose logging for the
              identified subsystems and could impact performance.  Valid subsystems available today  (with  more
              to come) include:

              DB_ARCHIVE       SQL statements/queries when dealing with archiving and purging the database.

              DB_ASSOC         SQL statements/queries when dealing with associations in the database.

              DB_EVENT         SQL statements/queries when dealing with (node) events in the database.

              DB_JOB           SQL statements/queries when dealing with jobs in the database.

              DB_QOS           SQL statements/queries when dealing with QOS in the database.

              DB_QUERY         SQL statements/queries when dealing with transactions and such in the database.

              DB_RESERVATION   SQL statements/queries when dealing with reservations in the database.

              DB_RESOURCE      SQL statements/queries when dealing with resources like licenses in the database.

              DB_STEP          SQL statements/queries when dealing with steps in the database.

              DB_USAGE         SQL  statements/queries  when  dealing  with  usage  queries  and  inserts in the
                               database.

              DB_WCKEY         SQL statements/queries when dealing with wckeys in the database.

              FEDERATION       SQL statements/queries when dealing with federations in the database.

       DebugLevel
              The level of detail to provide the Slurm Database Daemon's logs.  The default value is info.

              quiet     Log nothing

              fatal     Log only fatal errors

              error     Log only errors

              info      Log errors and general informational messages

              verbose   Log errors and verbose informational messages

              debug     Log errors and verbose informational messages and debugging messages

              debug2    Log errors and verbose informational messages and more debugging messages

              debug3    Log errors and verbose informational messages and even more debugging messages

              debug4    Log errors and verbose informational messages and even more debugging messages

              debug5    Log errors and verbose informational messages and even more debugging messages

       DebugLevelSyslog
              The slurmdbd daemon will log events to the syslog file at  the  specified  level  of  detail  (the
              LogFile  file  will include log messages at level of details specified be DebugLevel configuration
              parameter).  If the slurmdbd daemon is run in the foreground (started with  the  -D  command  line
              option)  the DebugLevelSyslog configuration parameter will be ignored.  The default value is quiet
              unless there is no configured SlurmdLogFile, in which case the default value will be  fatalo  that
              fatal errors are logged somewhere.

              quiet     Log nothing

              fatal     Log only fatal errors

              error     Log only errors

              info      Log errors and general informational messages

              verbose   Log errors and verbose informational messages

              debug     Log errors and verbose informational messages and debugging messages

              debug2    Log errors and verbose informational messages and more debugging messages

              debug3    Log errors and verbose informational messages and even more debugging messages

              debug4    Log errors and verbose informational messages and even more debugging messages

              debug5    Log errors and verbose informational messages and even more debugging messages

       DefaultQOS
              When  adding  a  new  cluster  this  will  be  used as the qos for the cluster unless something is
              explicitly set by the admin with the create.

       LogFile
              Fully qualified pathname of a file into which the Slurm Database Daemon's logs are  written.   The
              default value is none (performs logging via syslog).
              See the section LOGGING in the slurm.conf man page if a pathname is specified.

       LogTimeFormat
              Format  of  the  timestamp  in  slurmdbd  log  files. Accepted values are "iso8601", "iso8601_ms",
              "rfc5424", "rfc5424_ms", "clock", and "short". The values ending in "_ms"  differ  from  the  ones
              without  in  that  fractional seconds with millisecond precision are printed. The default value is
              "iso8601_ms". The "rfc5424" formats are the same as the "iso8601" formats except that the timezone
              value is also shown. The "clock" format shows a timestamp in microseconds  retrieved  with  the  C
              standard  clock()  function.  The  "short" format is a short date and time format. The "thread_id"
              format shows the timestamp in the C standard ctime() function form without the year but  including
              the microseconds, the daemon's process ID and the current thread ID.

       MaxQueryTimeRange
              Return an error if a query is against too large of a time span, to prevent ill-formed queries from
              causing  performance problems within SlurmDBD.  Default value is INFINITE which allows any queries
              to proceed.  User SlurmUser and root are exempt from this restriction.  Note  that  queries  which
              attempt to return over 3GB of data will still fail to complete with ESLURM_RESULT_TOO_LARGE.

       MessageTimeout
              Time permitted for a round-trip communication to complete in seconds. Default value is 10 seconds.

       PidFile
              Fully  qualified pathname of a file into which the Slurm Database Daemon may write its process ID.
              This may be used for automated signal processing.  The default value is "/var/run/slurmdbd.pid".

       PluginDir
              Identifies the places in which to look for Slurm plugins.   This  is  a  colon-separated  list  of
              directories, like the PATH environment variable.  The default value is "/usr/local/lib/slurm".

       PrivateData
              This  controls what type of information is hidden from regular users.  By default, all information
              is visible to all users.  User SlurmUser, root, and users with AdminLevel=Admin  can  always  view
              all  information.   Multiple  values  may  be specified with a comma separator.  Acceptable values
              include:

              accounts
                     prevents users from viewing any account definitions unless they are coordinators of them.

              events prevents users from viewing event information unless they have operator status or above.

              jobs   prevents users  from  viewing  job  records  belonging  to  other  users  unless  they  are
                     coordinators of the association running the job when using sacct.

              reservations
                     restricts getting reservation information to users with operator status and above.

              usage  prevents users from viewing usage of any other user.  This applys to sreport.

              users  prevents  users from viewing information of any user other than themselves, this also makes
                     it so users can only see associations they deal with.  Coordinators can see associations of
                     all users they are coordinator of, but can only see themselves when listing users.

       PurgeEventAfter
              Events happening on the cluster over this age are purged from the database.   This  includes  node
              down times and such.  The time is a numeric value and is a number of months.  If you want to purge
              more  often you can include "hours", or "days" behind the numeric value to get those more frequent
              purges (i.e. a value of "12hours" would purge everything older than 12 hours).   The  purge  takes
              place  at  the  start of the each purge interval.  For example, if the purge time is 2 months, the
              purge would happen at the beginning of each month.  If not set (default), then  job  step  records
              are never purged.

       PurgeJobAfter
              Individual job records over this age are purged from the database.  Aggregated information will be
              preserved  to  "PurgeUsageAfter".   The time is a numeric value and is a number of months.  If you
              want to purge more often you can include "hours", or "days" behind the numeric value to get  those
              more  frequent purges (i.e. a value of "12hours" would purge everything older than 12 hours).  The
              purge takes place at the start of the each purge interval.  For example, if the purge  time  is  2
              months,  the  purge  would  happen at the beginning of each month.  If not set (default), then job
              records are never purged.

       PurgeResvAfter
              Individual reservation records over this age are purged from the database.  Aggregated information
              will be preserved to "PurgeUsageAfter".  The time is a numeric value and is a  number  of  months.
              If you want to purge more often you can include "hours", or "days" behind the numeric value to get
              those more frequent purges (i.e. a value of "12hours" would purge everything older than 12 hours).
              The  purge takes place at the start of the each purge interval.  For example, if the purge time is
              2 months, the purge would happen at the beginning of each  month.   If  not  set  (default),  then
              reservation records are never purged.

       PurgeStepAfter
              Individual  job  step  records over this age are purged from the database.  Aggregated information
              will be preserved to "PurgeUsageAfter".  The time is a numeric value and is a  number  of  months.
              If you want to purge more often you can include "hours", or "days" behind the numeric value to get
              those more frequent purges (i.e. a value of "12hours" would purge everything older than 12 hours).
              The  purge takes place at the start of the each purge interval.  For example, if the purge time is
              2 months, the purge would happen at the beginning of each month.  If not set (default),  then  job
              step records are never purged.

       PurgeSuspendAfter
              Records  of  individual  suspend  times  for  jobs  over  this  age  are purged from the database.
              Aggregated information will be preserved to "PurgeUsageAfter".  The time is a numeric value and is
              a number of months.  If you want to purge more often you can include "hours", or "days" behind the
              numeric value to get those more frequent purges (i.e. a value of "12hours" would purge  everything
              older  than  12  hours).   The  purge  takes  place  at the start of the each purge interval.  For
              example, if the purge time is 2 months, the purge would happen at the beginning of each month.  If
              not set (default), then job step records are never purged.

       PurgeTXNAfter
              Records of individual transaction times for  transactions  over  this  age  are  purged  from  the
              database.  The time is a numeric value and is a number of months.  If you want to purge more often
              you  can  include  "hours",  or  "days" behind the numeric value to get those more frequent purges
              (i.e. a value of "12hours" would purge everything older than 12 hours).  The purge takes place  at
              the start of the each purge interval.  For example, if the purge time is 2 months, the purge would
              happen  at  the  beginning  of  each month.  If not set (default), then job step records are never
              purged.

       PurgeUsageAfter
              Usage Records (Cluster, Association and WCKey) over this age are purged from  the  database.   The
              time  is  a  numeric  value  and  is  a number of months.  If you want to purge more often you can
              include "hours", or "days" behind the numeric value to get those  more  frequent  purges  (i.e.  a
              value  of  "12hours"  would  purge  everything older than 12 hours).  The purge takes place at the
              start of the each purge interval.  For example, if the purge time is 2  months,  the  purge  would
              happen  at  the  beginning  of  each month.  If not set (default), then job step records are never
              purged.

       SlurmUser
              The name of the user that the slurmctld daemon executes as.  This user must exist on  the  machine
              executing  the  Slurm  Database  Daemon  and have the same user ID as the hosts on which slurmctld
              execute.  For security purposes, a user other than "root" is recommended.  The  default  value  is
              "root".

       StorageHost
              Define the name of the host the database is running where we are going to store the data.  Ideally
              this should be the host on which slurmdbd executes.

       StorageBackupHost
              Define  the  name of the backup host the database is running where we are going to store the data.
              This can be viewed as a backup solution when the StorageHost is not responding.  It is up  to  the
              backup solution to enforce the coherency of the accounting information between the two hosts. With
              clustered database solutions (active/passive HA), you would not need to use this feature.  Default
              is none.

       StorageLoc
              Specify the name of the database as the location where accounting records are written.

       StoragePass
              Define the password used to gain access to the database to store the job accounting data.

       StoragePort
              The port number that the Slurm Database Daemon (slurmdbd) communicates with the database.

       StorageType
              Define   the   accounting   storage   mechanism   type.   Acceptable  values  at  present  include
              "accounting_storage/mysql".   The  value  "accounting_storage/mysql"  indicates  that   accounting
              records  should  be  written to a MySQL or MariaDB database specified by the StorageLoc parameter.
              This value must be specified.

       StorageUser
              Define the name of the user we are going to  connect  to  the  database  with  to  store  the  job
              accounting data.

       TCPTimeout
              Time permitted for TCP connection to be established. Default value is 2 seconds.

       TrackWCKey
              Boolean  yes  or  no.  Used to set display and track of the Workload Characterization Key. Must be
              set to track wckey usage.  This must be set to generate rolled up usage tables from WCKeys.  NOTE:
              If TrackWCKey is set here and not in your various slurm.conf files all jobs will be attributed  to
              their default WCKey.

       TrackSlurmctldDown
              Boolean yes or no.  If set the slurmdbd will mark all idle resources on the cluster as down when a
              slurmctld disconnects or is no longer reachable.  The default is no.

EXAMPLE

       #
       # Sample /etc/slurmdbd.conf
       #
       ArchiveEvents=yes
       ArchiveJobs=yes
       ArchiveResvs=yes
       ArchiveSteps=no
       ArchiveSuspend=no
       ArchiveTXN=no
       ArchiveUsage=no
       #ArchiveScript=/usr/sbin/slurm.dbd.archive
       AuthInfo=/var/run/munge/munge.socket.2
       AuthType=auth/munge
       DbdHost=db_host
       DebugLevel=4
       PurgeEventAfter=1month
       PurgeJobAfter=12month
       PurgeResvAfter=1month
       PurgeStepAfter=1month
       PurgeSuspendAfter=1month
       PurgeTXNAfter=12month
       PurgeUsageAfter=24month
       LogFile=/var/log/slurmdbd.log
       PidFile=/var/tmp/jette/slurmdbd.pid
       SlurmUser=slurm_mgr
       StoragePass=shazaam
       StorageType=accounting_storage/mysql
       StorageUser=database_mgr

COPYING

       Copyright  (C)  2008-2010  Lawrence Livermore National Security.  Produced at Lawrence Livermore National
       Laboratory (cf, DISCLAIMER).
       Copyright (C) 2010-2014 SchedMD LLC.

       This   file   is   part   of   Slurm,   a   resource    management    program.     For    details,    see
       <https://slurm.schedmd.com/>.

       Slurm  is  free  software;  you  can  redistribute it and/or modify it under the terms of the GNU General
       Public License as published by the Free Software Foundation; either version 2 of the License, or (at your
       option) any later version.

       Slurm is distributed in the hope that it will be useful, but  WITHOUT  ANY  WARRANTY;  without  even  the
       implied  warranty  of  MERCHANTABILITY  or  FITNESS FOR A PARTICULAR PURPOSE.  See the GNU General Public
       License for more details.

FILES

       /etc/slurmdbd.conf

SEE ALSO

       slurm.conf(5), slurmctld(8), slurmdbd(8) syslog (2)

July 2015                                   Slurm Configuration File                            slurmdbd.conf(5)