Provided by: watchdog_5.15-2_amd64 bug

NAME

       watchdog.conf - configuration file for the watchdog daemon

DESCRIPTION

       This file carries all configuration options for the Linux watchdog daemon.  Each option has to be written
       on  a  line  for itself. Comments start with '#'.  Blanks are ignored except after the '=' sign. An empty
       text after the '=' sign disables the feature as long as that makes sense.

OPTIONS

       interval = <interval>
              Set the highest possible interval between two writes  to  the  watchdog  device.   The  device  is
              triggered  after  each  check  regardless of the time it took. After finishing all checks watchdog
              goes to sleep for a full cycle of <interval> seconds.  Default  value  is  1  second.  The  kernel
              drivers expects a write command every minute. Otherwise the system will be rebooted.  Therefore an
              interval of more than a minute can only be used with the force command-line option [--force | -f].

       logtick = <logtick>
              If  you  enable  verbose logging, a message is written into the syslog or a logfile. While this is
              nice, it is not necessary to get a message every interval which really fills  up  disk  and  needs
              CPU. logtick allows adjustment of the number of intervals skipped before a log message is written.
              If  you  use  logtick  =  60  and  interval = 10, only every 10 minutes (600 seconds) a message is
              written. This may make the exact time of a crash harder to find but greatly reduces disk usage and
              administrator nerves if you're looking for a  particular  syslog  entry  in  between  of  watchdog
              messages.

       max-load-1 = <load1>
              Set  the  maximal  allowed load average for a 1 minute span. Once this load average is reached the
              system is rebooted. Default value is 0. That means the load average check is disabled. Be  careful
              not to set this parameter too low. To set a value less then the predefined minimal value of 2, you
              have to use the -f command line option.

       max-load-5 = <load5>
              Set  the  maximal  allowed load average for a 5 minute span. Once this load average is reached the
              system is rebooted. Default value is 3/4*max-load-1.  Be careful not to this parameter too low. To
              set a value less then the predefined minimal value of 2, you have  to  use  the  -f  command  line
              option.

       max-load-15 = <load15>
              Set  the  maximal allowed load average for a 15 minute span. Once this load average is reached the
              system is rebooted. Default value is 1/2*max-load-1.  Be careful not to this parameter too low. To
              set a value less then the predefined minimal value of 2, you have  to  use  the  -f  command  line
              option.

       min-memory = <minpage>
              Set  the minimal amount of virtual memory that has to stay free. Note that this is in memory pages
              (4kB on x86). Default value is 0 pages which means this test is disabled. The page size  is  taken
              from the system include files.  This is a 'passive' test and works by reading /proc/meminfo

       allocatable-memory = <minpage>
              Set the minimum amount of allocatable memory available on the system.  Note that this is in pages.
              Default  value  is 0 pages which means the test is disabled.  As with min-memory, the page size is
              taken from the system include files. This is an 'active'  test  and  it  works  by  attempting  to
              memory-map a block of the configured size.

       watchdog-device = <device>
              Set  the  watchdog device name, typically /dev/watchdog. Default is to disable keep alive support.
              This should be tested by running the daemon from the command line before configuring it  to  start
              automatically on booting.

       watchdog-timeout = <timeout>
              Set  the watchdog device timeout during startup.  If not set, a default is used that should be set
              to the kernel timer margin at compile time.

       temperature-sensor = <temp-virtual-file>
              Set the temperature sensor name. This is normally a 'virtual file' under /sys and it contains  the
              temperature in milli-Celsius. Usually these are generated by the sensors package, but take care as
              device  enumeration may not be fixed. Default is to disable temperature checking. Multiple sensors
              can be used by having repeated temperature-sensor entries.

       max-temperature = <temp>
              Set the maximal allowed temperature. Once this temperature  is  reached  the  system  is  stopped.
              Default  value  is  90 C. Watchdog will issue warnings once the temperature increases 90%, 95% and
              98% of this temperature.

       temp-power-off = <yes|no>
              Set the watchdog action on overheating. Yes option (default) is  to  power  the  machine  off,  no
              option is to halt machine and allow Ctrl-Alt-Del reboot.

       file = <filename>
              Set  file  name  for  file  mode.   This option can be given as often as you like to check several
              files.

       change = <mtime>
              Set the change interval time for file mode. This options always belongs to  the  active  filename,
              that is when finding a 'change =' line watchdog assumes it belongs to the most recently read 'file
              ='  line.   They  don't  necessarily  have to follow each other directly. But you cannot specify a
              'change =' before a 'file ='.  The default is to only stat the file and don't  look  for  changes.
              Using  this  feature  to  monitor  changes  in /var/log/messages might require some special syslog
              daemon configuration, e.g. rsyslog needs "$ActionWriteAllMarkMessages on" to be set to  make  sure
              the marks are written no matter what.

       pidfile = <pidfilename>
              Set  pidfile  name  for  server test mode.  This option can be given as often as you like to check
              several servers.

       ping = <ip-addr>
              Set IPv4 address for ping mode.  This option can  be  used  more  than  once  to  check  different
              connections.

       interface = <if-name>
              Set  interface  name  for network mode.  This option can be used more than once to check different
              interfaces. Note it is only possible to check physical interfaces, and not aliased IP interfaces.

       test-binary = <testbin>
              Execute the given binary to do some user defined tests.

       test-timeout = <timeout in seconds>
              User defined tests may only run for <timeout> seconds. Set to 0 for unlimited.

       repair-binary = <repbin>
              Execute the given binary in case of a problem instead of shutting down the system.

       repair-timeout = <timeout in seconds>
              repair command may only run for <timeout> seconds. Set to 0 for 'unlimited',  but  note  that  the
              hardware timer is not refreshed in this case so the system will hard-reset at some point.

       retry-timeout = <timeout in seconds>
              Allow  most error conditions to persist for <timeout> seconds. Set to 0 for immediate action (like
              softboot behaviour).

       repair-maximum = <count>
              This allows no more then <count> repair attempts against a given fault that report  success  (i.e.
              return 0), but fail to clear the fault, before a reboot is initiated anyway. If set to zero then a
              repairable  fault  can  always  be  blocked by a repair program reporting success (previous daemon
              behaviour).

       admin = <mail-address>
              Email address to send admin mail to. That is, who shall be notified  that  the  machine  is  being
              halted  or  rebooted.  Default  is  'root'. If you want to disable notification via email just set
              admin to en empty string.

       realtime = <yes|no>
              If set to yes watchdog will lock itself into memory so it is never swapped out.

       priority = <schedule priority>
              Set the schedule priority for realtime mode.

       test-directory = <test directory>
              Set the directory to run user test/repair scripts.  Default  is  '/etc/watchdog.d'  See  the  Test
              Directory section in watchdog(8) for more information.

       log-dir = <log directory>
              Set  the  log  directory  to capture the standard output and standard error from repair-binary and
              test-binary execution. Default is '/var/log/watchdog'.

       sigterm-delay = <time in seconds>
              Set the time on shut down between first  sending  SIGTERM  to  all  processes,  and  then  sending
              SIGKILL.  Default  is  5  seconds  which  is generally enough, but systems with large databases or
              virtual machines might need longer.

       verbose = <yes|no>
              This overrides the command line --verbose option. Generally the verbose mode is only  enabled  for
              debugging as it creates a lot of syslog chatter, so use this option with consideration.

FILES

       /etc/watchdog.conf
              The watchdog configuration file

       /etc/watchdog.d
              A  directory containing test-or-repair commands. See the Test Directory section in watchdog(8) for
              more information.

SEE ALSO

       watchdog(8)

4th Berkeley Distribution                         January 2016                                  WATCHDOG.CONF(5)