Provided by: watchdog_5.15-2_amd64 bug

NAME

       watchdog.conf - configuration file for the watchdog daemon

DESCRIPTION

       This  file  carries  all configuration options for the Linux watchdog daemon.  Each option
       has to be written on a line for itself. Comments  start  with  '#'.   Blanks  are  ignored
       except  after  the '=' sign. An empty text after the '=' sign disables the feature as long
       as that makes sense.

OPTIONS

       interval = <interval>
              Set the highest possible interval between two writes to the watchdog  device.   The
              device  is  triggered  after  each  check  regardless  of  the  time it took. After
              finishing all checks watchdog goes to sleep for a full cycle of <interval> seconds.
              Default value is 1 second. The kernel drivers expects a write command every minute.
              Otherwise the system will be rebooted.  Therefore an interval of more than a minute
              can only be used with the force command-line option [--force | -f].

       logtick = <logtick>
              If  you  enable verbose logging, a message is written into the syslog or a logfile.
              While this is nice, it is not necessary to  get  a  message  every  interval  which
              really  fills  up  disk  and  needs CPU. logtick allows adjustment of the number of
              intervals skipped before a log message is written. If you  use  logtick  =  60  and
              interval  =  10, only every 10 minutes (600 seconds) a message is written. This may
              make the exact time of a crash harder to find but greatly reduces  disk  usage  and
              administrator  nerves if you're looking for a particular syslog entry in between of
              watchdog messages.

       max-load-1 = <load1>
              Set the maximal allowed load average for a 1 minute span. Once this load average is
              reached  the  system  is  rebooted. Default value is 0. That means the load average
              check is disabled. Be careful not to set this parameter too low.  To  set  a  value
              less  then  the  predefined minimal value of 2, you have to use the -f command line
              option.

       max-load-5 = <load5>
              Set the maximal allowed load average for a 5 minute span. Once this load average is
              reached the system is rebooted. Default value is 3/4*max-load-1.  Be careful not to
              this parameter too low. To set a value less then the predefined minimal value of 2,
              you have to use the -f command line option.

       max-load-15 = <load15>
              Set  the  maximal allowed load average for a 15 minute span. Once this load average
              is reached the system is rebooted. Default value is 1/2*max-load-1.  Be careful not
              to this parameter too low. To set a value less then the predefined minimal value of
              2, you have to use the -f command line option.

       min-memory = <minpage>
              Set the minimal amount of virtual memory that has to stay free. Note that  this  is
              in  memory  pages  (4kB  on x86). Default value is 0 pages which means this test is
              disabled. The page size is  taken  from  the  system  include  files.   This  is  a
              'passive' test and works by reading /proc/meminfo

       allocatable-memory = <minpage>
              Set  the  minimum  amount of allocatable memory available on the system.  Note that
              this is in pages.  Default value is 0 pages which means the test is  disabled.   As
              with  min-memory,  the page size is taken from the system include files. This is an
              'active' test and it works by attempting to memory-map a block  of  the  configured
              size.

       watchdog-device = <device>
              Set  the  watchdog device name, typically /dev/watchdog. Default is to disable keep
              alive support. This should be tested by running the daemon from  the  command  line
              before configuring it to start automatically on booting.

       watchdog-timeout = <timeout>
              Set the watchdog device timeout during startup.  If not set, a default is used that
              should be set to the kernel timer margin at compile time.

       temperature-sensor = <temp-virtual-file>
              Set the temperature sensor name. This is normally a 'virtual file' under  /sys  and
              it  contains  the  temperature in milli-Celsius. Usually these are generated by the
              sensors package, but take care as device enumeration may not be fixed.  Default  is
              to  disable  temperature  checking. Multiple sensors can be used by having repeated
              temperature-sensor entries.

       max-temperature = <temp>
              Set the maximal allowed temperature. Once this temperature is reached the system is
              stopped.  Default  value is 90 C. Watchdog will issue warnings once the temperature
              increases 90%, 95% and 98% of this temperature.

       temp-power-off = <yes|no>
              Set the watchdog action on overheating.  Yes  option  (default)  is  to  power  the
              machine off, no option is to halt machine and allow Ctrl-Alt-Del reboot.

       file = <filename>
              Set  file  name  for  file  mode.  This option can be given as often as you like to
              check several files.

       change = <mtime>
              Set the change interval time for file mode. This  options  always  belongs  to  the
              active filename, that is when finding a 'change =' line watchdog assumes it belongs
              to the most recently read 'file =' line.  They don't  necessarily  have  to  follow
              each  other  directly.  But you cannot specify a 'change =' before a 'file ='.  The
              default is to only stat the file and don't look for changes.  Using this feature to
              monitor  changes  in  /var/log/messages  might  require  some special syslog daemon
              configuration, e.g. rsyslog needs "$ActionWriteAllMarkMessages on"  to  be  set  to
              make sure the marks are written no matter what.

       pidfile = <pidfilename>
              Set  pidfile  name  for server test mode.  This option can be given as often as you
              like to check several servers.

       ping = <ip-addr>
              Set IPv4 address for ping mode.  This option can be used more than  once  to  check
              different connections.

       interface = <if-name>
              Set  interface  name  for  network mode.  This option can be used more than once to
              check different interfaces. Note it is only possible to check physical  interfaces,
              and not aliased IP interfaces.

       test-binary = <testbin>
              Execute the given binary to do some user defined tests.

       test-timeout = <timeout in seconds>
              User defined tests may only run for <timeout> seconds. Set to 0 for unlimited.

       repair-binary = <repbin>
              Execute the given binary in case of a problem instead of shutting down the system.

       repair-timeout = <timeout in seconds>
              repair  command  may  only run for <timeout> seconds. Set to 0 for 'unlimited', but
              note that the hardware timer is not refreshed in this case so the system will hard-
              reset at some point.

       retry-timeout = <timeout in seconds>
              Allow  most  error  conditions  to  persist  for  <timeout>  seconds.  Set to 0 for
              immediate action (like softboot behaviour).

       repair-maximum = <count>
              This allows no more then <count> repair attempts against a given fault that  report
              success  (i.e. return 0), but fail to clear the fault, before a reboot is initiated
              anyway. If set to zero then a repairable fault can always be blocked  by  a  repair
              program reporting success (previous daemon behaviour).

       admin = <mail-address>
              Email  address  to  send  admin  mail  to.  That is, who shall be notified that the
              machine is being halted or rebooted. Default is 'root'.  If  you  want  to  disable
              notification via email just set admin to en empty string.

       realtime = <yes|no>
              If set to yes watchdog will lock itself into memory so it is never swapped out.

       priority = <schedule priority>
              Set the schedule priority for realtime mode.

       test-directory = <test directory>
              Set  the  directory  to run user test/repair scripts.  Default is '/etc/watchdog.d'
              See the Test Directory section in watchdog(8) for more information.

       log-dir = <log directory>
              Set the log directory to capture  the  standard  output  and  standard  error  from
              repair-binary and test-binary execution. Default is '/var/log/watchdog'.

       sigterm-delay = <time in seconds>
              Set  the time on shut down between first sending SIGTERM to all processes, and then
              sending SIGKILL. Default is 5 seconds which is generally enough, but  systems  with
              large databases or virtual machines might need longer.

       verbose = <yes|no>
              This  overrides  the  command  line --verbose option. Generally the verbose mode is
              only enabled for debugging as it creates a lot  of  syslog  chatter,  so  use  this
              option with consideration.

FILES

       /etc/watchdog.conf
              The watchdog configuration file

       /etc/watchdog.d
              A  directory  containing test-or-repair commands. See the Test Directory section in
              watchdog(8) for more information.

SEE ALSO

       watchdog(8)