Provided by: htcondor_8.0.5~dfsg.1-1ubuntu1_amd64 bug

Name

       condor_dagman meta - scheduler of the jobs submitted as the nodes of a DAG or DAGs

Synopsis

       condor_dagman [-debug level] [-maxidle numberOfJobs] [-maxjobs numberOfJobs] [-maxpre NumberOfPREscripts]
       [-maxpost     NumberOfPOSTscripts]     [-noeventchecks]     [-allowlogerror]    [-usedagdir]    -lockfile
       filename[-waitfordebug]      [-autorescue       0|1]       [-dorescuefrom       number]       -csdversion
       version_string[-allowversionmismatch]    [-DumpRescue]    [-verbose]   [-force]   [-notification   value]
       [-suppress_notification]   [-dont_suppress_notification]   [-dagman    DagmanExecutable]    [-outfile_dir
       directory]  [-update_submit]  [-import_env]  [-DontAlwaysRunPost]  -dag  dag_file[-dag dag_file_2... -dag
       dag_file_n]

Description

       condor_dagman is a meta scheduler for the HTCondor  jobs  within  a  DAG  (directed  acyclic  graph)  (or
       multiple DAGs). In typical usage, a submitter of jobs that are organized into a DAG submits the DAG using
       condor_submit_dag.  condor_submit_dagdoes  error  checking  on  aspects  of  the  DAG  and  then  submits
       condor_dagman as an HTCondor job. condor_dagman uses log files to coordinate the  further  submission  of
       the jobs within the DAG.

       All command line arguments to the DaemonCorelibrary functions work for condor_dagman.

       Arguments  to  condor_dagmanare  either  automatically  set  by condor_submit_dagor they are specified as
       command-line arguments to condor_submit_dagand passed on  to  condor_dagman.  The  method  by  which  the
       arguments are set is given in their description below.

       condor_dagmancan run multiple, independent DAGs. This is done by specifying multiple -dag arguments. Pass
       multiple DAG input files as command-line arguments to condor_submit_dag.

       Debugging  output  may be obtained by using the -debug leveloption. Level values and what they produce is
       described as

          * level = 0; never produce output, except for usage info

          * level = 1; very quiet, output severe errors

          * level = 2; normal output, errors and warnings

          * level = 3; output errors, as well as all warnings

          * level = 4; internal debugging output

          * level = 5; internal debugging output; outer loop debugging

          * level = 6; internal debugging output; inner loop debugging; output DAG input file lines as they  are
          parsed

          * level = 7; internal debugging output; rarely used; output DAG input file lines as they are parsed

Options

       -debug level

          An integer level of debugging output. levelis an integer, with values of 0-7 inclusive, where 7 is the
          most  verbose  output.  This  command-line  option  to  condor_submit_dagis passed to condor_dagman or
          defaults to the value 3.

       -maxidle NumberOfJobs

          Sets the maximum number of idle jobs allowed before condor_dagman stops submitting more jobs.  If  DAG
          nodes  have  a  cluster with more than one job in it, each job in the cluster is counted individually.
          Once idle jobs start to run, condor_dagman will resume  submitting  jobs.  NumberOfJobsis  a  positive
          integer.  This  command-line option to condor_submit_dagis passed to condor_dagman . If not specified,
          the number of idle jobs is unlimited. Note that nothing special is  done  to  the  submit  description
          file. Setting  queue 5000 in the submit description file, where -maxidleis set to 250 will result in a
          cluster of 5000 new jobs being submitted to the condor_schedd. In this case, condor_dagman will resume
          submitting jobs when the number of idle jobs falls below 250.

       -maxjobs numberOfJobs

          Sets  the  maximum  number  of clusters within the DAG that will be submitted to HTCondor at one time.
          numberOfJobsis  a  positive  integer.  This  command-line  option  to  condor_submit_dagis  passed  to
          condor_dagman  .  If not specified, the default number of clusters is unlimited. If a cluster contains
          more than one job, only the cluster is counted for purposes of maxjobs.

       -maxpre NumberOfPREscripts

          Sets  the  maximum  number  of  PRE  scripts  within  the  DAG  that  may  be  running  at  one  time.
          NumberOfPREScriptsis  a  positive  integer.  This command-line option to condor_submit_dagis passed to
          condor_dagman . If not specified, the default number of PRE scripts is unlimited.

       -maxpost NumberOfPOSTscripts

          Sets the  maximum  number  of  POST  scripts  within  the  DAG  that  may  be  running  at  one  time.
          NumberOfPOSTScriptsis  a  positive  integer. This command-line option to condor_submit_dagis passed to
          condor_dagman . If not specified, the default number of POST scripts is unlimited.

       -noeventchecks

          This argument is no longer used; it is now ignored.  Its  functionality  is  now  implemented  by  the
          DAGMAN_ALLOW_EVENTS configuration variable.

       -allowlogerror

          This  optional  argument  has condor_dagman try to run the specified DAG, even in the case of detected
          errors in the job event log specification. As of version 7.3.2, this argument has an  effect  only  on
          DAGs containing Stork job nodes.

       -usedagdir

          This  optional  argument causes condor_dagman to run each specified DAG as if the directory containing
          that DAG file was the current working directory. This option is most useful when running multiple DAGs
          in a single condor_dagman .

       -lockfile filename

          Names the file created and used as a lock file. The lock file prevents execution of two  of  the  same
          DAG,  as  defined by a DAG input file. A default lock file ending with the suffix  .dag.lock is passed
          to condor_dagman by condor_submit_dag.

       -waitfordebug

          This optional argument causes condor_dagman to wait at startup until someone attaches to  the  process
          with a debugger and sets the wait_for_debug variable in main_init() to false.

       -autorescue 0|1

          Whether to automatically run the newest rescue DAG for the given DAG file, if one exists (0 =  false ,
          1 =  true ).

       -dorescuefrom number

          Forces  condor_dagman  to  run  the specified rescue DAG number for the given DAG. A value of 0 is the
          same as not specifying this option. Specifying a nonexistent rescue DAG is a fatal error.

       -csdversion version_string

          version_stringis the version of the condor_submit_dagprogram. At startup, condor_dagman checks  for  a
          version mismatch with the condor_submit_dagversion in this argument.

       -allowversionmismatch

          This  optional  argument causes condor_dagman to allow a version mismatch between condor_dagman itself
          and  the    .condor.sub   file   produced   by   condor_submit_dag(or,   in   other   words,   between
          condor_submit_dagand  condor_dagman  ).  WARNING!  This  option  should  be  used  only  if absolutely
          necessary. Allowing version mismatches can cause  subtle  problems  when  running  DAGs.  (Note  that,
          starting  with  version  7.4.0, condor_dagman no longer requires an exact version match between itself
          and the  .condor.sub file. Instead, a "minimum compatible version" is defined,  and  any   .condor.sub
          file of that version or newer is accepted.)

       -DumpRescue

          This optional argument causes condor_dagman to immediately dump a Rescue DAG and then exit, as opposed
          to  actually  running  the  DAG.  This  feature is mainly intended for testing. The Rescue DAG file is
          produced whether or not there are parse errors reading the original DAG input file. The  name  of  the
          file differs if there was a parse error.

       -verbose

          (This  argument  is  included  only to be passed to condor_submit_dagif lazy submit file generation is
          used for nested DAGs.) Cause condor_submit_dagto give verbose error messages.

       -force

          (This argument is included only to be passed to condor_submit_dagif lazy  submit  file  generation  is
          used  for nested DAGs.) Require condor_submit_dagto overwrite the files that it produces, if the files
          already exist. Note that  dagman.out will be appended to, not overwritten.  If  new-style  rescue  DAG
          mode  is in effect, and any new-style rescue DAGs exist, the -forceflag will cause them to be renamed,
          and the original DAG will be run. If old-style rescue DAG mode is in effect,  any  existing  old-style
          rescue  DAGs  will  be  deleted,  and the original DAG will be run. See the HTCondor manual section on
          Rescue DAGs for more information.

       -notification value

          This argument is only included to be passed to condor_submit_dagif lazy submit file generation is used
          for nested DAGs. Sets the e-mail notification for DAGMan itself. This information will be used  within
          the  HTCondor  submit  description  file  for  DAGMan. This file is produced by condor_submit_dag. The
          notificationoption is described in the condor_submitmanual page.

       -dagman DagmanExecutable

          (This argument is included only to be passed to condor_submit_dagif lazy  submit  file  generation  is
          used  for  nested  DAGs.) Allows the specification of an alternate condor_dagman executable to be used
          instead of the one found in the user's path. This must be a fully qualified path.

       -outfile_dir directory

          (This argument is included only to be passed to condor_submit_dagif lazy  submit  file  generation  is
          used  for  nested  DAGs.)  Specifies the directory in which the  .dagman.out file will be written. The
          directorymay be specified relative to the current working directory as  condor_submit_dagis  executed,
          or  specified  with an absolute path. Without this option, the  .dagman.out file is placed in the same
          directory as the first DAG input file listed on the command line.

       -update_submit

          (This argument is included only to be passed to condor_submit_dagif lazy  submit  file  generation  is
          used  for  nested DAGs.) This optional argument causes an existing  .condor.sub file to not be treated
          as an error; rather, the  .condor.sub file will be overwritten, but the existing values  of  -maxjobs,
          -maxidle, -maxpre, and -maxpostwill be preserved.

       -import_env

          (This  argument  is  included  only to be passed to condor_submit_dagif lazy submit file generation is
          used  for  nested  DAGs.)  This  optional  argument  causes  condor_submit_dagto  import  the  current
          environment into the environmentcommand of the  .condor.sub file it generates.

       -dag filename

          filenameis  the name of the DAG input file that is set as an argument to condor_submit_dag, and passed
          to condor_dagman .

       -DontAlwaysRunPost

          This option causes condor_dagman to observe the exit status of the PRE script when deciding whether or
          not to run the POST script. Versions of condor_dagman previous to HTCondor version 7.7.2 would not run
          the POST script if the PRE script exited with a nonzero status, but this default has been changed such
          that the POST script will run, regardless of the exit status of the  PRE  script.  Using  this  option
          restores  the previous behavior, in which condor_dagman will not run the POST script if the PRE script
          fails.

       -suppress_notification

          Causes jobs submitted by condor_dagman to not send email notification for events. The same effect  can
          be  achieved  by  setting  the  configuration  variable   DAGMAN_SUPPRESS_NOTIFICATION to  True . This
          command  line  option  is  independent  of  the  -notificationcommand  line  option,  which   controls
          notification   for   the   condor_dagman   job   itself.   This  flag  is  generally  superfluous,  as
          DAGMAN_SUPPRESS_NOTIFICATION defaults to  True .

       -dont_suppress_notification

          Causes jobs submitted by condor_dagman to defer to content within the  submit  description  file  when
          deciding  to  send  email  notification  for  events.  The  same effect can be achieved by setting the
          configuration variable  DAGMAN_SUPPRESS_NOTIFICATION to  False . This command line flag is independent
          of the -notificationcommand line option, which controls notification for the condor_dagman job itself.
          If both -dont_suppress_notificationand -suppress_notificationare specified  within  the  same  command
          line, the last argument is used.

Exit Status

       condor_dagmanwill  exit  with  a status value of 0 (zero) upon success, and it will exit with the value 1
       (one) upon failure.

Examples

       condor_dagmanis normally not run directly, but submitted as an HTCondor job by running condor_submit_dag.
       See the condor_submit_dag manual page for examples.

Author

       Center for High Throughput Computing, University of Wisconsin-Madison

Copyright

       Copyright (C) 1990-2013 Center for High Throughput Computing, Computer Sciences Department, University of
       Wisconsin-Madison, Madison, WI. All Rights Reserved. Licensed under the Apache License, Version 2.0.

                                                  January 2014                                  condor_dagman(1)