Provided by: condor_23.6.2+dfsg-2_amd64 bug

NAME

       condor_dagman - HTCondor Manual

       meta scheduler of the jobs submitted as the nodes of a DAG or DAGs

SYNOPSIS

       condor_dagman -f -t -l . -help

       condor_dagman -version

       condor_dagman  -f  -l  .  -csdversion  version_string  [-debug  level] [-dryrun] [-maxidle
       numberOfProcs]   [-maxjobs   numberOfJobs]    [-maxpre    NumberOfPreScripts]    [-maxpost
       NumberOfPostScripts]  [-maxhold  NumberOfHoldScripts]  [-usedagdir  ]  -lockfile  filename
       [-waitfordebug  ]  [-autorescue  0|1]   [-dorescuefrom   number]   [-load_save   filename]
       [-allowversionmismatch  ]  [-DumpRescue  ]  [-verbose  ]  [-force  ] [-notification value]
       [-suppress_notification  ]  [-dont_suppress_notification  ]   [-dagman   DagmanExecutable]
       [-outfile_dir   directory]  [-update_submit  ]  [-import_env  ]  [-include_env  Variables]
       [-insert_env  Key=Value]  [-priority  number]  [-DontAlwaysRunPost  ]  [-AlwaysRunPost   ]
       [-DoRecovery ] [-dot] -dag dag_file [-dag dag_file_2 ... -dag dag_file_n ]

DESCRIPTION

       condor_dagman  is  a  meta  scheduler for the HTCondor jobs within a DAG (directed acyclic
       graph) (or multiple DAGs). In typical usage, a submitter of jobs that are organized into a
       DAG  submits  the  DAG  using  condor_submit_dag. condor_submit_dag does error checking on
       aspects of the DAG and then submits condor_dagman as an HTCondor job.  condor_dagman  uses
       log files to coordinate the further submission of the jobs within the DAG.

       All  command  line  arguments  to the DaemonCore library functions work for condor_dagman.
       When invoked from the command line, condor_dagman requires the arguments -f -l . to appear
       first  on  the  command  line,  to be processed by DaemonCore. The csdversion must also be
       specified;  at  start  up,  condor_dagman  checks  for  a  version   mismatch   with   the
       condor_submit_dag  version  in this argument. The -t argument must also be present for the
       -help option, such that output is sent to the terminal.

       Arguments to condor_dagman are either automatically set by condor_submit_dag or  they  are
       specified  as  command-line arguments to condor_submit_dag and passed on to condor_dagman.
       The method by which the arguments are set is given in their description below.

       condor_dagman can run multiple, independent DAGs. This is done by specifying multiple -dag
       a rguments. Pass multiple DAG input files as command-line arguments to condor_submit_dag.

       Debugging  output  may be obtained by using the -debug level option. Level values and what
       they produce is described as

       • level = 0; never produce output, except for usage info

       • level = 1; very quiet, output severe errors

       • level = 2; normal output, errors and warnings

       • level = 3; output errors, as well as all warnings

       • level = 4; internal debugging output

       • level = 5; internal debugging output; outer loop debugging

       • level = 6; internal debugging output; inner loop debugging; output DAG input file  lines
         as they are parsed

       • level  =  7; internal debugging output; rarely used; output DAG input file lines as they
         are parsed

OPTIONS

          -help  Display usage information and exit.

          -version
                 Display version information and exit.

          -csdversion VersionString
                 Sets the  version  of  condor_submit_dag  command  used  to  submit  the  DAGMan
                 workflow. Used to help identify version mismatching.

          -debug level
                 An  integer  level  of debugging output. level is an integer, with values of 0-7
                 inclusive, where 7 is the most  verbose  output.  This  command-line  option  to
                 condor_submit_dag is passed to condor_dagman or defaults to the value 3.

          -dryrun
                 Inform  DAGMan  to  do  a  dry  run.  Where the DAG is ran but node jobs are not
                 actually submitted.

          -maxidle NumberOfProcs
                 Sets the maximum  number  of  idle  procs  allowed  before  condor_dagman  stops
                 submitting  more  node  jobs.  If this option is omitted then the number of idle
                 procs is  limited  by  the  configuration  variable  DAGMAN_MAX_JOBS_IDLE  which
                 defaults  to  1000.   To  disable  this  limit,  set  NumberOfProcs  to  0.  The
                 NumberOfProcs can be exceeded if a nodes job has a queue command with more  than
                 one proc to queue. i.e. queue 500 will submit all procs even if NumberOfProcs is
                 250. In this case DAGMan will wait for for the number  of  idle  procs  to  fall
                 below 250 before submitting more jobs to the condor_schedd.

          -maxjobs NumberOfClusters
                 Sets  the  maximum  number  of clusters within the DAG that will be submitted to
                 HTCondor at one time. Each cluster is associated with one node job no matter how
                 many  individual  procs  are in the cluster.  NumberOfClusters is a non-negative
                 integer. If this option is omitted then the number of clusters is limited by the
                 configuration   variable   DAGMAN_MAX_JOBS_SUBMITTED   which   defaults   to   0
                 (unlimited).

          -maxpre NumberOfPreScripts
                 Sets the maximum number of PRE scripts within the DAG that may be running at one
                 time.  NumberOfPreScripts is a non-negative integer.  If this option is omitted,
                 the  number  of  PRE  scripts  is  limited   by   the   configuration   variable
                 DAGMAN_MAX_PRE_SCRIPTS which defaults to 20.

          -maxpost NumberOfPostScripts
                 Sets  the  maximum  number of POST scripts within the DAG that may be running at
                 one time. NumberOfPostScripts is a  non-negative  integer.  If  this  option  is
                 omitted,  the  number  of  POST scripts is limited by the configuration variable
                 DAGMAN_MAX_POST_SCRIPTS which defaults to 20.

          -maxhold NumberOfHoldScripts
                 Sets the maximum number of HOLD scripts within the DAG that may  be  running  at
                 one  time.  NumberOfHoldscripts  is  a  non-negative integer.  If this option is
                 omitted, the number of HOLD scripts is limited  by  the  configuration  variable
                 DAGMAN_MAX_HOLD_SCRIPTS, which defaults to 0 (unlimited).

          -usedagdir
                 This  optional argument causes condor_dagman to run each specified DAG as if the
                 directory containing that DAG file  was  the  current  working  directory.  This
                 option is most useful when running multiple DAGs in a single condor_dagman.

          -lockfile filename
                 Names the file created and used as a lock file. The lock file prevents execution
                 of two of the same DAG, as defined by a DAG input  file.  A  default  lock  file
                 ending   with   the   suffix   .dag.lock   is   passed   to   condor_dagman   by
                 condor_submit_dag.

          -waitfordebug
                 This optional argument causes condor_dagman to wait  at  startup  until  someone
                 attaches  to the process with a debugger and sets the wait_for_debug variable in
                 main_init() to false.

          -autorescue 0|1
                 Whether to automatically run the newest rescue DAG for the given  DAG  file,  if
                 one exists (0 = false, 1 = true).

          -dorescuefrom number
                 Forces condor_dagman to run the specified rescue DAG number for the given DAG. A
                 value of 0 is the same as not specifying this option. Specifying  a  nonexistent
                 rescue DAG is a fatal error.

          -load_save filename
                 Specify  a  file with saved DAG progress to re-run the DAG from. If given a path
                 DAGMan will attempt to read that file following  that  path.  Otherwise,  DAGMan
                 will check for the file in the DAG's save_files sub-directory.

          -allowversionmismatch
                 This  optional argument causes condor_dagman to allow a version mismatch between
                 condor_dagman itself and the .condor.sub file produced by condor_submit_dag (or,
                 in  other  words,  between  condor_submit_dag  and condor_dagman). WARNING! This
                 option should be used only if absolutely necessary. Allowing version  mismatches
                 can cause subtle problems when running DAGs.

          -DumpRescue
                 This optional argument causes condor_dagman to immediately dump a Rescue DAG and
                 then exit, as opposed to actually running  the  DAG.   This  feature  is  mainly
                 intended  for  testing. The Rescue DAG file is produced whether or not there are
                 parse errors reading the original DAG input file. The name of the  file  differs
                 if there was a parse error.

          -verbose
                 (This argument is included only to be passed to condor_submit_dag if lazy submit
                 file generation is used  for  nested  DAGs.)  Cause  condor_submit_dag  to  give
                 verbose error messages.

          -force (This argument is included only to be passed to condor_submit_dag if lazy submit
                 file generation is used for nested DAGs.) Require condor_submit_dag to overwrite
                 the  files  that  it  produces, if the files already exist. Note that dagman.out
                 will be appended to, not overwritten. If new-style rescue DAG mode is in effect,
                 and  any  new-style  rescue  DAGs  exist,  the -force flag will cause them to be
                 renamed, and the original DAG will be run. If old-style rescue DAG  mode  is  in
                 effect, any existing old-style rescue DAGs will be deleted, and the original DAG
                 will  be  run.  See  the  HTCondor  manual  section  on  Rescue  DAGs  for  more
                 information.

          -notification value
                 This  argument is only included to be passed to condor_submit_dag if lazy submit
                 file generation is used for nested DAGs. Sets the e-mail notification for DAGMan
                 itself.  This  information  will  be used within the HTCondor submit description
                 file for DAGMan. This file is produced by  condor_submit_dag.  The  notification
                 option is described in the condor_submit manual page.

          -suppress_notification
                 Causes  jobs  submitted  by  condor_dagman  to  not  send email notification for
                 events. The same effect can be achieved by setting  the  configuration  variable
                 DAGMAN_SUPPRESS_NOTIFICATION to True. This command line option is independent of
                 the -notification command line  option,  which  controls  notification  for  the
                 condor_dagman   job   itself.   This   flag   is   generally   superfluous,   as
                 DAGMAN_SUPPRESS_NOTIFICATION defaults to True.

          -dont_suppress_notification
                 Causes jobs submitted by condor_dagman to defer to  content  within  the  submit
                 description  file  when deciding to send email notification for events. The same
                 effect   can   be   achieved   by    setting    the    configuration    variable
                 DAGMAN_SUPPRESS_NOTIFICATION  to False. This command line flag is independent of
                 the -notification command line  option,  which  controls  notification  for  the
                 condor_dagman    job    itself.    If   both   -dont_suppress_notification   and
                 -suppress_notification are specified within the  same  command  line,  the  last
                 argument is used.

          -dagman DagmanExecutable
                 (This argument is included only to be passed to condor_submit_dag if lazy submit
                 file generation is used  for  nested  DAGs.)  Allows  the  specification  of  an
                 alternate  condor_dagman  executable  to be used instead of the one found in the
                 user's path. This must be a fully qualified path.

          -outfile_dir directory
                 (This argument is included only to be passed to condor_submit_dag if lazy submit
                 file  generation  is used for nested DAGs.) Specifies the directory in which the
                 .dagman.out file will be written. The directory may be specified relative to the
                 current working directory as condor_submit_dag is executed, or specified with an
                 absolute path. Without this option, the .dagman.out file is placed in  the  same
                 directory as the first DAG input file listed on the command line.

          -update_submit
                 (This argument is included only to be passed to condor_submit_dag if lazy submit
                 file generation is used for nested  DAGs.)  This  optional  argument  causes  an
                 existing .condor.sub file to not be treated as an error; rather, the .condor.sub
                 file will be  overwritten,  but  the  existing  values  of  -maxjobs,  -maxidle,
                 -maxpre, and -maxpost will be preserved.

          -import_env
                 (This argument is included only to be passed to condor_submit_dag if lazy submit
                 file generation  is  used  for  nested  DAGs.)  This  optional  argument  causes
                 condor_submit_dag to import the current environment into the environment command
                 of the .condor.sub file it generates.

          -include_env Variables
                 This optional argument takes a comma separated list of enviroment  variables  to
                 add  to  .condor.sub  getenv  environment  filter  which  causes  found matching
                 environment variables to be added to the DAGMan manager jobs environment.

          -insert_env Key=Value
                 This optional argument takes a delimited string of Key=Value pairs to explicitly
                 set  into  the  .condor.sub  files  Environment  macro.  The base delimiter is a
                 semicolon that can be overriden by setting the first character in the string  to
                 a  valid  delimiting  character.  If multiple -insert_env flags contain the same
                 Key then the last occurances Value will be set in the DAGMan jobs environment.

          -priority number
                 Sets the minimum job priority of node jobs  submitted  and  running  under  this
                 condor_dagman job.

          -DontAlwaysRunPost
                 This option causes condor_dagman to not run the POST script of a node if the PRE
                 script fails.

          -AlwaysRunPost
                 This option causes condor_dagman to always run the POST script of a  node,  even
                 if the PRE script fails.

          -DoRecovery
                 Causes  condor_dagman  to  start  in recovery mode. This means that it reads the
                 relevant job user log(s) and catches up to the given DAG's previous state before
                 submitting any new jobs.

          -dot   Run condor_dagman up until the point when a DOT file is produced.

          -dag filename
                 filename  is  the  name  of  the  DAG  input  file that is set as an argument to
                 condor_submit_dag, and passed to condor_dagman.

EXIT STATUS

       condor_dagman will exit with a status value of 0 (zero) upon success,  and  it  will  exit
       with the value 1 (one) upon failure.

EXAMPLES

       condor_dagman  is  normally  not run directly, but submitted as an HTCondor job by running
       condor_submit_dag. See the condor_submit_dag manual page for examples.

AUTHOR

       HTCondor Team

COPYRIGHT

       1990-2024, Center for High Throughput Computing, Computer Sciences Department,  University
       of Wisconsin-Madison, Madison, WI, US. Licensed under the Apache License, Version 2.0.

                                           Aug 03, 2024                          CONDOR_DAGMAN(1)