oracular (1) salloc.1.gz

Provided by: slurm-client_24.05.2-1_amd64 bug

NAME

       salloc  -  Obtain  a  Slurm  job  allocation  (a  set  of nodes), execute a command, and then release the
       allocation when the command is finished.

SYNOPSIS

       salloc [OPTIONS(0)...] [ : [OPTIONS(N)...]] [command(0) [args(0)...]]

       Option(s) define multiple jobs in a co-scheduled heterogeneous job.  For more details about heterogeneous
       jobs see the document
       https://slurm.schedmd.com/heterogeneous_jobs.html

DESCRIPTION

       salloc  is  used  to  allocate a Slurm job allocation, which is a set of resources (nodes), possibly with
       some set of constraints (e.g. number of processors  per  node).  When  salloc  successfully  obtains  the
       requested  allocation,  it  then runs the command specified by the user. Finally, when the user specified
       command is complete, salloc relinquishes the job allocation.

       The command may be any program the  user  wishes.  Some  typical  commands  are  xterm,  a  shell  script
       containing  srun  commands,  and srun (see the EXAMPLES section). If no command is specified, then salloc
       runs the user's default shell.

       The following document describes the influence of various options on the allocation of cpus to  jobs  and
       tasks.
       https://slurm.schedmd.com/cpu_management.html

       NOTE: The salloc logic includes support to save and restore the terminal line settings and is designed to
       be executed in the foreground. If you need to execute salloc in the background, set its standard input to
       some file, for example: "salloc -n16 a.out </dev/null &"

RETURN VALUE

       If  salloc  is  unable  to execute the user command, it will return 1 and print errors to stderr. Else if
       success or if killed by signals HUP, INT, KILL, or QUIT: it will return 0.

COMMAND PATH RESOLUTION

       If provided, the command is resolved in the following order:

       1. If command starts with ".", then path is constructed as: current working directory / command
       2. If command starts with a "/", then path is considered absolute.
       3. If command can be resolved through PATH. See path_resolution(7).
       4. If command is in current working directory.

       Current working directory is the calling process working directory unless the --chdir argument is passed,
       which will override the current working directory.

OPTIONS

       -A, --account=<account>
              Charge  resources  used by this job to specified account.  The account is an arbitrary string. The
              account name may be changed after job submission using the scontrol command.

       --acctg-freq=<datatype>=<interval>[,<datatype>=<interval>...]
              Define the job accounting and profiling sampling intervals  in  seconds.   This  can  be  used  to
              override  the  JobAcctGatherFrequency  parameter  in  the  slurm.conf  file. <datatype>=<interval>
              specifies the task sampling interval for the jobacct_gather plugin or a sampling  interval  for  a
              profiling  type  by the acct_gather_profile plugin. Multiple comma-separated <datatype>=<interval>
              pairs may be specified. Supported datatype values are:

              task        Sampling interval for the  jobacct_gather  plugins  and  for  task  profiling  by  the
                          acct_gather_profile plugin.
                          NOTE:  This  frequency  is used to monitor memory usage. If memory limits are enforced
                          the highest frequency a user can request is what is configured in the slurm.conf file.
                          It can not be disabled.

              energy      Sampling interval for energy profiling using the acct_gather_energy plugin.

              network     Sampling interval for infiniband profiling using the acct_gather_interconnect plugin.

              filesystem  Sampling interval for filesystem profiling using the acct_gather_filesystem plugin.

              The  default  value for the task sampling interval is 30 seconds.  The default value for all other
              intervals is 0.  An interval of 0 disables sampling of the specified type.  If the  task  sampling
              interval  is  0,  accounting  information  is  collected  only  at job termination (reducing Slurm
              interference with the job).
              Smaller (non-zero) values have a greater impact upon job performance, but a value of 30 seconds is
              not likely to be noticeable for applications having less than 10,000 tasks.

       --bb=<spec>
              Burst  buffer  specification.  The  form of the specification is system dependent.  Note the burst
              buffer may not be accessible from a login node, but require that salloc spawn a shell  on  one  of
              its allocated compute nodes.  When the --bb option is used, Slurm parses this option and creates a
              temporary burst buffer script file that is used  internally  by  the  burst  buffer  plugins.  See
              Slurm's burst buffer guide for more information and examples:
              https://slurm.schedmd.com/burst_buffer.html

       --bbf=<file_name>
              Path  of  file  containing  burst  buffer  specification.  The form of the specification is system
              dependent.  Also see --bb.  Note the burst buffer may not be accessible from  a  login  node,  but
              require that salloc spawn a shell on one of its allocated compute nodes.  See Slurm's burst buffer
              guide for more information and examples:
              https://slurm.schedmd.com/burst_buffer.html

       --begin=<time>
              Defer eligibility of this job allocation until the specified time.

              Time may be of the form HH:MM:SS to run a job at a specific time of day  (seconds  are  optional).
              (If  that  time  is  already past, the next day is assumed.)  You may also specify midnight, noon,
              elevenses (11 AM), fika (3 PM) or teatime (4 PM) and you can have a time-of-day suffixed  with  AM
              or  PM  for running in the morning or the evening.  You can also say what day the job will be run,
              by specifying a date of the form MMDDYY or MM/DD/YY YYYY-MM-DD. Combine date and  time  using  the
              following  format  YYYY-MM-DD[THH:MM[:SS]].  You  can also give times like now + count time-units,
              where the time-units can be seconds (default), minutes, hours, days, or weeks  and  you  can  tell
              Slurm  to  run  the  job today with the keyword today and to run the job tomorrow with the keyword
              tomorrow.  The value may be changed after job submission using the scontrol command.  For example:
                 --begin=16:00
                 --begin=now+1hour
                 --begin=now+60           (seconds by default)
                 --begin=2010-01-20T12:34:00

              Notes on date/time specifications:
               - Although the 'seconds' field of the HH:MM:SS time specification is allowed by  the  code,  note
              that  the  poll time of the Slurm scheduler is not precise enough to guarantee dispatch of the job
              on the exact second. The job will be eligible to start on the next poll  following  the  specified
              time.  The  exact  poll interval depends on the Slurm scheduler (e.g., 60 seconds with the default
              sched/builtin).
               - If no time (HH:MM:SS) is specified, the default is (00:00:00).
               - If a date is specified without a year (e.g., MM/DD) then the current year  is  assumed,  unless
              the  combination  of  MM/DD  and HH:MM:SS has already passed for that year, in which case the next
              year is used.

       --bell Force salloc to ring the terminal bell when the job allocation is granted (and only if stdout is a
              tty).  By  default,  salloc  only  rings  the  bell if the allocation is pending for more than ten
              seconds (and only if stdout is a tty). Also see the option --no-bell.

       -D, --chdir=<path>
              Change directory to path before beginning execution. The path can be specified  as  full  path  or
              relative path to the directory where the command is executed.

       --cluster-constraint=<list>
              Specifies features that a federated cluster must have to have a sibling job submitted to it. Slurm
              will attempt to submit a sibling job to a cluster  if  it  has  at  least  one  of  the  specified
              features.

       -M, --clusters=<string>
              Clusters  to  issue  commands  to. Multiple cluster names may be comma separated.  The job will be
              submitted to the one cluster providing the earliest expected  job  initiation  time.  The  default
              value  is  the current cluster. A value of 'all' will query to run on all clusters.  Note that the
              slurmdbd must be up for this option  to  work  properly,  unless  running  in  a  federation  with
              FederationParameters=fed_display configured.

       --comment=<string>
              An arbitrary comment.

       -C, --constraint=<list>
              Nodes  can  have features assigned to them by the Slurm administrator.  Users can specify which of
              these features are required by their job using the constraint  option.  If  you  are  looking  for
              'soft'  constraints please see --prefer for more information.  Only nodes having features matching
              the job constraints will be used to satisfy the request.  Multiple constraints  may  be  specified
              with  AND,  OR, matching OR, resource counts, etc. (some operators are not supported on all system
              types).

              NOTE: Changeable features are features defined by a NodeFeatures plugin.

              Supported --constraint options include:

              Single Name
                     Only  nodes  which   have   the   specified   feature   will   be   used.    For   example,
                     --constraint="intel"

              Node Count
                     A request can specify the number of nodes needed with some feature by appending an asterisk
                     and count after  the  feature  name.   For  example,  --nodes=16  --constraint="graphics*4"
                     indicates  that  the  job requires 16 nodes and that at least four of those nodes must have
                     the feature "graphics."  If requesting more than one feature and  using  node  counts,  the
                     request must have square brackets surrounding it.

                     NOTE:  This option is not supported by the helpers NodeFeatures plugin.  Heterogeneous jobs
                     can be used instead.

              AND    Only nodes with all of specified features will be used.  The ampersand is used for  an  AND
                     operator.  For example, --constraint="intel&gpu"

              OR     Only  nodes with at least one of specified features will be used.  The vertical bar is used
                     for an OR operator. If changeable features are not requested, nodes in the  allocation  can
                     have  different  features. For example, salloc -N2 --constraint="intel|amd" can result in a
                     job allocation where one node has the intel feature and the other node has the amd feature.
                     However,  if  the  expression  contains  a  changeable  feature,  then all OR operators are
                     automatically treated as Matching OR so that all nodes in the job allocation have the  same
                     set  of  features.  For example, salloc -N2 --constraint="foo|bar&baz" The job is allocated
                     two nodes where both nodes have foo, or bar and baz (one or both nodes could have foo, bar,
                     and  baz).  The  helpers  NodeFeatures plugin will find the first set of node features that
                     matches all nodes in the job allocation; these features are set as active features  on  the
                     node   and  passed  to  RebootProgram  (see  slurm.conf(5))  and  the  helper  script  (see
                     helpers.conf(5)). In this case, the helpers plugin uses the first  of  "foo"  or  "bar,baz"
                     that match the two nodes in the job allocation.

              Matching OR
                     If  only  one of a set of possible options should be used for all allocated nodes, then use
                     the  OR  operator  and  enclose  the  options  within  square   brackets.    For   example,
                     --constraint="[rack1|rack2|rack3|rack4]"  might  be  used to specify that all nodes must be
                     allocated on a single rack of the cluster, but any of those four racks can be used.

              Multiple Counts
                     Specific counts of multiple resources may be  specified  by  using  the  AND  operator  and
                     enclosing      the      options      within      square     brackets.      For     example,
                     --constraint="[rack1*2&rack2*4]" might be used to specify that two nodes must be  allocated
                     from nodes with the feature of "rack1" and four nodes must be allocated from nodes with the
                     feature "rack2".

                     NOTE: This construct does not support multiple Intel KNL NUMA or MCDRAM modes. For example,
                     while        --constraint="[(knl&quad)*2&(knl&hemi)*4]"       is       not       supported,
                     --constraint="[haswell*2&(knl&hemi)*4]" is supported.  Specification of multiple KNL  modes
                     requires the use of a heterogeneous job.

                     NOTE: This option is not supported by the helpers NodeFeatures plugin.

                     NOTE: Multiple Counts can cause jobs to be allocated with a non-optimal network layout.

              Brackets
                     Brackets can be used to indicate that you are looking for a set of nodes with the different
                     requirements      contained       within       the       brackets.       For       example,
                     --constraint="[(rack1|rack2)*1&(rack3)*2]" will get you one node with either the "rack1" or
                     "rack2" features and two nodes with the "rack3"  feature.   If  requesting  more  than  one
                     feature and using node counts, the request must have square brackets surrounding it.

                     NOTE: Brackets are only reserved for Multiple Counts and Matching OR syntax.  AND operators
                     require a count for each feature inside square  brackets  (i.e.  "[quad*2&hemi*1]").  Slurm
                     will only allow a single set of bracketed constraints per job.

                     NOTE: Square brackets are not supported by the helpers NodeFeatures plugin. Matching OR can
                     be requested without square brackets by using the vertical bar character with at least  one
                     changeable feature.

              Parentheses
                     Parentheses   can   be   used   to   group   like  node  features  together.  For  example,
                     --constraint="[(knl&snc4&flat)*4&haswell*1]" might be used to specify that four nodes  with
                     the  features  "knl",  "snc4"  and  "flat"  plus  one  node  with the feature "haswell" are
                     required.  Parentheses can also be used to  group  operations.  Without  parentheses,  node
                     features  are  parsed strictly from left to right.  For example, --constraint="foo&bar|baz"
                     requests nodes with foo and bar, or baz.  --constraint="foo|bar&baz"  requests  nodes  with
                     foo   and   baz,   or   bar   and   baz   (note   how   baz  was  AND'd  with  everything).
                     --constraint="foo&(bar|baz)" requests nodes with foo and at least one of bar or baz.  NOTE:
                     OR within parentheses should not be used with a KNL NodeFeatures plugin but is supported by
                     the helpers NodeFeatures plugin.

       --container=<path_to_container>
              Absolute path to OCI container bundle.

       --container-id=<container_id>
              Unique name for OCI container.

       --contiguous
              If set, then the allocated nodes must form a contiguous set.

              NOTE: If the SelectType is cons_tres this option  won't  be  honored  with  the  topology/tree  or
              topology/3d_torus plugins, both of which can modify the node ordering.

       -S, --core-spec=<num>
              Count  of Specialized Cores per node reserved by the job for system operations and not used by the
              application.  If AllowSpecResourcesUsage is enabled a job can override the  CoreSpecCount  of  all
              its allocated nodes with this option.  The overridden Specialized Cores will still be reserved for
              system processes.  The job will get an implicit --exclusive allocation for the rest of  the  Cores
              on  the  nodes, resulting in the job's processes being able to use (and being charged for) all the
              Cores on the nodes except for the overridden Specialized Cores.  This option can not be used  with
              the --thread-spec option.

              NOTE: Explicitly setting a job's specialized core value implicitly sets the --exclusive option.

       --cores-per-socket=<cores>
              Restrict  node  selection  to  nodes  with  at least the specified number of cores per socket. See
              additional information under -B option above when task/affinity plugin is enabled.
              NOTE: This option may implicitly set the number of tasks (if -n was not specified) as one task per
              requested thread.

       --cpu-freq=<p1>[-p2][:p3]

              Request  that job steps initiated by srun commands inside this allocation be run at some requested
              frequency if possible, on the CPUs selected for the step on the compute node(s).

              p1 can be [#### | low | medium | high | highm1] which will set the frequency scaling_speed to  the
              corresponding value, and set the frequency scaling_governor to UserSpace. See below for definition
              of the values.

              p1 can be [Conservative | OnDemand | Performance | PowerSave] which will set the  scaling_governor
              to  the  corresponding  value.  The  governor  has  to be in the list set by the slurm.conf option
              CpuFreqGovernors.

              When p2 is present, p1 will be the minimum scaling frequency and p2 will be  the  maximum  scaling
              frequency. In that case the governor p3 or CpuFreqDef cannot be UserSpace since it doesn't support
              a range.

              p2 can be [#### | medium | high | highm1]. p2 must be greater than p1  and  is  incompatible  with
              UserSpace governor.

              p3  can  be [Conservative | OnDemand | Performance | PowerSave | SchedUtil | UserSpace] which will
              set the governor to the corresponding value.

              If p3 is UserSpace, the frequency scaling_speed, scaling_max_freq  and  scaling_min_freq  will  be
              statically set to the value defined by p1.

              Any  requested  frequency  below  the  minimum  available frequency will be rounded to the minimum
              available frequency. In the  same  way,  any  requested  frequency  above  the  maximum  available
              frequency will be rounded to the maximum available frequency.

              The  CpuFreqDef  parameter  in  slurm.conf  will  be used to set the governor in absence of p3. If
              there's no CpuFreqDef, the default governor will be to use the system current governor set in each
              cpu. Specifying a range without CpuFreqDef or a specific governor is therefore not allowed.

              Acceptable values at present include:

              ####          frequency in kilohertz

              Low           the lowest available frequency

              High          the highest available frequency

              HighM1        (high minus one) will select the next highest available frequency

              Medium        attempts to set a frequency in the middle of the available range

              Conservative  attempts to use the Conservative CPU governor

              OnDemand      attempts to use the OnDemand CPU governor (the default value)

              Performance   attempts to use the Performance CPU governor

              PowerSave     attempts to use the PowerSave CPU governor

              UserSpace     attempts to use the UserSpace CPU governor

              The following informational environment variable is set in the job
              step when --cpu-freq option is requested.
                      SLURM_CPU_FREQ_REQ

              This environment variable can also be used to supply the value for the CPU frequency request if it
              is set when the 'srun' command is issued.  The --cpu-freq on the command line  will  override  the
              environment  variable value. The form on the environment variable is the same as the command line.
              See the ENVIRONMENT VARIABLES section for a description of the SLURM_CPU_FREQ_REQ variable.

              NOTE: This parameter is treated as a request, not a requirement.  If the job step's node does  not
              support  setting  the  CPU  frequency,  or  the requested value is outside the bounds of the legal
              frequencies, an error is logged, but the job step is allowed to continue.

              NOTE: Setting the frequency for just the CPUs of the job step implies that the tasks are  confined
              to  those  CPUs.  If  task  confinement  (i.e.  the  task/affinity  TaskPlugin  is enabled, or the
              task/cgroup TaskPlugin is enabled with "ConstrainCores=yes" set in cgroup.conf) is not configured,
              this parameter is ignored.

              NOTE:  When  the  step  completes, the frequency and governor of each selected CPU is reset to the
              previous values.

              NOTE: When submitting jobs with the --cpu-freq option with  linuxproc  as  the  ProctrackType  can
              cause  jobs  to run too quickly before Accounting is able to poll for job information. As a result
              not all of accounting information will be present.

       --cpus-per-gpu=<ncpus>
              Request that ncpus processors be allocated per allocated GPU.  Steps inheriting  this  value  will
              imply --exact.  Not compatible with the --cpus-per-task option.

       -c, --cpus-per-task=<ncpus>
              Advise  Slurm that ensuing job steps will require ncpus processors per task. By default Slurm will
              allocate one processor per task.

              For instance, consider an application that has 4  tasks,  each  requiring  3  processors.  If  our
              cluster  is comprised of quad-processors nodes and we simply ask for 12 processors, the controller
              might give us only 3 nodes. However, by using the --cpus-per-task=3 options, the controller  knows
              that each task requires 3 processors on the same node, and the controller will grant an allocation
              of 4 nodes, one for each of the 4 tasks.

       --deadline=<OPT>
              Remove the job if no ending is possible before this deadline (start >  (deadline  -  time[-min])).
              Default  is  no  deadline.  Note  that  if  neither  DefaultTime nor MaxTime are configured on the
              partition the job is in, the job will need to specify some form of time limit (--time[-min]) if  a
              deadline is to be used.

              Valid time formats are:
              HH:MM[:SS] [AM|PM]
              MMDD[YY] or MM/DD[/YY] or MM.DD[.YY]
              MM/DD[/YY]-HH:MM[:SS]
              YYYY-MM-DD[THH:MM[:SS]]]
              now[+count[seconds(default)|minutes|hours|days|weeks]]

       --delay-boot=<minutes>
              Do  not  reboot  nodes  in order to satisfied this job's feature specification if the job has been
              eligible to run for less than this time period.  If the job has waited for less than the specified
              period,  it  will  use  only  nodes which already have the specified features.  The argument is in
              units of minutes.  A default value may be set by  a  system  administrator  using  the  delay_boot
              option  of  the  SchedulerParameters configuration parameter in the slurm.conf file, otherwise the
              default value is zero (no delay).

       -d, --dependency=<dependency_list>
              Defer  the  start  of  this  job  until  the   specified   dependencies   have   been   satisfied.
              <dependency_list>    is    of    the    form    <type:job_id[:job_id][,type:job_id[:job_id]]>   or
              <type:job_id[:job_id][?type:job_id[:job_id]]>.  All dependencies must  be  satisfied  if  the  ","
              separator  is  used.   Any  dependency  may  be  satisfied if the "?" separator is used.  Only one
              separator may be used. For instance:
              -d afterok:20:21,afterany:23
              means that the job can run only after a 0 return code of jobs 20 and 21 AND completion of job  23.
              However:
              -d afterok:20:21?afterany:23
              means  that  any  of  the  conditions  (afterok:20 OR afterok:21 OR afterany:23) will be enough to
              release the job.  Many jobs can share the same dependency  and  these  jobs  may  even  belong  to
              different  users.  The  value  may  be  changed  after  job submission using the scontrol command.
              Dependencies on remote jobs are allowed in a federation.  Once a job dependency fails due  to  the
              termination  state  of a preceding job, the dependent job will never be run, even if the preceding
              job is requeued and has a different termination state in a subsequent execution.

              after:job_id[[+time][:jobid[+time]...]]
                     After the specified jobs start or are cancelled and 'time' in minutes  from  job  start  or
                     cancellation  happens, this job can begin execution. If no 'time' is given then there is no
                     delay after start or cancellation.

              afterany:job_id[:jobid...]
                     This job can begin execution after the specified jobs have terminated.  This is the default
                     dependency type.

              afterburstbuffer:job_id[:jobid...]
                     This  job  can  begin execution after the specified jobs have terminated and any associated
                     burst buffer stage out operations have completed.

              aftercorr:job_id[:jobid...]
                     A task of this job array can begin  execution  after  the  corresponding  task  ID  in  the
                     specified job has completed successfully (ran to completion with an exit code of zero).

              afternotok:job_id[:jobid...]
                     This  job can begin execution after the specified jobs have terminated in some failed state
                     (non-zero exit code, node failure, timed out, etc).  This job must be submitted  while  the
                     specified  job  is  still  active  or  within MinJobAge seconds after the specified job has
                     ended.

              afterok:job_id[:jobid...]
                     This job can begin execution after the specified jobs have successfully  executed  (ran  to
                     completion  with an exit code of zero).  This job must be submitted while the specified job
                     is still active or within MinJobAge seconds after the specified job has ended.

              singleton
                     This job can begin execution after any previously launched jobs sharing the same  job  name
                     and user have terminated.  In other words, only one job by that name and owned by that user
                     can be running or suspended at any point in time.  In a federation, a singleton  dependency
                     must  be  fulfilled on all clusters unless DependencyParameters=disable_remote_singleton is
                     used in slurm.conf.

       -m,
       --distribution={*|block|cyclic|arbitrary|plane=<size>}[:{*|block|cyclic|fcyclic}[:{*|block|cyclic|fcyclic}]][,{Pack|NoPack}]

              Specify alternate distribution methods for  remote  processes.   For  job  allocation,  this  sets
              environment  variables  that will be used by subsequent srun requests and also affects which cores
              will be selected for job allocation.

              This option controls the distribution  of  tasks  to  the  nodes  on  which  resources  have  been
              allocated, and the distribution of those resources to tasks for binding (task affinity). The first
              distribution method (before the first ":") controls the  distribution  of  tasks  to  nodes.   The
              second  distribution  method  (after  the  first  ":") controls the distribution of allocated CPUs
              across sockets for binding to tasks. The third distribution method (after the second ":") controls
              the  distribution  of  allocated  CPUs  across  cores  for binding to tasks.  The second and third
              distributions apply only if task affinity is enabled.  The third distribution is supported only if
              the task/cgroup plugin is configured. The default value for each distribution type is specified by
              *.

              Note that with select/cons_tres, the number of CPUs allocated to  each  socket  and  node  may  be
              different.  Refer  to  https://slurm.schedmd.com/mc_support.html  for more information on resource
              allocation, distribution of tasks to nodes, and binding of tasks to CPUs.
              First distribution method (distribution of tasks across nodes):

              *      Use the default method for distributing tasks to nodes (block).

              block  The block distribution method will distribute tasks to a node such that  consecutive  tasks
                     share  a  node.  For  example,  consider an allocation of three nodes each with two cpus. A
                     four-task block distribution request will distribute those tasks to the  nodes  with  tasks
                     one  and  two  on the first node, task three on the second node, and task four on the third
                     node. Block distribution is the default behavior if the number of tasks exceeds the  number
                     of allocated nodes.

              cyclic The  cyclic distribution method will distribute tasks to a node such that consecutive tasks
                     are distributed over consecutive nodes (in a round-robin fashion). For example, consider an
                     allocation  of three nodes each with two cpus. A four-task cyclic distribution request will
                     distribute those tasks to the nodes with tasks one and four on the first node, task two  on
                     the  second  node,  and  task  three  on  the  third  node.   Note  that when SelectType is
                     select/cons_tres, the same number  of  CPUs  may  not  be  allocated  on  each  node.  Task
                     distribution will be round-robin among all the nodes with CPUs yet to be assigned to tasks.
                     Cyclic distribution is the default behavior if the number of tasks is no  larger  than  the
                     number of allocated nodes.

              plane  The  tasks  are  distributed  in  blocks  of  size  <size>.  The  size  must  be  given  or
                     SLURM_DIST_PLANESIZE must be set. The number of tasks distributed to each node is the  same
                     as for cyclic distribution, but the taskids assigned to each node depend on the plane size.
                     Additional distribution specifications cannot be  combined  with  this  option.   For  more
                     details        (including        examples       and       diagrams),       please       see
                     https://slurm.schedmd.com/mc_support.html and https://slurm.schedmd.com/dist_plane.html

              arbitrary
                     The arbitrary method of distribution will allocate processes in-order  as  listed  in  file
                     designated  by  the environment variable SLURM_HOSTFILE. If this variable is listed it will
                     override any other method specified. If not set the method will default to  block.   Inside
                     the  hostfile  must contain at minimum the number of hosts requested and be one per line or
                     comma separated. If specifying a task count (-n, --ntasks=<number>),  your  tasks  will  be
                     laid out on the nodes in the order of the file.
                     NOTE:  The  arbitrary distribution option on a job allocation only controls the nodes to be
                     allocated to the job and not the allocation of CPUs on those nodes. This  option  is  meant
                     primarily  to  control  a job step's task layout in an existing job allocation for the srun
                     command.
                     NOTE: If the number of tasks is given and a list of requested  nodes  is  also  given,  the
                     number of nodes used from that list will be reduced to match that of the number of tasks if
                     the number of nodes in the list is greater than the number of tasks.

              Second distribution method (distribution of CPUs across sockets for binding):

              *      Use the default method for distributing CPUs across sockets (cyclic).

              block  The block distribution method will distribute allocated CPUs consecutively  from  the  same
                     socket for binding to tasks, before using the next consecutive socket.

              cyclic The  cyclic  distribution method will distribute allocated CPUs for binding to a given task
                     consecutively from the same socket, and from the next consecutive socket for the next task,
                     in  a  round-robin fashion across sockets.  Tasks requiring more than one CPU will have all
                     of those CPUs allocated on a single socket if possible.
                     NOTE: In nodes with hyper-threading enabled, a  task  not  requesting  full  cores  may  be
                     distributed  across  sockets.  This can be avoided by specifying --ntasks-per-core=1, which
                     forces tasks to allocate full cores.

              fcyclic
                     The fcyclic distribution method will distribute allocated CPUs for binding  to  tasks  from
                     consecutive sockets in a round-robin fashion across the sockets.  Tasks requiring more than
                     one CPU will have each CPUs allocated in a cyclic fashion across sockets.

              Third distribution method (distribution of CPUs across cores for binding):

              *      Use  the  default  method  for  distributing  CPUs  across  cores  (inherited  from  second
                     distribution method).

              block  The  block  distribution  method will distribute allocated CPUs consecutively from the same
                     core for binding to tasks, before using the next consecutive core.

              cyclic The cyclic distribution method will distribute allocated CPUs for binding to a  given  task
                     consecutively  from the same core, and from the next consecutive core for the next task, in
                     a round-robin fashion across cores.

              fcyclic
                     The fcyclic distribution method will distribute allocated CPUs for binding  to  tasks  from
                     consecutive cores in a round-robin fashion across the cores.

              Optional control for task distribution over nodes:

              Pack   Rather  than evenly distributing a job step's tasks evenly across its allocated nodes, pack
                     them as tightly as possible on  the  nodes.   This  only  applies  when  the  "block"  task
                     distribution method is used.

              NoPack Rather than packing a job step's tasks as tightly as possible on the nodes, distribute them
                     evenly.   This  user  option  will   supersede   the   SelectTypeParameters   CR_Pack_Nodes
                     configuration parameter.

       -x, --exclude=<node_name_list>
              Explicitly exclude certain nodes from the resources granted to the job.

       --exclusive[={user|mcs|topo}]
              The  job allocation can not share nodes (or topology segment  with the "=topo") with other running
              jobs (or just other users with the "=user" option or with the "=mcs" option).   If  user/mcsi/topo
              are  not  specified (i.e. the job allocation can not share nodes with other running jobs), the job
              is allocated all CPUs and GRES on all nodes in the allocation,  but  is  only  allocated  as  much
              memory as it requested. This is by design to support gang scheduling, because suspended jobs still
              reside in memory. To request all the memory on a node, use --mem=0.  The default  shared/exclusive
              behavior depends on system configuration and the partition's OverSubscribe option takes precedence
              over the job's option.  NOTE: Since shared GRES (MPS) cannot be allocated at the same  time  as  a
              sharing GRES (GPU) this option only allocates all sharing GRES and no underlying shared GRES.

              NOTE: This option is mutually exclusive with --oversubscribe.

       --extra=<string>
              An  arbitrary  string  enclosed  in  single  or  double  quotes  if  using  spaces or some special
              characters.

              If SchedulerParameters=extra_constraints is enabled, this string is used for node filtering  based
              on the Extra field in each node.

       -B, --extra-node-info=<sockets>[:cores[:threads]]
              Restrict  node  selection to nodes with at least the specified number of sockets, cores per socket
              and/or threads per core.
              NOTE: These options do not  specify  the  resource  allocation  size.   Each  value  specified  is
              considered  a minimum.  An asterisk (*) can be used as a placeholder indicating that all available
              resources of that type are to be utilized. Values can also be specified as min-max. The individual
              levels can also be specified in separate options if desired:
                  --sockets-per-node=<sockets>
                  --cores-per-socket=<cores>
                  --threads-per-core=<threads>
              If  task/affinity  plugin is enabled, then specifying an allocation in this manner also results in
              subsequently launched tasks being bound to threads if the -B  option  specifies  a  thread  count,
              otherwise  an  option  of  cores if a core count is specified, otherwise an option of sockets.  If
              SelectType is configured to select/cons_tres, it must have a parameter of CR_Core, CR_Core_Memory,
              CR_Socket, or CR_Socket_Memory for this option to be honored.  If not specified, the scontrol show
              job will display 'ReqS:C:T=*:*:*'. This option applies to job allocations.
              NOTE: This option is mutually exclusive with --hint, --threads-per-core and --ntasks-per-core.
              NOTE: This option may implicitly set the number of tasks (if -n was not specified) as one task per
              requested thread.

       --get-user-env[=timeout][mode]
              This option will load login environment variables for the user specified in the --uid option.  The
              environment variables are retrieved by running something along the lines of "su  -  <username>  -c
              /usr/bin/env"  and  parsing  the  output.   Be aware that any environment variables already set in
              salloc's environment will take precedence over any  environment  variables  in  the  user's  login
              environment.   The optional timeout value is in seconds. Default value is 3 seconds.  The optional
              mode value controls the "su" options.  With a mode value of "S", "su" is executed without the  "-"
              option.   With  a  mode  value of "L", "su" is executed with the "-" option, replicating the login
              environment.  If mode is not specified,  the  mode  established  at  Slurm  build  time  is  used.
              Examples   of   use   include   "--get-user-env",  "--get-user-env=10"  "--get-user-env=10L",  and
              "--get-user-env=S".  NOTE: This option only works if the caller has an effective uid of "root".

       --gid=<group>
              Submit the job with the specified group's group access permissions.  group may be the  group  name
              or  the  numerical  group  ID.  In the default Slurm configuration, this option is only valid when
              used by the user root.

       --gpu-bind=[verbose,]<type>
              Equivalent  to  --tres-bind=gres/gpu:[verbose,]<type>  See  --tres-bind  for   all   options   and
              documentation.

       --gpu-freq=[<type]=value>[,<type=value>][,verbose]
              Request that GPUs allocated to the job are configured with specific frequency values.  This option
              can be used to independently configure the GPU and its  memory  frequencies.   After  the  job  is
              completed,  the frequencies of all affected GPUs will be reset to the highest possible values.  In
              some cases, system power caps may override the requested values.  The field type can be  "memory".
              If  type  is  not  specified,  the GPU frequency is implied.  The value field can either be "low",
              "medium", "high", "highm1" or a numeric value in megahertz (MHz).  If the specified numeric  value
              is  not  possible,  a  value  as  close  as possible will be used. See below for definition of the
              values.  The verbose option causes current GPU frequency information to be  logged.   Examples  of
              use include "--gpu-freq=medium,memory=high" and "--gpu-freq=450".

              Supported value definitions:

              low       the lowest available frequency.

              medium    attempts to set a frequency in the middle of the available range.

              high      the highest available frequency.

              highm1    (high minus one) will select the next highest available frequency.

       -G, --gpus=[type:]<number>
              Specify  the total number of GPUs required for the job.  An optional GPU type specification can be
              supplied.  For example "--gpus=volta:3".  See  also  the  --gpus-per-node,  --gpus-per-socket  and
              --gpus-per-task options.
              NOTE: The allocation has to contain at least one GPU per node, or one of each GPU type per node if
              types are used. Use heterogeneous jobs if different nodes need different GPU types.

       --gpus-per-node=[type:]<number>
              Specify the number of GPUs required for the job on  each  node  included  in  the  job's  resource
              allocation.     An   optional   GPU   type   specification   can   be   supplied.    For   example
              "--gpus-per-node=volta:3".  Multiple options can be requested  in  a  comma  separated  list,  for
              example:   "--gpus-per-node=volta:3,kepler:1".    See   also  the  --gpus,  --gpus-per-socket  and
              --gpus-per-task options.

       --gpus-per-socket=[type:]<number>
              Specify the number of GPUs required for the job on each socket  included  in  the  job's  resource
              allocation.     An   optional   GPU   type   specification   can   be   supplied.    For   example
              "--gpus-per-socket=volta:3".  Multiple options can be requested in a  comma  separated  list,  for
              example: "--gpus-per-socket=volta:3,kepler:1".  Requires job to specify a sockets per node count (
              --sockets-per-node).  See also the --gpus, --gpus-per-node and --gpus-per-task options.

       --gpus-per-task=[type:]<number>
              Specify the number of GPUs required for the job on each task to be spawned in the  job's  resource
              allocation.     An   optional   GPU   type   specification   can   be   supplied.    For   example
              "--gpus-per-task=volta:1". Multiple options can be  requested  in  a  comma  separated  list,  for
              example:   "--gpus-per-task=volta:3,kepler:1".   See   also   the  --gpus,  --gpus-per-socket  and
              --gpus-per-node options.  This option requires an  explicit  task  count,  e.g.  -n,  --ntasks  or
              "--gpus=X  --gpus-per-task=Y"  rather  than  an  ambiguous  range of nodes with -N, --nodes.  This
              option  will  implicitly  set  --tres-bind=gres/gpu:per_task:<gpus_per_task>,  but  that  can   be
              overridden with an explicit --tres-bind=gres/gpu specification.

       --gres=<list>
              Specifies  a  comma-delimited  list of generic consumable resources.  The format for each entry in
              the list is "name[[:type]:count]".  The name is the type of consumable resource (e.g.  gpu).   The
              type is an optional classification for the resource (e.g. a100).  The count is the number of those
              resources with a default value of 1.  The count can have a suffix  of  "k"  or  "K"  (multiple  of
              1024),  "m"  or "M" (multiple of 1024 x 1024), "g" or "G" (multiple of 1024 x 1024 x 1024), "t" or
              "T" (multiple of 1024 x 1024 x 1024 x 1024), "p" or "P" (multiple of 1024 x 1024 x 1024 x  1024  x
              1024).   The specified resources will be allocated to the job on each node.  The available generic
              consumable resources is configurable by the system administrator.  A  list  of  available  generic
              consumable  resources  will be printed and the command will exit if the option argument is "help".
              Examples of use include "--gres=gpu:2", "--gres=gpu:kepler:2", and "--gres=help".

       --gres-flags=<type>
              Specify generic resource task binding options.

              multiple-tasks-per-sharing
                     Negate  one-task-per-sharing.   This   is   useful   if   it   is   set   by   default   in
                     SelectTypeParameters.

              disable-binding
                     Negate enforce-binding. This is useful if it is set by default in SelectTypeParameters.

              enforce-binding
                     The  only CPUs available to the job will be those bound to the selected GRES (i.e. the CPUs
                     identified in the gres.conf file will be strictly enforced).  This  option  may  result  in
                     delayed  initiation  of  a  job.   For example a job requiring two GPUs and one CPU will be
                     delayed until both GPUs on a single socket are available rather than using  GPUs  bound  to
                     separate  sockets,  however,  the  application  performance may be improved due to improved
                     communication speed.  Requires the node to be configured with  more  than  one  socket  and
                     resource filtering will be performed on a per-socket basis.
                     NOTE: This option can be set by default in SelectTypeParameters.
                     NOTE: This option is specific to SelectType=cons_tres.

              one-task-per-sharing
                     Do not allow different tasks in to be allocated shared gres from the same sharing gres.
                     NOTE: This flag is only enforced if shared gres are requested with --tres-per-task.
                     NOTE:       This       option       can       be       set       by       default      with
                     SelectTypeParameters=ONE_TASK_PER_SHARING_GRES.
                     NOTE: This option is specific to SelectTypeParameters=MULTIPLE_SHARING_GRES_PJ

       -h, --help
              Display help information and exit.

       --hint=<type>
              Bind tasks according to application hints.
              NOTE: This option implies specific values for certain related options, which prevents its use with
              any  user-specified  values  for  --ntasks-per-core,  --threads-per-core or -B.  These conflicting
              options will override --hint when specified as command line arguments. If a conflicting option  is
              specified as an environment variable, --hint as a command line argument will take precedence.

              compute_bound
                     Select  settings  for  compute bound applications: use all cores in each socket, one thread
                     per core.

              memory_bound
                     Select settings for memory bound applications: use only one core in each socket, one thread
                     per core.

              multithread
                     Use  extra  threads  with in-core multi-threading which can benefit communication intensive
                     applications.  Only supported with the task/affinity plugin.

              nomultithread
                     Don't use extra threads with in-core multi-threading; restricts tasks  to  one  thread  per
                     core.  Only supported with the task/affinity plugin.

              help   show this help message

       -H, --hold
              Specify  the  job  is  to  be submitted in a held state (priority of zero).  A held job can now be
              released using scontrol to reset its priority (e.g. "scontrol release <job_id>").

       -I, --immediate[=<seconds>]
              exit if resources are not available within the time period specified.  If  no  argument  is  given
              (seconds  defaults  to  1), resources must be available immediately for the request to succeed. If
              defer is configured  in  SchedulerParameters  and  seconds=1  the  allocation  request  will  fail
              immediately;  defer  conflicts  and takes precedence over this option.  By default, --immediate is
              off, and the command will block until resources become available. Since this option's argument  is
              optional,  for proper parsing the single letter option must be followed immediately with the value
              and not include a space between them. For example "-I60" and not "-I 60".

       -J, --job-name=<jobname>
              Specify a name for the job allocation. The specified name will appear along with the job id number
              when  querying  running  jobs  on  the  system.  The default job name is the name of the "command"
              specified on the command line.

       -K, --kill-command[=signal]
              salloc always runs a user-specified command once the  allocation  is  granted.  salloc  will  wait
              indefinitely  for that command to exit.  If you specify the --kill-command option salloc will send
              a signal to your command any time that the Slurm controller tells salloc that its  job  allocation
              has  been revoked. The job allocation can be revoked for a couple of reasons: someone used scancel
              to revoke the allocation, or the allocation reached its time limit. If you do not specify a signal
              name  or  number  and  Slurm  is  configured to signal the spawned command at job termination, the
              default signal is SIGHUP for interactive and SIGTERM  for  non-interactive  sessions.  Since  this
              option's  argument  is  optional,  for  proper  parsing  the single letter option must be followed
              immediately with the value and not include a space between them. For example "-K1" and not "-K 1".

       -L, --licenses=<license>[@db][:count][,license[@db][:count]...]
              Specification of licenses (or other resources available on all nodes of the cluster) which must be
              allocated  to  this job.  License names can be followed by a colon and count (the default count is
              one).  Multiple license names should be comma separated (e.g.  "--licenses=foo:4,bar").

              NOTE: When submitting heterogeneous jobs, license requests may only be made on the first component
              job.  For example "salloc -L ansys:2 :".

       --mail-type=<type>
              Notify  user  by  email  when  certain event types occur.  Valid type values are NONE, BEGIN, END,
              FAIL, REQUEUE, ALL (equivalent to BEGIN,  END,  FAIL,  INVALID_DEPEND,  REQUEUE,  and  STAGE_OUT),
              INVALID_DEPEND  (dependency  never  satisfied),  STAGE_OUT  (burst  buffer  stage out and teardown
              completed), TIME_LIMIT, TIME_LIMIT_90 (reached 90 percent of time limit),  TIME_LIMIT_80  (reached
              80  percent  of  time limit), and TIME_LIMIT_50 (reached 50 percent of time limit).  Multiple type
              values may be specified in a comma separated list.  NONE will suppress  all  event  notifications,
              ignoring  any other values specified.  By default no email notifications are sent.  The user to be
              notified is indicated with --mail-user.

       --mail-user=<user>
              User to receive email notification of state changes as defined by --mail-type. This may be a  full
              email  address  or a username. If a username is specified, the value from MailDomain in slurm.conf
              will be appended to create an email address.  The default value is the submitting user.

       --mcs-label=<mcs>
              Used only when the mcs/group plugin is enabled.  This parameter is a group among the groups of the
              user.  Default value is calculated by the Plugin mcs if it's enabled.

       --mem=<size>[units]
              Specify  the  real memory required per node.  Default units are megabytes.  Different units can be
              specified using the suffix [K|M|G|T].  Default value is DefMemPerNode and  the  maximum  value  is
              MaxMemPerNode.  If  configured,  both  of  parameters  can  be seen using the scontrol show config
              command.   This  parameter  would  generally  be  used  if  whole  nodes  are  allocated  to  jobs
              (SelectType=select/linear).   Also  see --mem-per-cpu and --mem-per-gpu.  The --mem, --mem-per-cpu
              and --mem-per-gpu options are mutually exclusive. If --mem,  --mem-per-cpu  or  --mem-per-gpu  are
              specified as command line arguments, then they will take precedence over the environment.

              NOTE:  A  memory size specification of zero is treated as a special case and grants the job access
              to all of the memory on each node.

              NOTE: Memory requests will not  be  strictly  enforced  unless  Slurm  is  configured  to  use  an
              enforcement  mechanism. See ConstrainRAMSpace in the cgroup.conf(5) man page and OverMemoryKill in
              the slurm.conf(5) man page for more details.

       --mem-bind=[{quiet|verbose},]<type>
              Bind tasks to memory. Used only when the task/affinity plugin  is  enabled  and  the  NUMA  memory
              functions  are  available.   Note that the resolution of CPU and memory binding may differ on some
              architectures. For example, CPU binding may be performed at  the  level  of  the  cores  within  a
              processor  while  memory  binding will be performed at the level of nodes, where the definition of
              "nodes" may differ from system to system.  By default no memory binding  is  performed;  any  task
              using  any CPU can use any memory. This option is typically used to ensure that each task is bound
              to the memory closest to its assigned CPU. The use of any type other than "none" or "local" is not
              recommended.

              NOTE:  To  have  Slurm always report on the selected memory binding for all commands executed in a
              shell, you can enable verbose mode by setting the SLURM_MEM_BIND  environment  variable  value  to
              "verbose".

              The following informational environment variables are set when --mem-bind is in use:

                   SLURM_MEM_BIND_LIST
                   SLURM_MEM_BIND_PREFER
                   SLURM_MEM_BIND_SORT
                   SLURM_MEM_BIND_TYPE
                   SLURM_MEM_BIND_VERBOSE

              See  the  ENVIRONMENT  VARIABLES  section  for  a  more  detailed  description  of  the individual
              SLURM_MEM_BIND* variables.

              Supported options include:

              help   show this help message

              local  Use memory local to the processor in use

              map_mem:<list>
                     Bind  by  setting  memory  masks  on  tasks  (or  ranks)  as  specified  where  <list>   is
                     <numa_id_for_task_0>,<numa_id_for_task_1>,...   The  mapping  is  specified  for a node and
                     identical mapping is applied to the tasks on every node (i.e. the lowest task  ID  on  each
                     node  is  mapped to the first ID specified in the list, etc.).  NUMA IDs are interpreted as
                     decimal values unless they are preceded  with  '0x'  in  which  case  they  interpreted  as
                     hexadecimal  values.   If  the number of tasks (or ranks) exceeds the number of elements in
                     this list, elements in the list will be reused as needed starting from the beginning of the
                     list.   To  simplify  support  for  large  task  counts, the lists may follow a map with an
                     asterisk and repetition  count.   For  example  "map_mem:0x0f*4,0xf0*4".   For  predictable
                     binding results, all CPUs for each node in the job should be allocated to the job.

              mask_mem:<list>
                     Bind   by  setting  memory  masks  on  tasks  (or  ranks)  as  specified  where  <list>  is
                     <numa_mask_for_task_0>,<numa_mask_for_task_1>,...  The mapping is specified for a node  and
                     identical  mapping  is  applied to the tasks on every node (i.e. the lowest task ID on each
                     node is mapped to the first mask specified in the  list,  etc.).   NUMA  masks  are  always
                     interpreted  as  hexadecimal  values.  Note that masks must be preceded with a '0x' if they
                     don't begin with [0-9] so they are seen as numerical values.  If the number  of  tasks  (or
                     ranks)  exceeds the number of elements in this list, elements in the list will be reused as
                     needed starting from the beginning of the list.  To simplify support for large task counts,
                     the  lists  may  follow  a  mask  with  an  asterisk  and  repetition  count.   For example
                     "mask_mem:0*4,1*4".  For predictable binding results, all CPUs for each  node  in  the  job
                     should be allocated to the job.

              no[ne] don't bind tasks to memory (default)

              p[refer]
                     Prefer use of first specified NUMA node, but permit
                      use of other available NUMA nodes.

              q[uiet]
                     quietly bind before task runs (default)

              rank   bind by task rank (not recommended)

              sort   sort free cache pages (run zonesort on Intel KNL nodes)

              v[erbose]
                     verbosely report binding before task runs

       --mem-per-cpu=<size>[units]
              Minimum  memory  required per usable allocated CPU.  Default units are megabytes.  Different units
              can be specified using the suffix [K|M|G|T].  The default value is DefMemPerCPU  and  the  maximum
              value  is MaxMemPerCPU (see exception below). If configured, both parameters can be seen using the
              scontrol show config command.  Note that if the job's --mem-per-cpu value exceeds  the  configured
              MaxMemPerCPU, then the user's limit will be treated as a memory limit per task; --mem-per-cpu will
              be reduced to a value no larger than MaxMemPerCPU; --cpus-per-task will be set and  the  value  of
              --cpus-per-task  multiplied  by  the new --mem-per-cpu value will equal the original --mem-per-cpu
              value specified by the user. If the user already specified a value for --cpus-per-task, it will be
              respected and only the total amount of allocated CPUs will change.  This parameter would generally
              be used  if  individual  processors  are  allocated  to  jobs  (SelectType=select/cons_tres).   If
              resources  are  allocated  by core, socket, or whole nodes, then the number of CPUs allocated to a
              job may be higher than  the  task  count  and  the  value  of  --mem-per-cpu  should  be  adjusted
              accordingly.   Also  see  --mem  and  --mem-per-gpu.   The  --mem, --mem-per-cpu and --mem-per-gpu
              options are mutually exclusive.

              NOTE: If the final amount of memory requested by a job can't be satisfied  by  any  of  the  nodes
              configured in the partition, the job will be rejected.  This could happen if --mem-per-cpu is used
              with the --exclusive option for a job allocation and --mem-per-cpu times the number of CPUs  on  a
              node is greater than the total memory of that node.

              NOTE: This applies to usable allocated CPUs in a job allocation.  This is important when more than
              one thread per core is configured.  If a job requests --threads-per-core with fewer threads  on  a
              core  than exist on the core (or --hint=nomultithread which implies --threads-per-core=1), the job
              will be unable to use those extra threads on the core and those threads will not  be  included  in
              the  memory  per  CPU  calculation.  But  if  the job has access to all threads on the core, those
              threads will be included in the memory per CPU calculation even if  the  job  did  not  explicitly
              request those threads.

              In the following examples, each core has two threads.

              In  this  first  example,  two  tasks  can  run  on separate hyperthreads in the same core because
              --threads-per-core is not used. The third task uses both threads of the second core. The allocated
              memory per cpu includes all threads:

              $ salloc -n3 --mem-per-cpu=100
              salloc: Granted job allocation 17199
              $ sacct -j $SLURM_JOB_ID -X -o jobid%7,reqtres%35,alloctres%35
                JobID                             ReqTRES                           AllocTRES
              ------- ----------------------------------- -----------------------------------
                17199     billing=3,cpu=3,mem=300M,node=1     billing=4,cpu=4,mem=400M,node=1

              In this second example, because of --threads-per-core=1, each task is allocated an entire core but
              is only able to use one thread per core.  Allocated  CPUs  includes  all  threads  on  each  core.
              However, allocated memory per cpu includes only the usable thread in each core.

              $ salloc -n3 --mem-per-cpu=100 --threads-per-core=1
              salloc: Granted job allocation 17200
              $ sacct -j $SLURM_JOB_ID -X -o jobid%7,reqtres%35,alloctres%35
                JobID                             ReqTRES                           AllocTRES
              ------- ----------------------------------- -----------------------------------
                17200     billing=3,cpu=3,mem=300M,node=1     billing=6,cpu=6,mem=300M,node=1

       --mem-per-gpu=<size>[units]
              Minimum  memory  required per allocated GPU.  Default units are megabytes.  Different units can be
              specified using the suffix [K|M|G|T].  Default value is DefMemPerGPU and is available  on  both  a
              global and per partition basis.  If configured, the parameters can be seen using the scontrol show
              config and scontrol show partition commands.   Also  see  --mem.   The  --mem,  --mem-per-cpu  and
              --mem-per-gpu options are mutually exclusive.

       --mincpus=<n>
              Specify a minimum number of logical cpus/processors per node.

       --network=<type>
              Specify  information  pertaining  to  the switch or network.  The interpretation of type is system
              dependent.  This option is supported when running Slurm on a Cray natively. It is used to  request
              using  Network  Performance  Counters.  Only one value per request is valid.  All options are case
              in-sensitive.  In this configuration supported values include:

              system
                    Use the system-wide network performance counters. Only nodes requested will be marked in use
                    for  the job allocation. If the job does not fill up the entire system the rest of the nodes
                    are not able to be used by other jobs  using  NPC,  if  idle  their  state  will  appear  as
                    PerfCnts. These nodes are still available for other jobs not using NPC.

              blade Use  the  blade network performance counters. Only nodes requested will be marked in use for
                    the job allocation. If the job does not fill up the entire blade(s)  allocated  to  the  job
                    those  blade(s)  are  not  able to be used by other jobs using NPC, if idle their state will
                    appear as PerfCnts. These nodes are still available for other jobs not using NPC.

              In all cases the job allocation request must specify the --exclusive option. Otherwise the request
              will be denied.

              Also  with  any  of these options steps are not allowed to share blades, so resources would remain
              idle inside an allocation if the step running on a blade does not take up all  the  nodes  on  the
              blade.

              The  network  option  is  also available on systems with HPE Slingshot networks. It can be used to
              request a job VNI (to be used for communication between job steps in a job). It also can  be  used
              to  override  the  default  network  resources  allocated for the job step. Multiple values may be
              specified in a comma-separated list.

              tcs=<class1>[:<class2>]...
                    Set of traffic classes  to  configure  for  applications.   Supported  traffic  classes  are
                    DEDICATED_ACCESS,  LOW_LATENCY,  BULK_DATA, and BEST_EFFORT. The traffic classes may also be
                    specified as TC_DEDICATED_ACCESS, TC_LOW_LATENCY, TC_BULK_DATA, and TC_BEST_EFFORT.

              no_vni
                    Don't allocate any VNIs for this job (even if multi-node).

              job_vni
                    Allocate a job VNI for this job.

              single_node_vni
                    Allocate a job VNI for this job, even if it is a single-node job.

              adjust_limits
                    If set, slurmd will set an upper bound on network resource reservations by taking  the  per-
                    NIC  maximum  resource  quantity  and  subtracting the reserved or used values (whichever is
                    higher) for any system network services; this is the default.

              no_adjust_limits
                    If set, slurmd will calculate network resource reservations based only upon the per-resource
                    configuration default and number of tasks in the application; it will not set an upper bound
                    on those reservation requests based on resource usage  of  already-existing  system  network
                    services.   Setting  this  will  mean  more application launches could fail based on network
                    resource exhaustion, but if the application absolutely needs a certain amount  of  resources
                    to function, this option will ensure that.

              disable_rdzv_get
                    Disable  rendezvous  gets  in  Slingshot  NICs,  which  can  improve performance for certain
                    applications.

              def_<rsrc>=<val>
                    Per-CPU reserved allocation for this resource.

              res_<rsrc>=<val>
                    Per-node reserved allocation for this resource.  If set, overrides the per-CPU allocation.

              max_<rsrc>=<val>
                    Maximum per-node limit for this resource.

              depth=<depth>
                    Multiplier for per-CPU resource allocation.  Default is the number of reserved CPUs  on  the
                    node.

              The resources that may be requested are:

              txqs  Transmit command queues. The default is 2 per-CPU, maximum 1024 per-node.

              tgqs  Target command queues. The default is 1 per-CPU, maximum 512 per-node.

              eqs   Event queues. The default is 2 per-CPU, maximum 2047 per-node.

              cts   Counters. The default is 1 per-CPU, maximum 2047 per-node.

              tles  Trigger list entries. The default is 1 per-CPU, maximum 2048 per-node.

              ptes  Portable table entries. The default is 6 per-CPU, maximum 2048 per-node.

              les   List entries. The default is 16 per-CPU, maximum 16384 per-node.

              acs   Addressing contexts. The default is 4 per-CPU, maximum 1022 per-node.

       --nice[=adjustment]
              Run  the  job  with  an  adjusted  scheduling  priority within Slurm. With no adjustment value the
              scheduling priority is decreased by 100. A negative nice value increases the  priority,  otherwise
              decreases it. The adjustment range is +/- 2147483645. Only privileged users can specify a negative
              adjustment.

       --no-bell
              Silence salloc's use of the terminal bell. Also see the option --bell.

       -k, --no-kill[=off]
              Do not automatically terminate a job if one of the nodes it has been  allocated  fails.  The  user
              will  assume the responsibilities for fault-tolerance should a node fail.  The job allocation will
              not be revoked so the user may launch new job steps on the remaining nodes  in  their  allocation.
              This  option  does  not set the SLURM_NO_KILL environment variable.  Therefore, when a node fails,
              steps running on that node will be  killed  unless  the  SLURM_NO_KILL  environment  variable  was
              explicitly set or srun calls within the job allocation explicitly requested --no-kill.

              Specify  an  optional  argument  of  "off" to disable the effect of the SALLOC_NO_KILL environment
              variable.

              By default Slurm terminates the entire job allocation if any node fails in its range of  allocated
              nodes.

       --no-shell
              immediately  exit  after  allocating  resources, without running a command. However, the Slurm job
              will still be created and will remain active and will own the allocated resources as long as it is
              active.   You  will have a Slurm job id with no associated processes or tasks. You can submit srun
              commands against this resource allocation, if you specify the --jobid= option with the job  id  of
              this Slurm job.  Or, this can be used to temporarily reserve a set of resources so that other jobs
              cannot use them for some period of time. (Note that  the  Slurm  job  is  subject  to  the  normal
              constraints  on  jobs,  including  time  limits, so that eventually the job will terminate and the
              resources will be freed, or you can terminate the job manually using the scancel command.)

       -F, --nodefile=<node_file>
              Much like --nodelist, but the list is contained in a file of name node file. The node names of the
              list  may  also span multiple lines in the file. Duplicate node names in the file will be ignored.
              The order of the node names in the list is not important; the node names will be sorted by Slurm.

       -w, --nodelist=<node_name_list>
              Request a specific list of hosts.  The job will contain all of these hosts and possibly additional
              hosts  as needed to satisfy resource requirements.  The list may be specified as a comma-separated
              list of hosts, a range of hosts (host[1-5,7,...] for example), or a filename.  The host list  will
              be  assumed  to  be  a  filename if it contains a "/" character.  If you specify a minimum node or
              processor count larger than can be satisfied by the supplied host list, additional resources  will
              be  allocated  on  other  nodes as needed.  Duplicate node names in the list will be ignored.  The
              order of the node names in the list is not important; the node names will be sorted by Slurm.

       -N, --nodes=<minnodes>[-maxnodes]|<size_string>
              Request that a minimum of minnodes nodes be allocated to this job.  A maximum node count may  also
              be specified with maxnodes.  If only one number is specified, this is used as both the minimum and
              maximum  node  count.  Node  count  can  be  also  specified  as  size_string.   The   size_string
              specification identifies what nodes values should be used.  Multiple values may be specified using
              a comma separated list or with a step function by suffix containing a colon and number values with
              a  "-"  separator.   For  example,  "--nodes=1-15:4"  is  equivalent  to  "--nodes=1,5,9,13".  The
              partition's node limits supersede those of the job.  If a job's node limits  are  outside  of  the
              range  permitted  for  its  associated  partition,  the job will be left in a PENDING state.  This
              permits possible execution at a later time, when the partition limit is changed.  If  a  job  node
              limit  exceeds  the  number  of nodes configured in the partition, the job will be rejected.  Note
              that the environment variable SLURM_JOB_NUM_NODES will be set  to  the  count  of  nodes  actually
              allocated  to  the  job. See the ENVIRONMENT VARIABLES  section for more information. If -N is not
              specified, the default behavior is to allocate enough nodes to satisfy the requested resources  as
              expressed  by per-job specification options, e.g. -n, -c and --gpus.  The job will be allocated as
              many nodes as possible within the range specified and without delaying the initiation of the  job.
              The  node  count specification may include a numeric value followed by a suffix of "k" (multiplies
              numeric value by 1,024) or "m" (multiplies numeric value by 1,048,576).

              NOTE: This option cannot be used in with arbitrary distribution.

       -n, --ntasks=<number>
              salloc does not launch tasks, it requests an allocation of resources and  executed  some  command.
              This  option  advises the Slurm controller that job steps run within this allocation will launch a
              maximum of number tasks and sufficient resources are allocated to accomplish this.  The default is
              one task per node, but note that the --cpus-per-task option will change this default.

       --ntasks-per-core=<ntasks>
              Request  the  maximum  ntasks be invoked on each core.  Meant to be used with the --ntasks option.
              Related to --ntasks-per-node except at the core level instead of the node level. This option  will
              be  inhertited  by srun.  Slurm may allocate more cpus than what was requested in order to respect
              this option.
              NOTE: This option is not supported when using SelectType=select/linear.  This  value  can  not  be
              greater than --threads-per-core.

       --ntasks-per-gpu=<ntasks>
              Request  that  there are ntasks tasks invoked for every GPU.  This option can work in two ways: 1)
              either specify --ntasks in  addition,  in  which  case  a  type-less  GPU  specification  will  be
              automatically  determined  to  satisfy  --ntasks-per-gpu,  or 2) specify the GPUs wanted (e.g. via
              --gpus or --gres) without specifying --ntasks, and the total  task  count  will  be  automatically
              determined.   The  number of CPUs needed will be automatically increased if necessary to allow for
              any calculated task count.  This option will implicitly set  --tres-bind=gres/gpu:single:<ntasks>,
              but  that  can  be overridden with an explicit --tres-bind=gres/gpu specification.  This option is
              not compatible with a node range (i.e. -N<minnodes-maxnodes>).  This option is not compatible with
              --gpus-per-task,  --gpus-per-socket,  or  --ntasks-per-node.   This option is not supported unless
              SelectType=cons_tres is configured (either directly or indirectly on Cray systems).

       --ntasks-per-node=<ntasks>
              Request that ntasks be invoked on each node.  If used  with  the  --ntasks  option,  the  --ntasks
              option  will take precedence and the --ntasks-per-node will be treated as a maximum count of tasks
              per node.  Meant to be used with the --nodes option.  This is  related  to  --cpus-per-task=ncpus,
              but  does  not  require  knowledge of the actual number of cpus on each node. In some cases, it is
              more convenient to be able to request that no more than a specific number of tasks be  invoked  on
              each  node.  Examples  of  this  include  submitting  a  hybrid  MPI/OpenMP app where only one MPI
              "task/rank" should be assigned to each node while allowing the OpenMP portion to  utilize  all  of
              the  parallelism  present in the node, or submitting a single setup/cleanup/monitoring job to each
              node of a pre-existing allocation as one step in a larger job script.

       --ntasks-per-socket=<ntasks>
              Request the maximum ntasks be invoked on each socket.  Meant to be used with the --ntasks  option.
              Related  to  --ntasks-per-node  except  at the socket level instead of the node level.  NOTE: This
              option is not supported when using SelectType=select/linear.

       -O, --overcommit
              Overcommit resources.

              When applied to a job allocation (not including jobs requesting exclusive access to the nodes) the
              resources  are  allocated as if only one task per node is requested. This means that the requested
              number of cpus per task (-c, --cpus-per-task) are allocated per node rather than being  multiplied
              by  the  number of tasks. Options used to specify the number of tasks per node, socket, core, etc.
              are ignored.

              When applied to job step allocations (the srun  command  when  executed  within  an  existing  job
              allocation),  this  option  can be used to launch more than one task per CPU.  Normally, srun will
              not allocate more than one process  per  CPU.   By  specifying  --overcommit  you  are  explicitly
              allowing  more  than  one  process  per  CPU.  However  no  more than MAX_TASKS_PER_NODE tasks are
              permitted to execute per node. NOTE: MAX_TASKS_PER_NODE is defined in the file slurm.h and is  not
              a variable, it is set at Slurm build time.

       -s, --oversubscribe
              The  job  allocation  can  over-subscribe  resources with other running jobs.  The resources to be
              over-subscribed can be nodes, sockets, cores, and/or hyperthreads  depending  upon  configuration.
              The   default  over-subscribe  behavior  depends  on  system  configuration  and  the  partition's
              OverSubscribe option takes precedence over the job's  option.   This  option  may  result  in  the
              allocation  being  granted  sooner than if the --oversubscribe option was not set and allow higher
              system utilization, but  application  performance  will  likely  suffer  due  to  competition  for
              resources.  Also see the --exclusive option.

              NOTE: This option is mutually exclusive with --exclusive.

       -p, --partition=<partition_names>
              Request  a  specific partition for the resource allocation. If not specified, the default behavior
              is to allow the slurm controller to select the default  partition  as  designated  by  the  system
              administrator. If the job can use more than one partition, specify their names in a comma separate
              list and the one offering earliest initiation will be used with no regard given to  the  partition
              name  ordering  (although  higher  priority partitions will be considered first).  When the job is
              initiated, the name of the partition used will be placed first in the job record partition string.

       --prefer=<list>
              Nodes can have features assigned to them by the Slurm administrator.  Users can specify  which  of
              these  features  are  desired  but not required by their job using the prefer option.  This option
              operates independently from --constraint and will override whatever  is  set  there  if  possible.
              When  scheduling,  the  features  in  --prefer are tried first. If a node set isn't available with
              those features then --constraint is attempted.  See --constraint for more information, this option
              behaves the same way.

       --priority=<value>
              Request  a  specific  job  priority.  May be subject to configuration specific constraints.  value
              should either be a numeric value or "TOP" (for highest possible value).  Only Slurm operators  and
              administrators can set the priority of a job.

       --profile={all|none|<type>[,<type>...]}
              Enables  detailed  data collection by the acct_gather_profile plugin.  Detailed data are typically
              time-series that are stored in an HDF5 file for the job or an InfluxDB database depending  on  the
              configured plugin.

              All       All data types are collected. (Cannot be combined with other values.)

              None      No data types are collected. This is the default.
                         (Cannot be combined with other values.)

       Valid type values are:

              Energy Energy data is collected.

              Task   Task (I/O, Memory, ...) data is collected.

              Lustre Lustre data is collected.

              Network
                     Network (InfiniBand) data is collected.

       -q, --qos=<qos>
              Request  a quality of service for the job. QOS values can be defined for each user/cluster/account
              association in the Slurm database.  Users will be limited to their association's  defined  set  of
              qos's  when  the  Slurm  configuration  parameter, AccountingStorageEnforce, includes "qos" in its
              definition.

       -Q, --quiet
              Suppress informational messages from salloc. Errors will still be displayed.

       --reboot
              Force the allocated nodes to reboot before starting the job.  This is  only  supported  with  some
              system  configurations  and will otherwise be silently ignored. Only root, SlurmUser or admins can
              reboot nodes.

       --reservation=<reservation_names>
              Allocate resources for the job from the named reservation. If  the  job  can  use  more  than  one
              reservation,  specify  their  names  in  a  comma  separate  list  and  the  one offering earliest
              initiation. Each reservation will be considered in the order it was requested.   All  reservations
              will  be  listed  in  scontrol/squeue  through  the  life  of  the  job.   In accounting the first
              reservation will be seen and after the job starts the reservation used will replace it.

       --resv-ports[=count]
              Reserve communication ports for this job. Users can specify  the  number  of  port  they  want  to
              reserve.  The parameter MpiParams=ports=12000-12999 must be specified in slurm.conf. If the number
              of reserved ports is zero then no ports are reserved. Used  for  native  Cray's  PMI  only.   This
              option  can only be used if the slurmstepd step management is enabled.  This option applies to job
              allocations. See --stepmgr.

       --segment=<segment_size>
              When a block topology is used, this defines the size of the segments that will be used  to  create
              the job allocation.  No requirement would be placed on all segments for a job needing to be placed
              within the same higher-level block.

              NOTE: The segment size must always be evenly divisible by the requested node count.

              NOTE: The segment size must be less than or equal to the planning base block  size.  E.g.,  for  a
              system with 30 nodes as the planning base block size, "--segment 40" would be invalid

       --signal=[R:]<sig_num>[@sig_time]
              When  a  job  is  within sig_time seconds of its end time, send it the signal sig_num.  Due to the
              resolution of event handling by Slurm, the signal may be  sent  up  to  60  seconds  earlier  than
              specified.   sig_num  may  either be a signal number or name (e.g. "10" or "USR1").  sig_time must
              have an integer value between 0 and 65535.  By default, no signal is sent  before  the  job's  end
              time.   If  a sig_num is specified without any sig_time, the default time will be 60 seconds.  Use
              the "R:" option to allow this job to overlap with a reservation with MaxStartDelay set.   To  have
              the signal sent at preemption time see the send_user_signal PreemptParameter.

       --sockets-per-node=<sockets>
              Restrict  node  selection  to  nodes with at least the specified number of sockets. See additional
              information under -B option above when task/affinity plugin is enabled.
              NOTE: This option may implicitly set the number of tasks (if -n was not specified) as one task per
              requested thread.

       --spread-job
              Spread  the  job  allocation over as many nodes as possible and attempt to evenly distribute tasks
              across the allocated nodes.  This option disables the topology/tree plugin.

       --stepmgr
              Enable slurmstepd step management per-job if it isn't enabled system wide.  This enables job steps
              to  be  managed  by  a  single  extern slurmstepd associated with the job to manage steps. This is
              beneficial for jobs that submit many steps inside their allocations. PrologFlags=contain  must  be
              set.

       --switches=<count>[@max-time]
              When  a tree topology is used, this defines the maximum count of leaf switches desired for the job
              allocation and optionally the maximum time to wait for that number of switches. If Slurm finds  an
              allocation  containing  more  switches  than the count specified, the job remains pending until it
              either finds an allocation with desired switch count or the time limit expires.  It  there  is  no
              switch  count  limit,  there  is  no  delay  in starting the job.  Acceptable time formats include
              "minutes",  "minutes:seconds",  "hours:minutes:seconds",  "days-hours",  "days-hours:minutes"  and
              "days-hours:minutes:seconds".   The  job's  maximum  time  delay  may  be  limited  by  the system
              administrator using the  SchedulerParameters  configuration  parameter  with  the  max_switch_wait
              parameter option.  On a dragonfly network the only switch count supported is 1 since communication
              performance will be highest when a job is allocate resources on one leaf switch  or  more  than  2
              leaf switches.  The default max-time is the max_switch_wait SchedulerParameters.

       --thread-spec=<num>
              Count  of  specialized  threads per node reserved by the job for system operations and not used by
              the application. The application will not use  these  threads,  but  will  be  charged  for  their
              allocation.  This option can not be used with the --core-spec option.

              NOTE:  Explicitly setting a job's specialized thread value implicitly sets its --exclusive option,
              reserving entire nodes for the job.

       --threads-per-core=<threads>
              Restrict node selection to nodes with at least the specified number of threads per core.  In  task
              layout, use the specified maximum number of threads per core. NOTE: "Threads" refers to the number
              of processing units on each core rather than the number of application tasks to  be  launched  per
              core.  See additional information under -B option above when task/affinity plugin is enabled.
              NOTE: This option may implicitly set the number of tasks (if -n was not specified) as one task per
              requested thread.

       -t, --time=<time>
              Set a limit on the total run time of the job allocation. If the requested time limit  exceeds  the
              partition's  time  limit,  the  job  will  be left in a PENDING state (possibly indefinitely). The
              default time limit is the partition's default time limit. When the time  limit  is  reached,  each
              task  in  each  job  step  is  sent  SIGTERM  followed by SIGKILL. The interval between signals is
              specified by the Slurm configuration parameter KillWait. The OverTimeLimit configuration parameter
              may  permit  the job to run longer than scheduled. Time resolution is one minute and second values
              are rounded up to the next minute.

              A time limit of zero requests that no time limit  be  imposed.  Acceptable  time  formats  include
              "minutes",  "minutes:seconds",  "hours:minutes:seconds",  "days-hours",  "days-hours:minutes"  and
              "days-hours:minutes:seconds".

       --time-min=<time>
              Set a minimum time limit on the job allocation.  If specified, the job may have its  --time  limit
              lowered to a value no lower than --time-min if doing so permits the job to begin execution earlier
              than otherwise possible.  The job's time limit will not be changed  after  the  job  is  allocated
              resources.   This  is performed by a backfill scheduling algorithm to allocate resources otherwise
              reserved for higher priority jobs.  Acceptable time formats include "minutes",  "minutes:seconds",
              "hours:minutes:seconds", "days-hours", "days-hours:minutes" and "days-hours:minutes:seconds".

       --tmp=<size>[units]
              Specify  a  minimum  amount  of  temporary  disk  space  per  node.   Default units are megabytes.
              Different units can be specified using the suffix [K|M|G|T].

       --tres-bind=<tres>:[verbose,]<type>[+<tres>:
              [verbose,]<type>...]  Specify a list of tres with their task binding options. Currently  gres  are
              the only supported tres for this options. Specify gres as "gres/<gres_name>" (e.g. gres/gpu)

              Example: --tres-bind=gres/gpu:verbose,map:0,1,2,3+gres/nic:closest

              By default, most tres are not bound to individual tasks

              Supported binding type options for gres:

              closest   Bind  each  task to the gres(s) which are closest.  In a NUMA environment, each task may
                        be bound to more than one gres (i.e.  all gres in that NUMA environment).

              map:<list>
                        Bind  by  setting  gres  masks  on  tasks  (or  ranks)  as  specified  where  <list>  is
                        <gres_id_for_task_0>,<gres_id_for_task_1>,...   gres  IDs  are  interpreted  as  decimal
                        values. If the number of tasks (or ranks) exceeds the number of elements in  this  list,
                        elements  in  the list will be reused as needed starting from the beginning of the list.
                        To simplify support for large task counts, the lists may follow a map with  an  asterisk
                        and  repetition count. For example "map:0*4,1*4".  If the task/cgroup plugin is used and
                        ConstrainDevices is set in  cgroup.conf,  then  the  gres  IDs  are  zero-based  indexes
                        relative to the gress allocated to the job (e.g. the first gres is 0, even if the global
                        ID is 3). Otherwise, the gres IDs are global IDs, and all gres on each node in  the  job
                        should be allocated for predictable binding results.

              mask:<list>
                        Bind  by  setting  gres  masks  on  tasks  (or  ranks)  as  specified  where  <list>  is
                        <gres_mask_for_task_0>,<gres_mask_for_task_1>,... The mapping is specified  for  a  node
                        and  identical mapping is applied to the tasks on every node (i.e. the lowest task ID on
                        each node is mapped to the first mask specified in  the  list,  etc.).  gres  masks  are
                        always  interpreted  as hexadecimal values but can be preceded with an optional '0x'. To
                        simplify support for large task counts, the lists may follow a map with an asterisk  and
                        repetition  count.  For example "mask:0x0f*4,0xf0*4".  If the task/cgroup plugin is used
                        and ConstrainDevices is set in cgroup.conf, then the gres  IDs  are  zero-based  indexes
                        relative  to the gres allocated to the job (e.g. the first gres is 0, even if the global
                        ID is 3). Otherwise, the gres IDs are global IDs, and all gres on each node in  the  job
                        should be allocated for predictable binding results.

              none      Do  not  bind  tasks  to  this gres (turns off implicit binding from --tres-per-task and
                        --gpus-per-task).

              per_task:<gres_per_task>
                        Each task will be bound to the number of gres specified in  <gres_per_task>.  Tasks  are
                        preferentially assigned gres with affinity to cores in their allocation like in closest,
                        though they will take any gres if they are unavailable. If no affinity exists, the first
                        task  will  be  assigned  the  first x number of gres on the node etc.  Shared gres will
                        prefer to bind one sharing device per task if possible.

              single:<tasks_per_gres>
                        Like closest, except that each task can only be bound to a single gres, even when it can
                        be  bound to multiple gres that are equally close.  The gres to bind to is determined by
                        <tasks_per_gres>, where the first <tasks_per_gres> tasks are bound  to  the  first  gres
                        available,  the  second  <tasks_per_gres>  tasks are bound to the second gres available,
                        etc.  This is basically a block distribution of tasks onto  available  gres,  where  the
                        available gres are determined by the socket affinity of the task and the socket affinity
                        of the gres as specified in gres.conf's Cores parameter.

                        NOTE: Shared gres binding is currently limited to per_task or none

       --tres-per-task=<list>
              Specifies a comma-delimited list of trackable resources required for the job on each  task  to  be
              spawned   in  the  job's  resource  allocation.   The  format  for  each  entry  in  the  list  is
              "trestype[/tresname]:count".  The trestype is the type of trackable resource requested (e.g.  cpu,
              gres,  license,  etc).   The  tresname  is the name of the trackable resource, as can be seen with
              sacctmgr show tres. This is required when it exists for tres types such  as  gres,  license,  etc.
              (e.g.  gpu,  gpu:a100).   In  order  to request a license with this option, the license(s) must be
              defined in the AccountingStorageTRES parameter of slurm.conf.  The count is the  number  of  those
              resources.
              The count can have a suffix of
              "k" or "K" (multiple of 1024),
              "m" or "M" (multiple of 1024 x 1024),
              "g" or "G" (multiple of 1024 x 1024 x 1024),
              "t" or "T" (multiple of 1024 x 1024 x 1024 x 1024),
              "p" or "P" (multiple of 1024 x 1024 x 1024 x 1024 x 1024).
              Examples:
              --tres-per-task=cpu:4
              --tres-per-task=cpu:8,license/ansys:1
              --tres-per-task=gres/gpu:1
              --tres-per-task=gres/gpu:a100:2
              The  specified  resources  will  be  allocated  to  the job on each node.  The available trackable
              resources are configurable by the system administrator.
              NOTE: This option with gres/gpu or gres/shard will  implicitly  set  --tres-bind=per_task:(gpu  or
              shard)<tres_per_task>; this can be overridden with an explicit --tres-bind specification.
              NOTE: Invalid TRES for --tres-per-task include bb,billing,energy,fs,mem,node,pages,vmem.

       --uid=<user>
              Attempt  to  submit  and/or run a job as user instead of the invoking user id. The invoking user's
              credentials will be used to check access permissions for the target partition. This option is only
              valid  for  user  root.  This option may be used by user root may use this option to run jobs as a
              normal user in a RootOnly partition for example. If run as root, salloc will drop its  permissions
              to  the  uid specified after node allocation is successful. user may be the user name or numerical
              user ID.

       --usage
              Display brief help message and exit.

       --use-min-nodes
              If a range of node counts is given, prefer the smaller count.

       -v, --verbose
              Increase the verbosity of salloc's informational messages. Multiple errors will be displayed.

       -V, --version
              Display version information and exit.

       --wait-all-nodes=<value>
              Controls when the execution of the command begins with respect to when nodes  are  ready  for  use
              (i.e.  booted).   By  default,  the  salloc command will return as soon as the allocation is made.
              This default can  be  altered  using  the  salloc_wait_nodes  option  to  the  SchedulerParameters
              parameter in the slurm.conf file.

              0    Begin execution as soon as allocation can be made.  Do not wait for all nodes to be ready for
                   use (i.e. booted).

              1    Do not begin execution until all nodes are ready for use.

       --wckey=<wckey>
              Specify wckey to be used with job. If TrackWCKey=no (default) in  the  slurm.conf  this  value  is
              ignored.

       --x11[={all|first|last}]
              Sets up X11 forwarding on "all", "first" or "last" node(s) of the allocation.  This option is only
              enabled if Slurm was compiled with X11 support and PrologFlags=x11 is defined in  the  slurm.conf.
              Default is "all".

PERFORMANCE

       Executing  salloc  sends a remote procedure call to slurmctld. If enough calls from salloc or other Slurm
       client commands that send remote procedure calls to the slurmctld daemon come in at once, it  can  result
       in a degradation of performance of the slurmctld daemon, possibly resulting in a denial of service.

       Do not run salloc or other Slurm client commands that send remote procedure calls to slurmctld from loops
       in shell scripts or other programs. Ensure that programs limit calls to salloc to the  minimum  necessary
       for the information you are trying to gather.

INPUT ENVIRONMENT VARIABLES

       Upon  startup,  salloc  will  read and handle the options set in the following environment variables. The
       majority of these variables are set the same way the options are set, as defined above. For flag  options
       that  are  defined  to  expect no argument, the option can be enabled by setting the environment variable
       without a value (empty or NULL string), the string 'yes', or a non-zero number. Any other value  for  the
       environment  variable  will  result  in the option not being set.  There are a couple exceptions to these
       rules that are noted below.
       NOTE: Command line options always override environment variables settings.

       SALLOC_ACCOUNT        Same as -A, --account

       SALLOC_ACCTG_FREQ     Same as --acctg-freq

       SALLOC_BELL           Same as --bell

       SALLOC_BURST_BUFFER   Same as --bb

       SALLOC_CLUSTERS or SLURM_CLUSTERS
                             Same as --clusters

       SALLOC_CONSTRAINT     Same as -C, --constraint

       SALLOC_CONTAINER      Same as --container.

       SALLOC_CONTAINER_ID   Same as --container-id.

       SALLOC_CORE_SPEC      Same as --core-spec

       SALLOC_CPUS_PER_GPU   Same as --cpus-per-gpu

       SALLOC_DEBUG          Same as -v, --verbose, when set to 1, when set to 2 gives -vv, etc.

       SALLOC_DELAY_BOOT     Same as --delay-boot

       SALLOC_EXCLUSIVE      Same as --exclusive

       SALLOC_GPU_BIND       Same as --gpu-bind

       SALLOC_GPU_FREQ       Same as --gpu-freq

       SALLOC_GPUS           Same as -G, --gpus

       SALLOC_GPUS_PER_NODE  Same as --gpus-per-node

       SALLOC_GPUS_PER_TASK  Same as --gpus-per-task

       SALLOC_GRES           Same as --gres

       SALLOC_GRES_FLAGS     Same as --gres-flags

       SALLOC_HINT or SLURM_HINT
                             Same as --hint

       SALLOC_IMMEDIATE      Same as -I, --immediate

       SALLOC_KILL_CMD       Same as -K, --kill-command

       SALLOC_MEM_BIND       Same as --mem-bind

       SALLOC_MEM_PER_CPU    Same as --mem-per-cpu

       SALLOC_MEM_PER_GPU    Same as --mem-per-gpu

       SALLOC_MEM_PER_NODE   Same as --mem

       SALLOC_NETWORK        Same as --network

       SALLOC_NO_BELL        Same as --no-bell

       SALLOC_NO_KILL        Same as -k, --no-kill

       SALLOC_OVERCOMMIT     Same as -O, --overcommit

       SALLOC_PARTITION      Same as -p, --partition

       SALLOC_POWER          Same as --power

       SALLOC_PROFILE        Same as --profile

       SALLOC_QOS            Same as --qos

       SALLOC_REQ_SWITCH     When a tree topology is used, this defines the maximum count  of  switches  desired
                             for  the  job allocation and optionally the maximum time to wait for that number of
                             switches. See --switches.

       SALLOC_RESERVATION    Same as --reservation

       SALLOC_SIGNAL         Same as --signal

       SALLOC_SPREAD_JOB     Same as --spread-job

       SALLOC_THREAD_SPEC    Same as --thread-spec

       SALLOC_THREADS_PER_CORE
                             Same as --threads-per-core

       SALLOC_TIMELIMIT      Same as -t, --time

       SALLOC_TRES_BIND      Same as --tres-bind

       SALLOC_TRES_PER_TASK  Same as --tres-per-task

       SALLOC_USE_MIN_NODES  Same as --use-min-nodes

       SALLOC_WAIT_ALL_NODES Same as --wait-all-nodes. Must be set to 0 or 1 to disable or enable the option.

       SALLOC_WAIT4SWITCH    Max time waiting for requested switches. See --switches

       SALLOC_WCKEY          Same as --wckey

       SLURM_CONF            The location of the Slurm configuration file.

       SLURM_DEBUG_FLAGS     Specify debug flags for salloc to use. See DebugFlags in the slurm.conf(5) man page
                             for  a  full  list  of  flags.  The  environment variable takes precedence over the
                             setting in the slurm.conf.

       SLURM_EXIT_ERROR      Specifies the exit code generated when a Slurm error occurs (e.g. invalid options).
                             This  can  be  used  by a script to distinguish application exit codes from various
                             Slurm error conditions.  Also see SLURM_EXIT_IMMEDIATE.

       SLURM_EXIT_IMMEDIATE  Specifies the exit code generated when the --immediate option is used and resources
                             are  not  currently  available.   This  can  be  used  by  a  script to distinguish
                             application  exit  codes  from  various   Slurm   error   conditions.    Also   see
                             SLURM_EXIT_ERROR.

OUTPUT ENVIRONMENT VARIABLES

       salloc will set the following environment variables in the environment of the executed program:

       SLURM_*_HET_GROUP_#
              For  a  heterogeneous  job  allocation,  the  environment  variables  are  set separately for each
              component.

       SLURM_CLUSTER_NAME
              Name of the cluster on which the job is executing.

       SLURM_CONTAINER
              OCI Bundle for job.  Only set if --container is specified.

       SLURM_CONTAINER_ID
              OCI id for job.  Only set if --container-id is specified.

       SLURM_CPUS_PER_GPU
              Number of CPUs requested per allocated GPU.  Only set if the --cpus-per-gpu option is specified.

       SLURM_CPUS_PER_TASK
              Number of CPUs requested per  task.   Only  set  if  either  the  --cpus-per-task  option  or  the
              --tres-per-task=cpu:# option is specified.

       SLURM_DIST_PLANESIZE
              Plane distribution size. Only set for plane distributions.  See -m, --distribution.

       SLURM_DISTRIBUTION
              Only set if the -m, --distribution option is specified.

       SLURM_GPU_BIND
              Requested binding of tasks to GPU.  Only set if the --gpu-bind option is specified.

       SLURM_GPU_FREQ
              Requested GPU frequency.  Only set if the --gpu-freq option is specified.

       SLURM_GPUS
              Number of GPUs requested.  Only set if the -G, --gpus option is specified.

       SLURM_GPUS_PER_NODE
              Requested GPU count per allocated node.  Only set if the --gpus-per-node option is specified.

       SLURM_GPUS_PER_SOCKET
              Requested GPU count per allocated socket.  Only set if the --gpus-per-socket option is specified.

       SLURM_HET_SIZE
              Set to count of components in heterogeneous job.

       SLURM_JOB_ACCOUNT
              Account name associated of the job allocation.

       SLURM_JOB_CPUS_PER_NODE
              Count  of  CPUs  available  to  the  job  on  the  nodes  in  the  allocation,  using  the  format
              CPU_count[(xnumber_of_nodes)][,CPU_count     [(xnumber_of_nodes)]     ...].      For      example:
              SLURM_JOB_CPUS_PER_NODE='72(x2),36'  indicates  that  on  the first and second nodes (as listed by
              SLURM_JOB_NODELIST) the allocation has 72 CPUs, while the third  node  has  36  CPUs.   NOTE:  The
              select/linear  plugin  allocates  entire  nodes to jobs, so the value indicates the total count of
              CPUs on allocated nodes. The select/cons_tres plugin allocates individual CPUs to  jobs,  so  this
              number indicates the number of CPUs allocated to the job.

       SLURM_JOB_END_TIME
              The UNIX timestamp for a job's projected end time.

       SLURM_JOB_GPUS
              The  global  GPU IDs of the GPUs allocated to this job. The GPU IDs are not relative to any device
              cgroup, even if devices are constrained with task/cgroup.  Only set in batch and interactive jobs.

       SLURM_JOB_ID
              The ID of the job allocation.

       SLURM_JOB_NODELIST
              List of nodes allocated to the job.

       SLURM_JOB_NUM_NODES
              Total number of nodes in the job allocation.

       SLURM_JOB_PARTITION
              Name of the partition in which the job is running.

       SLURM_JOB_QOS
              Quality Of Service (QOS) of the job allocation.

       SLURM_JOB_RESERVATION
              Advanced reservation containing the job allocation, if any.

       SLURM_JOB_START_TIME
              UNIX timestamp for a job's start time.

       SLURM_JOBID
              The ID of the job allocation. See SLURM_JOB_ID. Included for backwards compatibility.

       SLURM_MEM_BIND
              Set to value of the --mem-bind option.

       SLURM_MEM_BIND_LIST
              Set to bit mask used for memory binding.

       SLURM_MEM_BIND_PREFER
              Set to "prefer" if the --mem-bind option includes the prefer option.

       SLURM_MEM_BIND_SORT
              Sort free cache pages (run zonesort on Intel KNL nodes)

       SLURM_MEM_BIND_TYPE
              Set to the memory binding type specified with the --mem-bind option.  Possible values are  "none",
              "rank", "map_map", "mask_mem" and "local".

       SLURM_MEM_BIND_VERBOSE
              Set to "verbose" if the --mem-bind option includes the verbose option.  Set to "quiet" otherwise.

       SLURM_MEM_PER_CPU
              Same as --mem-per-cpu

       SLURM_MEM_PER_GPU
              Requested memory per allocated GPU.  Only set if the --mem-per-gpu option is specified.

       SLURM_MEM_PER_NODE
              Same as --mem

       SLURM_NNODES
              Total  number  of  nodes  in  the job allocation. See SLURM_JOB_NUM_NODES.  Included for backwards
              compatibility.

       SLURM_NODELIST
              List of nodes allocated to the job. See SLURM_JOB_NODELIST. Included for backwards compatibility.

       SLURM_NPROCS
              Set to value of the --ntasks option, if specified. Or,  if  either  of  the  --ntasks-per-node  or
              --ntasks-per-gpu  options are specified, set to the number of tasks in the job.  See SLURM_NTASKS.
              Included for backwards compatibility.

       SLURM_NTASKS
              Set to value of the --ntasks option, if specified. Or,  if  either  of  the  --ntasks-per-node  or
              --ntasks-per-gpu options are specified, set to the number of tasks in the job.

       SLURM_NTASKS_PER_CORE
              Set to value of the --ntasks-per-core option, if specified.

       SLURM_NTASKS_PER_GPU
              Set to value of the --ntasks-per-gpu option, if specified.

       SLURM_NTASKS_PER_NODE
              Set to value of the --ntasks-per-node option, if specified.

       SLURM_NTASKS_PER_SOCKET
              Set to value of the --ntasks-per-socket option, if specified.

       SLURM_OVERCOMMIT
              Set to 1 if --overcommit was specified.

       SLURM_PROFILE
              Same as --profile

       SLURM_SHARDS_ON_NODE
              Number of GPU Shards available to the step on this node.

       SLURM_SUBMIT_DIR
              The  directory from which salloc was invoked or, if applicable, the directory specified by the -D,
              --chdir option.

       SLURM_SUBMIT_HOST
              The hostname of the computer from which salloc was invoked.

       SLURM_TASKS_PER_NODE
              Number of tasks to be initiated on each node. Values are comma separated and in the same order  as
              SLURM_JOB_NODELIST.   If two or more consecutive nodes are to have the same task count, that count
              is   followed   by   "(x#)"   where    "#"    is    the    repetition    count.    For    example,
              "SLURM_TASKS_PER_NODE=2(x3),1"  indicates  that  the first three nodes will each execute two tasks
              and the fourth node will execute one task.

       SLURM_THREADS_PER_CORE
              This is only set if --threads-per-core or SALLOC_THREADS_PER_CORE were specified. The  value  will
              be  set  to  the value specified by --threads-per-core or SALLOC_THREADS_PER_CORE. This is used by
              subsequent srun calls within the job allocation.

       SLURM_TRES_PER_TASK
              Set to the value of --tres-per-task. If --cpus-per-task or --gpus-per-task  is  specified,  it  is
              also set in SLURM_TRES_PER_TASK as if it were specified in --tres-per-task.

SIGNALS

       While  salloc  is  waiting  for  a  PENDING  job allocation, most signals will cause salloc to revoke the
       allocation request and exit.

       However if the allocation has been granted and salloc has already started  the  specified  command,  then
       salloc will ignore most signals.  salloc will not exit or release the allocation until the command exits.
       One notable exception is SIGHUP. A SIGHUP signal will cause salloc to release  the  allocation  and  exit
       without  waiting for the command to finish.  Another exception is SIGTERM, which will be forwarded to the
       spawned process.

EXAMPLES

       To get an allocation, and open a new xterm in which srun commands may be typed interactively:

              $ salloc -N16 xterm
              salloc: Granted job allocation 65537
              # (at this point the xterm appears, and salloc waits for xterm to exit)
              salloc: Relinquishing job allocation 65537

       To grab an allocation of nodes and launch a parallel application on one command line:

              $ salloc -N5 srun -n10 myprogram

       To create a heterogeneous job with 3 components, each allocating a unique set of nodes:

              $ salloc -w node[2-3] : -w node4 : -w node[5-7] bash
              salloc: job 32294 queued and waiting for resources
              salloc: job 32294 has been allocated resources
              salloc: Granted job allocation 32294

COPYING

       Copyright (C) 2006-2007 The Regents of the University of  California.   Produced  at  Lawrence  Livermore
       National Laboratory (cf, DISCLAIMER).
       Copyright (C) 2008-2010 Lawrence Livermore National Security.
       Copyright (C) 2010-2022 SchedMD LLC.

       This    file    is    part    of    Slurm,   a   resource   management   program.    For   details,   see
       <https://slurm.schedmd.com/>.

       Slurm is free software; you can redistribute it and/or modify it under  the  terms  of  the  GNU  General
       Public License as published by the Free Software Foundation; either version 2 of the License, or (at your
       option) any later version.

       Slurm is distributed in the hope that it will be useful, but  WITHOUT  ANY  WARRANTY;  without  even  the
       implied  warranty  of  MERCHANTABILITY  or  FITNESS  FOR A PARTICULAR PURPOSE. See the GNU General Public
       License for more details.

SEE ALSO

       sinfo(1), sattach(1), sbatch(1), squeue(1),  scancel(1),  scontrol(1),  slurm.conf(5),  sched_setaffinity
       (2), numa (3)