xenial (5) gres.conf.5.gz

Provided by: slurm-client_15.08.7-1build1_amd64 bug

NAME

       gres.conf - Slurm configuration file for generic resource management.

DESCRIPTION

       gres.conf  is an ASCII file which describes the configuration of generic resources on each
       compute node. Each node must contain a gres.conf file  if  generic  resources  are  to  be
       scheduled  by  Slurm.   The  file  location can be modified at system build time using the
       DEFAULT_SLURM_CONF parameter or at execution time by setting  the  SLURM_CONF  environment
       variable. The file will always be located in the same directory as the slurm.conf file. If
       generic resource counts are set by the gres plugin function node_config_load(), this  file
       may be optional.

       Parameter  names are case insensitive.  Any text following a "#" in the configuration file
       is treated as a comment through the end of that line.  Changes to the  configuration  file
       take  effect  upon  restart  of  Slurm  daemons,  daemon  receipt of the SIGHUP signal, or
       execution of the command "scontrol reconfigure" unless otherwise noted.

       The overall configuration parameters available include:

       Count  Number of resources of this type available on this node.  The default value is  set
              to  the  number  of  File values specified (if any), otherwise the default value is
              one. A suffix of "K", "M", "G", "T" or "P" may be used to multiply  the  number  by
              1024, 1048576, 1073741824, etc. respectively.

       CPUs   Specify  the  CPU  index numbers for the specific CPUs which can use this resource.
              For example, it may be strongly preferable  to  use  specific  CPUs  with  specific
              devices (e.g. on a NUMA architecture). Multiple CPUs may be specified using a comma
              delimited list or a range may be specified using a "-" separator (e.g. "0,1,2,3" or
              "0-3").   If  specified,  then  only the identified CPUs can be allocated with each
              generic resource; an attempt to use  other  CPUs  will  not  be  honored.   If  not
              specified,  then  any  CPU can be used with the resources, which also increases the
              speed of Slurm's scheduling algorithm.  If any CPU can be effectively used with the
              resources,  then  do  not  specify  the CPUs option for improved speed in the Slurm
              scheduling logic.

              Since Slurm must be able to perform resource management on  heterogeneous  clusters
              having  various  CPU  ID  numbering  schemes,  use the Slurm CPU index numbers here
              (CPU_ID = Board_ID x threads_per_board + Socket_ID x threads_per_socket + Core_ID x
              threads_per_core + Thread_ID).

       File   Fully  qualified pathname of the device files associated with a resource.  The file
              name parsing logic includes support for simple regular expressions as shown in  the
              example.   This  field  is  generally  required  if enforcement of generic resource
              allocations is to be supported (i.e. prevents a users from making use of  resources
              allocated to a different user).  If File is specified then Count must be either set
              to the number of file names specified or not set (the default value is  the  number
              of files specified).  Slurm must track the utilization of each individual device If
              device file names are specified, which involves more overhead  than  just  tracking
              the  device counts.  Use the File parameter only if the Count is not sufficient for
              tracking purposes.  NOTE: If you specify the File parameter for a resource on  some
              node, the option must be specified on all nodes and Slurm will track the assignment
              of each specific resource on each node. Otherwise Slurm will only track a count  of
              allocated resources rather than the state of each individual device file.

       Name   Name  of the generic resource. Any desired name may be used.  Each generic resource
              has an optional  plugin  which  can  provide  resource-specific  options.   Generic
              resources that currently include an optional plugin are:

              gpu    Graphics Processing Unit

              nic    Network Interface Card

              mic    Intel Many Integrated Core (MIC) processor

       NodeName
              An  optional  NodeName specification can be used to permit one gres.conf file to be
              used for all compute nodes in a cluster by specifying the node(s)  that  each  line
              should apply to.  The NodeName specification can use a Slurm hostlist specification
              as shown in the example below.

       Type   An arbitrary string identifying the type of  device.   For  example,  a  particular
              model  of  GPU.   If  Type  is  specified, then Count is limited in size (currently
              1024).

EXAMPLES

       ##################################################################
       # Slurm's Generic Resource (GRES) configuration file
       ##################################################################
       # Configure support for our four GPUs
       Name=gpu Type=gtx560 File=/dev/nvidia0 CPUs=0,1
       Name=gpu Type=gtx560 File=/dev/nvidia1 CPUs=0,1
       Name=gpu Type=tesla  File=/dev/nvidia2 CPUs=2,3
       Name=gpu Type=tesla  File=/dev/nvidia3 CPUs=2,3
       Name=bandwidth Count=20M

       ##################################################################
       # Slurm's Generic Resource (GRES) configuration file
       # Use a single gres.conf file for all compute nodes
       ##################################################################
       NodeName=tux[0-15]  Name=gpu File=/dev/nvidia[0-3]
       NodeName=tux[16-31] Name=gpu File=/dev/nvidia[0-7]

COPYING

       Copyright (C) 2010 The Regents of the University  of  California.   Produced  at  Lawrence
       Livermore National Laboratory (cf, DISCLAIMER).
       Copyright (C) 2010-2014 SchedMD LLC.

       This   file   is  part  of  Slurm,  a  resource  management  program.   For  details,  see
       <http://slurm.schedmd.com/>.

       Slurm is free software; you can redistribute it and/or modify it under the  terms  of  the
       GNU  General Public License as published by the Free Software Foundation; either version 2
       of the License, or (at your option) any later version.

       Slurm is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without
       even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
       GNU General Public License for more details.

SEE ALSO

       slurm.conf(5)