Provided by: simrisc_14.05.00-4_amd64 bug

NAME

       simrisc - This program performs simulations in the context of breast cancer

SYNOPSIS

       simrisc [options] analyses

       The  analyses  argument  is  the name of the file specifying the analyses to perform.  See
       section ANALYSES for details.

DESCRIPTION

       Simrisc was originally designed around 2010 by Marcel Greuter at  the  University  Medical
       Center Groningen, and thereafter modified in 2015 by Chris de Jonge.

Changes introduced in version 14.04.00

       o      Parameters affected by spread: true
              Parameters   that  may  vary  are  specified  using  triplets:  value,  spread  and
              distribution. In all cases the spread values and distribution names  are  optional:
              they  can  both  be  omitted or both must be specified. If these parameters are not
              specified then their value parameter won’t vary if spread: true is specified;

       o      The Mammo, Tomo, and MRI modalities are  provided  with  std.dev  and  distribution
              parameters for their Dose, M, Beta, Specificity, and Sensitivity parameters;

       o      When  spread: true is specified the actually used and original parameter values are
              listed in a file, by  default  spread-$.txt,  where  $  is  replaced  by  the  loop
              iteration  index.  Use  the  option  -s  to  specify  a  non-default  filename (cf.
              simrisc(1));

       o      Age ranges no longer have trailing colons;

       o      The Case-specific data matrix defines an extra (18th) column, showing  the  results
              of the screening rounds for each simulated case;

       o      The order of the beir7 beta and eta parameters is reversed: eta is specified first,
              followed by beta. The spread and distribution parameters following  beta  apply  to
              beta, and not to eta, which is a fixed value.

OPTIONS

       Short  options  are  provided between parentheses, immediately following their long option
       equivalents. Several parameters specify the path-names of files produced by simrisc. If  a
       path-name  starts with a tilde character (~) then the tilde is replaced by the user’s home
       directory. An initial + is replaced by the program’s base  directory  (see  option  base).
       When  an  analysis uses multiple iterations then `$’ characters in filename specifications
       are replaced by the analysis’ interation index.

       All single-letter options referring to filesystem  entries  (directories,  filenames)  are
       capitalized, all other single-letter options are lowercase.

       o      --base=basedir (-B)
              the  base  directory  where  the  output  files  will be written. By default ./. If
              basedir doesn’t exist it is created by the program.  If  the  directory  cannot  be
              created   and   exception   is   thrown,   terminating  the  program.  The  basedir
              specifications may specify relative or absolute directory locations;

       o      --config=path (-C)
              the location of the configuration file. By default ~/.config/simrisc is used.

       o      --data=path (-D)
              path name of the file to contain the data of the cases generated by the  simulation
              (default:  ’<base>/data-$.txt’).  If  a  data  file should not be written specify !
              (mnemonic: the logical not operator, i.e., --data !).  See  section  OUTPUT  for  a
              description of the generated data;

       o      --death-age=age (-a)
              run  one  simulation using a specific natural death-age.  This option also requires
              the specification of tumor-age, and is mutually exclusive with the case option;

       o      --err (-e)
              before version 14.05.00 the Beir7 risk  vector  was  computed  using  an  incorrect
              algorithm. The error was fixed in simrisc 14.05.00.  When it is required to use the
              erroneous algorithm, e.g., to compare the results of  simulations  with  previously
              obtained  results,  option  -e  must be specified. However, option -e should not be
              used for real simulations;

       o      --help (-h)
              shows help information and terminates;

       o      --last-case=nCases (-l)
              perform simulations until nCases cases have been analyzed and only write  the  data
              for  the  final case to the data file. The rounds and sensitivity files contain the
              summarized results of all nCases analyzed cases;

       o      --one-analysis (-o)
              the program’s arguments specify the parameters of a single  analysis,  rather  than
              the  name  of  an analyses-specification file. The program’s arguments are optional
              and are used to alter the parameter values as defined in  the  config  file  or  to
              define label specifications. See section ANALYSES for details;

       o      --parameters=path (-P)
              path  name  of  the  file  showing  the  actually used parameter specifications. By
              default no parameter file is written;

       o      --rounds=path (-R)
              path name of the file to containing the  summary  info  of  the  simulation  rounds
              (default:  ’<base>/rounds-$.txt’). If a rounds file should not be written specify !
              (i.e., --rounds !). See section OUTPUT for a description of the  generated  summary
              info;

       o      --spread=path (-s)
              path  name  of  the file to contain the actually used and original parameter values
              when spread: true is  specified  (default:  ’<base>/spread-$.txt’).  If  this  file
              should not be written specify ! (mnemonic: the logical not operator, i.e., --spread
              !). See section OUTPUT for a sample of its content;

       o      --sensitivity=path (-S)
              path name  of  the  file  to  containing  the  summary  info  of  the  simulation’s
              sensitivity  data (default: ’<base>/sensitivity.txt’). If a sensitivity file should
              not be written specify  !  (i.e.,  --sensitivity  !).  See  section  OUTPUT  for  a
              description of the produced sensitivity summary;

       o      --tumor-age=age (-t)
              run  one  simulation  using  a  specific  tumor  self-detect age.  This option also
              requires the specification of death-age, and is mutually exclusive  with  the  case
              option;

       o      --verbose (-V)
              provides additional information while running;

       o      --version (-v)
              shows simrisc’s version information and terminates;

ANALYSES

       Unless  the  one-analysis option is used the program’s first and only required argument is
       the name of a file providing the details of the  analyses  to  perform.  Those  files  are
       called analysis files and must be standard ascii text files. I.e., they only contain 7-bit
       ascii printable and white-space characters. The identifiers used in analysis files and  in
       configuration files are interpreted case sensitively.

       Parameter  specifications  starting  with uppercase letters (like Analysis: and Scenario:)
       specify (sub)sections and contain no additional  specifications.  Specifications  starting
       with  lowercase  letters  (like  ageGroup:) are followed by actual parameter values. For a
       complete overview refer to the simriscparams(7) man-page.

       Analysis files may define multiple analyses. Each analysis specification must begin with a
       line containing

           Analysis:

       At  each Analysis: specification the program’s initial configuration is reset: the default
       option values are used unless  redefined  by  explicitly  provided  command-line  options.
       Explicitly  provided command-line options cannot be altered by specifications in Analysis:
       sections and remain active while simrisc is running.

       Following Analysis: lines the  characteristics  of  the  analysis  are  specified.   These
       specifications may, in the following order:

       o      define  label:  lines  (label:  lines, when used, must immediately follow Analysis:
              lines). The text following label: lines is written at the top of the output files;

       o      alter default simrisc options;

       o      redefine parameter specifications of configuration files;

       All specifications in Analysis: sections are optional.  An Analysis: specification  merely
       containing  the  line  Analysis:  defines  an  analysis  using  the  explicitly  specified
       command-line options or the default program options and using the parameter specifications
       provided in the configuration file.

       Empty  lines,  trailing white-space, and all characters on lines starting at the hash-mark
       (#) are ignored and may be used anywhere in analysis files.

       Lines not conforming to the above description result in error messages, causing simrisc to
       end.

       When  parameters  of  configuration  file sections (cf. simriscparams(7)) are omitted from
       Analysis: sections then the parameters as specified in the configuration file are used.

       When program options are specified their long option names must be used. E.g.:

           base:       /tmp/
           last-case:  20

       Command-line options always overrule options specified in analysis files.

       Multiple analysis sections should not use identically named output files,  as  the  output
       files are (re)written for each separate analysis.

       Analysis   sections  are  commonly  used  to  alter  the  default  specifications  of  the
       configuration file. E.g., the default number of iterations equals 1. By specifying

           iterations: 3

       the analysis performs 3 iterations.

       Parameters are either read from the configuration file or they are redefined in  Analysis:
       sections.  E.g., in de provided configuration file screening rounds use two-year intervals
       between the ages of 50 and 74. To use screening rounds  using  5-year  intervals,  between
       ages 50 and 65, then an Analysis: specification could be, e.g.,

       Screening:
           round:     50  Mammo MRI
           round:     55  Mammo MRI
           round:     60  Mammo MRI
           round:     65  Mammo MRI

       When   the   --one-analysis  option  is  used  parameters  may  be  altered  by  providing
       comma-separated parameter specifications  as  program  command-line  arguments.  E.g.,  to
       perform one analysis, writing the data file to /tmp/data, simulating 1000 cases, and using
       20 as seed for the random number generator the command

           simrisc -D /tmp/data -o Scenario:, cases: 1000, seed: 20

       can be used. Note that when using the one-analysis option  parameter  section  names  must
       precede parameter specifications. E.g., since the parameters cases and seed are defined in
       the `Scenario’ section (cf. simriscparams(7)) they  must  be  preceded  by  the  Scenario:
       specification.

       Subsequent Analysis: specifications in analysis files use the program options as specified
       on the command line (or, if not specified, the  default  program  options)  and  uses  the
       configuration file’s parameter specifications. So when an Analysis: specification modifies
       parameters, then subsequent Analysis:  sections  start  from  the  unmodified  option  and
       parameter specifications.

       When  Analysis:  sections specify base: directory locations then file specifications using
       an initial `+’ character to represent the base directory (like --data +/data.txt) use  the
       base directory specified at the Analysis:’s base: specification.

OUTPUT

       The  first lines of the generated files contain time stamps showing the date and time when
       the files were written and the used SimRisc version. Here is an example, following the RFC
       2822 format for the timestamp:

           Thu, 08 Apr 2021 21:43:09 +0200 (SimRisc V. 14.04.00)

       If  label:  lines  are  used  then the time stamp is followed by the label specifications,
       which is then followed by an empty line. After this header the file’s  specific  data  are
       shown.

       The  data  in  all files (except for the file listing the actually used parameters (option
       --parameters (P))) are written using the standard comma-separated format (cf.  RFC  4180).
       The initial lines contain table headings and column labels documenting the meanings of the
       various columns. Likewise there is a final line ending the tables.

       Data of simulated cases

       For each simulated case the values of the following variables are  written  to  file  (one
       line of comma-separated values per simulated case):

       o      case: the (0-based) case-index;

       o      cause of death: either Natural or Tumor;

       o      death age: the case’s age of death;

       o      natural death age: the case’s natural age of death (if no tumor occurs);

       o      death status: a numeric index specifying how and at what stage the case died:
              1: natural death in the pre-screening phase,
              2: natural death in the screening phase,
              3: natural death in the post-screening phase,
              4: tumor caused death in the pre-screening phase,
              5: tumor caused death in the screening phase,
              6: tumor caused death in the post-screening phase;

       o      tumor present: Yes if the simulation resulted in a tumor, No if no tumor occurred;

       o      tumor detected: Yes if the tumor was detected, No if not;

       o      interval tumor: Yes if the tumor was an interval tumor, No if not;

       o      tumor  diameter:  the tumor’s diameter in mm when it was detected. 0.00 is shown if
              no tumor occurred. In the exceptional case where the simulation  produced  a  tumor
              whose diameter exceeded 1000 mm the value 1001 is shown.

       o      tumor doubling days: the time (in days) it takes for the tumor to double its size;

       o      tumor  preclinical  period: the age at which the tumor is potentially detectible by
              Mammographic screening;

       o      tumor onset age: the age at which the tumor first occurred;

       o      tumor self-detect age: the age at which the tumor was self-detected.  This  age  is
              the  result  of  the simulation, and may exceed the case’s actual death age (if so,
              the case’s data report that no tumor is present);

       o      tumor death age: the age at which the tumor caused or would have caused  he  case’s
              death.  The  simulation process uses ages ranging from 0 through 100. If the age at
              which the tumor causes the case’s death exceeds 100, then 100.00 is reported;

       o      costs: the case’s screening and (if appliccable) treatment costs;

       o      self-detection indicator: 1 if the tumor was  self-detected,  0  if  not  (also  if
              there’s no tumor);

       o      detection  round: 0-based round index at which the tumor was detected (or 1) if the
              tumor was self-detected, 0 if not (also if there’s no tumor).

       o      screening rounds: this column contains show which screening rounds were attended by
              the  simulated  cases, and if so whether false negative or false positive diagnoses
              were made. The following digits are used:

       o      0: the case did not attend this screening round;

       o      1: the case did attend this screening round;

       o      2: the case did  attend  this  screening  round,  resulting  in  a  false  negative
              diagnosis;

       o      3:  the  case  did  attend  this  screening  round,  resulting  in a false positive
              diagnosis.  There are as many digits as screening rounds. The leftmost digit refers
              to  the  first  screening  round,  the rightmost digit to the last screening round.
              E.g., using 12 screening rounds the following indicators could be obtained:

                  0011311110000

              Using screening round indices (which are also  used  to  refer  to  rounds  in  the
              rounds-$.txt  files), this case did not attent screening rounds 0, 1, 9, 10, 11 and
              12, and at 4 a false positive diagnosis was obtained.

       Actually used spread-values

       When spread: true is specified then by default the actually used and  orgiginal  parameter
       values  are  written to the file spread-$.txt, where $ is replaced by the loop’s iteration
       index. Here is a sample from the content of such a file, showing the  values  of  modality
       Mammo’s Beta parameters:

           Beta:
             nr: 1  using -4.37906 (configured: -4.38)
             nr: 2  using 0.490436 (configured: 0.49)
             nr: 3  using -1.33749 (configured: -1.34)
             nr: 4  using -7.22025 (configured: -7.18)

FILES

       o      ~/.config/simrisc: the default location of the program’s configuration file;

       o      the  simrisc  distribution  archive  contains  the  default  configuration  file as
              simrisc-VERSION/stdconfig/simrisc, where VERSION is replaced  by  simrisc’s  actual
              release version;

       o      when  installing  simrisc  using Linux distribution archives (e.g., .deb files) the
              default      configuration      file      is      commonly       available       as
              /usr/shared/doc/simrisc/simrisc.gz

SEE ALSO

       simriscparams(7)

COPYRIGHT

       This  is  free  software,  distributed  under  the terms of the GNU General Public License
       (GPL).

AUTHOR

       Frank B. Brokken (f.b.brokken@rug.nl),