Provided by: sisu_5.3.2-1_all bug

NAME

       sisu - documents: markup, structuring, publishing in multiple standard formats, and search

SYNOPSIS

       sisu [-short-options|--long-options] [filename/wildcard]

       sisu [-abCcDdeFGghIikLMmNnoPpQqRrSsTtUuVvWwXxYyZ_0-9] [filename/wildcard]

       sisu  --txt  --html  --epub  --odt  --pdf --wordmap --sqlite --manpage --texinfo --sisupod
       --source --qrcode [filename/wildcard]

       sisu [-Ddcv] [instruction] [filename/wildcard]

       sisu --pg (--createdb|update [filename/wildcard]|--dropall)

       sisu [operations]

       sisu [-CcFLSVvW]

       sisu (--configure|--webrick|--sample-search-form)

SISU - MANUAL,

       RALPH AMISSAH

WHAT IS SISU?

INTRODUCTION - WHAT IS SISU?


       SiSU is a lightweight markup based document creation  and  publishing  framework  that  is
       controlled  from  the  command  line. Prepare documents for SiSU using your text editor of
       choice, then use SiSU to generate various output document formats.

        From a single lightly prepared document (plain-text UTF-8 ) sisu  custom  builds  several
       standard  output  formats which share a common (text object) numbering system for citation
       of content within a document (that also has implications  for  search).  The  sisu  engine
       works  with  an  abstraction  of  the  document's  structure  and content from which it is
       possible to generate different forms of representation of the  document.   SiSU  produces:
       plain-text,  HTML, XHTML, XML, EPUB, ODF: ODT (Opendocument), LaTeX, PDF, and populates an
       SQL database ( PostgreSQL or SQLite ) with text objects, roughly, paragraph  sized  chunks
       so that document searches are done at this level of granularity.

         Outputs  share  a common citation numbering system, associated with text objects and any
       semantic meta-data provided about the document.

       SiSU also provides concordance files,  document  content  certificates  and  manifests  of
       generated output. Book indexes may be made.

         Some  document  markup  samples  are  provided  in  the  package  sisu  -markup-samples.
       Homepages:

       * <http://www.sisudoc.org/>

       * <http://www.jus.uio.no/sisu>

COMMANDS SUMMARY

DESCRIPTION


       SiSU is a document publishing system,  that  from  a  simple  single  marked-up  document,
       produces  multiple  output  formats  including:  plaintext,  HTML, XHTML, XML, EPUB, ODT (
       OpenDocument ( ODF ) text), LaTeX, PDF, info, and SQL ( PostgreSQL and SQLite  )  ,  which
       share  text  object  numbers ("object citation numbering") and the same document structure
       information. For more see: <http://sisudoc.org> or <http://www.jus.uio.no/sisu>

DOCUMENT PROCESSING COMMAND FLAGS

       -a [filename/wildcard]
              produces plaintext with Unix linefeeds and  without  markup,  (object  numbers  are
              omitted),  has  footnotes  at  end  of  each  paragraph that contains them [ -A for
              equivalent dos (linefeed) output file] [see -e  for  endnotes].  (Options  include:
              --endnotes  for  endnotes  --footnotes  for  footnotes at the end of each paragraph
              --unix for unix linefeed (default) --msdos for msdos linefeed)

       --ao [filename/wildcard/url]
              assumed for most  other  flags,  creates  new  intermediate  files  for  processing
              (abstract  objects, document abstraction) that is used in all subsequent processing
              of other output. This step is assumed for most processing flags. To skip it see -n.
              Alias -m. (sisu v5)

       -b [filename/wildcard]
              see --xhtml

       --by-* see --output-by-*

       -C     configure/initialise   shared  output  directory  files  initialize  shared  output
              directory (config files such as css and dtd files are not updated if  they  already
              exist  unless  modifier  is  used).  -C  --init-site configure/initialise site more
              extensive than -C on its own, shared output directory files/force update,  existing
              shared  output  config files such as css and dtd files are updated if this modifier
              is used.

       -CC    see --configure

       -c [filename/wildcard]
              see --color-toggle

       --color
              see --color-on

       --color-off
              turn off color in output to terminal

       --color-on
              turn on color in output to terminal

       --color-toggle [filename/wildcard]
              screen toggle ansi screen colour on or off depending on default set (unless -c flag
              is  used:  if sisurc colour default is set to 'true', output to screen will be with
              colour, if sisurc colour default is set to 'false' or is  undefined  screen  output
              will be without colour). Alias -c

       --configure
              configure/initialise   shared  output  directory  files  initialize  shared  output
              directory (config files such as css and dtd files are not updated if  they  already
              exist   unless   modifier   is   used).   The   equivalent   of:   -C   --init-site
              configure/initialise site, more  extensive  than  -C  on  its  own,  shared  output
              directory  files/force  update, existing shared output config files such as css and
              dtd files are updated if -CC is used.

       --concordance [filename/wildcard]
              produces concordance (wordmap) a rudimentary index of all the words in a  document.
              (Concordance  files  are  not  generated for documents of over 260,000 words unless
              this limit is increased in the file sisurc.yml). Alias -w

       -D [instruction] [filename]
              see --pg

       -d [--db-[database type (sqlite|pg)]] --[instruction] [filename]
              see --sqlite

       --dal [filename/wildcard/url]
              assumed for most  other  flags,  creates  new  intermediate  files  for  processing
              (abstract  objects, document abstraction) that is used in all subsequent processing
              of other output. This step is assumed for most processing flags. To skip it see -n.
              Renamed --ao (abstract objects) in sisu v5. Alias -m

       --delete [filename/wildcard]
              see --zap

       --docbook [filename/wildcard/url]
              docbook smart text (sisu v5)

       --dump[=directory_path] [filename/wildcard]
              places output in directory specified, if none is specified in the current directory
              (pwd). Unlike  using  default  settings  HTML  files  have  embedded  css.  Compare
              --redirect

       -e [filename/wildcard]
              see --epub

       --epub [filename/wildcard]
              produces an epub document, [sisu version >=2 ] (filename.epub). Alias -e

       --exc-*
              exclude output feature, overrides configuration settings --exc-ocn, (exclude object
              citation numbering, (switches off object citation numbering ) , affects html  (seg,
              scroll),  epub,  xhtml,  xml, pdf) ; --exc-toc, (exclude table of contents, affects
              html (scroll), epub, pdf) ; --exc-links-to-manifest, --exc-manifest-links, (exclude
              links  to manifest, affects html (seg, scroll)); --exc-search-form, (exclude search
              form, affects html (seg, scroll), manifest); --exc-minitoc, (exclude mini table  of
              contents,  affects  html  (seg),  concordance,  manifest);  --exc-manifest-minitoc,
              (exclude mini table of contents, affects  manifest);  --exc-html-minitoc,  (exclude
              mini  table  of  contents, affects html (seg), concordance); --exc-html-navigation,
              (exclude  navigation,  affects  html  (seg));  --exc-html-navigation-bar,  (exclude
              navigation  bar, affects html (seg)); --exc-html-search-form, (exclude search form,
              affects html (seg, scroll));  --exc-html-right-pane,  (exclude  right  pane/column,
              affects  html  (seg, scroll)); --exc-html-top-band, (exclude top band, affects html
              (seg, scroll), concordance (minitoc forced on to provide seg  navigation));  --exc-
              segsubtoc  (exclude  sub  table  of  contents, affects html (seg), epub) ; see also
              --inc-*

       -F [--webserv=webrick]
              see --sample-search-form

       -f [optional string part of filename]
              see --find

       --fictionbook [filename/wildcard/url]
              fictionbook smart text (sisu v5)

       --find [optional string part of filename]
              without match string, glob all .sst .ssm files  in  directory  (including  language
              subdirectories). With match string, find files that match given string in directory
              (including language subdirectories). Alias -f, --glob, -G

       -G [optional string part of filename]
              see --find

       -g [filename/wildcard]
              see --git

       --git [filename/wildcard]
              produces or updates markup source file structure in a git  repo  (experimental  and
              subject to change). Alias -g

       --glob [optional string part of filename]
              see --find

       -h [filename/wildcard]
              see --html

       --harvest *.ss[tm]
              makes  two  lists of sisu output based on the sisu markup documents in a directory:
              list of author and authors works (year and titles), and; list by topic with  titles
              and   author.   Makes   use   of  header  metadata  fields  (author,  title,  date,
              topic_register). Can be used with maintenance (-M) and remote placement (-R) flags.

       --help [topic]
              provides help on  the  selected  topic,  where  topics  (keywords)  include:  list,
              (com)mands,  short(cuts),  (mod)ifiers,  (env)ironment,  markup,  syntax,  headers,
              headings,  endnotes,  tables,  example,  customise,  skin,   (dir)ectories,   path,
              (lang)uage,  db,  install,  setup,  (conf)igure,  convert,  termsheet, search, sql,
              features, license.

       --html [filename/wildcard]
              produces html output, in two forms  (i)  segmented  text  with  table  of  contents
              (toc.html  and  index.html)  and  (ii) the document in a single file (scroll.html).
              Alias -h

       --html-scroll [filename/wildcard]
              produces html output, the document in a single  file  (scroll.html)  only.  Compare
              --html-seg and --html

       --html-seg [filename/wildcard]
              produces  html  output,  segmented  text  with  table  of  contents  (toc.html  and
              index.html). Compare --html-scroll and --html

       --html-strict [filename/wildcard]
              produces html with --strict option. see --strict

       -I [filename/wildcard]
              see --texinfo

       -i [filename/wildcard]
              see --manpage

       --i18n-*
              these flags affect output by filetype and  filename):  --i18n-mono  (--monolingual)
              output  filenames  without  language  code  for  default language ('en' or as set);
              --i18n-multi  (--multilingual)  language  code  provided  as  part  of  the  output
              filename,  this  is  the default. Where output is in one language only the language
              code may not be desired. see also --output-by-*

       --inc-*
              include output feature, overrides configuration settings, (usually the  default  if
              none  set),  has  precedence  over  --exc-*  (exclude  output feature). Some detail
              provided under --exc-*, see --exc-*

       -j [filename/wildcard]
              copies images associated with  a  file  for  use  by  html,  xhtml  &  xml  outputs
              (automatically invoked by --dump & redirect).

       -k     see --color-off

       --keep-processing-files [filename/wildcard/url]
              see --maintenance

       -M [filename/wildcard/url]
              see --maintenance

       -m [filename/wildcard/url]
              see --dal (document abstraction level/layer)

       --machine [filename/wildcard/url]
              see --dal (document abstraction level/layer)

       --maintenance [filename/wildcard/url]
              maintenance  mode,  interim  processing  files  are  preserved  and their locations
              indicated. (also see -V). Aliases -M and --keep-processing-files.

       --markdown [filename/wildcard/url]
              markdown smart text (sisu v5)

       --manpage [filename/wildcard]
              produces man page of file, not suitable for all outputs. Alias -i

       --monolingual
              see --i18n-*

       --multilingual
              see --i18n-*

       -N [filename/wildcard/url]
              document digest or document content certificate ( DCC ) as md5 digest tree  of  the
              document: the digest for the document, and digests for each object contained within
              the document (together with information on  software  versions  that  produced  it)
              (digest.txt). -NV for verbose digest output to screen.

       -n [filename/wildcard/url]
              skip  the  creation of intermediate processing files (document abstraction) if they
              already exist, this skips the equivalent of -m which is otherwise assumed  by  most
              processing flags.

       --no-* see --exc-*

       -o [filename/wildcard/url]
              see --odt

       --ocn  see --inc-ocn and --exc-ocn

       --odf [filename/wildcard/url]
              see --odt

       --odt [filename/wildcard/url]
              output basic document in opendocument file format (opendocument.odt). Alias -o

       --output-by-*
              select  output  directory  structure  from  3  alternatives:  --output-by-language,
              (language directory (based on language code) with filetype (html, epub,  pdf  etc.)
              subdirectories);  --output-by-filetype, (filetype directories with language code as
              part of filename); --output-by-filename, (filename directories with  language  code
              as part of filename). This is configurable. Alias --by-*

       -P [language_directory/filename language_directory]
              see --po4a

       -p [filename/wildcard]
              see --pdf

       --papersize-(a4|a5|b5|letter|legal)
              in conjunction with --pdf set pdf papersize, overriding any configuration settings,
              to set more than one papersize repeat the option --pdf --papersize-a4  --papersize-
              letter. See also --papersize=*

              --papersize=a4,a5,b5,letter,legal  in  conjunction  with  --pdf  set pdf papersize,
              overriding any configuration settings, to set more than one  papersize  list  after
              the equal sign with a comma separator --papersize=a4,letter. See also --papersize-*

       --pdf [filename/wildcard]
              produces LaTeX pdf (portrait.pdf & landscape.pdf). Orientation and papersize may be
              set on the command-line. Default paper size is set  in  config  file,  or  document
              header,  or  provided  with  additional command line parameter, e.g. --papersize-a4
              preset sizes include: 'A4', U.S. 'letter' and 'legal' and book sizes 'A5' and  'B5'
              (system  defaults  to  A4), and; --landscape or --portrait, so: e.g. "sisu --pdf-a4
              --pdf-letter --landscape --verbose [filename/wildcard]" or "sisu --pdf  --landscape
              --a4  --letter  --verbose  [filename/wildcard]". --pdf defaults to both landscape &
              portrait output, and a4 if no other papersizes  are  configured.   Related  options
              --pdf-landscape --pdf-portrait --pdf-papersize-* --pdf-papersize=[list]. Alias -p

       --pdf-l [filename/wildcard]
              See --pdf-landscape

       --pdf-landscape [filename/wildcard]
              sets  orientation,  produces  LaTeX pdf landscape.pdf. Default paper size is set in
              config  file,  or  document  header,  or  provided  with  additional  command  line
              parameter,  e.g.  --papersize-a4  preset  sizes  include:  'A4',  U.S. 'letter' and
              'legal' and book sizes 'A5' and --papersize-* or --papersize=[list]. Alias  --pdf-l
              or in conjunction with --pdf --landscape

       --pdf-p [filename/wildcard]
              See --pdf-portrait

       --pdf-portrait [filename/wildcard]
              sets orientation, produces LaTeX pdf portrait.pdf.pdf. Default paper size is set in
              config  file,  or  document  header,  or  provided  with  additional  command  line
              parameter,  e.g.  --papersize-a4  preset  sizes  include:  'A4',  U.S. 'letter' and
              'legal' and book sizes 'A5' and --papersize-* or --papersize=[list]. Alias  --pdf-p
              or in conjunction with --pdf --portrait

       --pg [instruction] [filename]
              database  PostgreSQL ( --pgsql may be used instead) possible instructions, include:
              --createdb; --create; --dropall; --import [filename]; --update [filename]; --remove
              [filename]; see database section below. Alias -D

       --po [language_directory/filename language_directory]
              see --po4a

       --po4a [language_directory/filename language_directory]
              produces  .pot and po files for the file in the languages specified by the language
              directory.  SiSU markup is placed in subdirectories named with the  language  code,
              e.g.  en/  fr/ es/. The sisu config file must set the output directory structure to
              multilingual. v3, experimental

       -Q [filename/wildcard]
              see --qrcode

       -q [filename/wildcard]
              see --quiet

       --qrcode [filename/wildcard]
              generate QR code image of metadata (used in manifest). v3 only.

       --quiet [filename/wildcard]
              quiet less output to screen.

       -R [filename/wildcard]
              see --rsync

       -r [filename/wildcard]
              see --scp

       --redirect[=directory_path] [filename/wildcard]
              places output in subdirectory under  specified  directory,  subdirectory  uses  the
              filename  (without  the  suffix).  If  no  output directory is specified places the
              subdirectory under the current directory (pwd). Unlike using default settings  HTML
              files have embedded css. Compare --dump

       --rst [filename/wildcard/url]
              ReST (rST restructured text) smart text (sisu v5)

       --rsync [filename/wildcard]
              copies  sisu output files to remote host using rsync. This requires that sisurc.yml
              has been provided with information on hostname and username, and that you have your
              "keys"  and  ssh agent in place. Note the behavior of rsync different if -R is used
              with other flags from if used alone. Alone the rsync --delete  parameter  is  sent,
              useful  for  cleaning  the  remote  directory  (when -R is used together with other
              flags, it is not). Also see --scp. Alias -R

       -S     see --sisupod

       -S [filename/wildcard]
              see --sisupod

       -s [filename/wildcard]
              see --source

       --sample-search-form [--db=(pgsql|sqlite)] [--webserv=webrick]
              generate examples of (naive) cgi search form for SQLite or PgSQL  depends  on  your
              already  having  used  sisu  to  populate  an SQLite or PgSQL database, (the SQLite
              version scans the output directories for existing sisu_sqlite databases, so  it  is
              first  necessary  to create them, before generating the search form) see --sqlite &
              --pg and the database section below. Optional additional  parameters  include:  url
              location  of  webserver  search  form and db: --webserv-search='[url]'; location of
              webserver output: --webserv-output='[url]';  cgi  search  form  link  name:  --cgi-
              search-form-name='[name.cgi]'; for pgsql, database user: --db-user='[username]'. If
              the optional parameter --webserv=webrick is passed, the cgi examples  created  will
              be set up to use the default port set for use by the webrick server, (otherwise the
              port is left blank and the system setting used, usually 80). The samples are dumped
              in  the  present  work  directory which must be writable, (with screen instructions
              given that they be copied to the cgi-bin directory). Alias -F

       --scp [filename/wildcard]
              copies sisu output files to remote host using scp. This  requires  that  sisurc.yml
              has been provided with information on hostname and username, and that you have your
              "keys" and ssh agent in place. Also see --rsync. Alias -r

       --sqlite --[instruction] [filename]
              database type set to SQLite, this produces one of two possible  databases,  without
              additional database related instructions it produces a discreet SQLite file for the
              document processed; with  additional  instructions  it  produces  a  common  SQLite
              database  of  all processed documents that (come from the same document preparation
              directory and as a result) share the same  output  directory  base  path  (possible
              instructions   include:   --createdb;  --create;  --dropall;  --import  [filename];
              --update [filename]; --remove [filename]); see database section below. Alias -d

       --sisupod
              produces a sisupod a zipped sisu directory of markup files  including  sisu  markup
              source  files and the directories local configuration file, images and skins. Note:
              this only includes the configuration files or skins contained  in  is  tested  only
              with zsh). Alias -S

       --sisupod [filename/wildcard]
              produces  a  zipped  file  of the prepared document specified along with associated
              images, by default named sisupod.zip they  may  alternatively  be  named  with  the
              filename  extension  .ssp This provides a quick way of gathering the relevant parts
              of a sisu document which can then for example be emailed. A sisupod  includes  sisu
              markup source file, (along with associated documents if a master file, or available
              in multilingual versions), together with related images and  skin.   SiSU  commands
              can  be  run directly against a sisupod contained in a local directory, or provided
              as a url on a remote site. As there is a security  issue  with  skins  provided  by
              other  users, they are not applied unless the flag --trust or --trusted is added to
              the command instruction, it is recommended that file that  are  not  your  own  are
              treated as untrusted. The directory structure of the unzipped file is understood by
              sisu, and sisu commands can be run within it.  Note: if you wish to  send  multiple
              files,  it  quickly  becomes more space efficient to zip the sisu markup directory,
              rather  than  the  individual  files  for  sending).  See  the  -S  option  without
              [filename/wildcard]. Alias -S

       --source [filename/wildcard]
              copies sisu markup file to output directory. Alias -s

       --strict
              together  with  --html,  produces  more  w3c compliant html, for example not having
              purely numeric identifiers for text, the location object url#33 becomes url#o33

       -T [filename/wildcard (*.termsheet.rb)]
              standard form document builder, preprocessing feature

       -t [filename/wildcard]
              see --txt

       --texinfo [filename/wildcard]
              produces texinfo and info file, (view with pinfo). Alias -I

       --textile [filename/wildcard/url]
              textile smart text (sisu v5)

       --txt [filename/wildcard]
              produces plaintext with Unix linefeeds and  without  markup,  (object  numbers  are
              omitted),  has  footnotes  at  end  of  each  paragraph that contains them [ -A for
              equivalent dos (linefeed) output file] [see -e  for  endnotes].  (Options  include:
              --endnotes  for  endnotes  --footnotes  for  footnotes at the end of each paragraph
              --unix for unix linefeed (default) --msdos for msdos linefeed). Alias -t

       --txt-asciitext [filename/wildcard]
              see --asciitext

       --txt-markdown [filename/wildcard]
              see --markdown

       --txt-rst [filename/wildcard]
              see --rst

       --txt-textile [filename/wildcard]
              see --textile

       -U [filename/wildcard]
              see --urls

       -u [filename/wildcard]
              provides url mapping of output files for the flags requested for  processing,  also
              see -U

       --urls [filename/wildcard]
              prints url output list/map for the available processing flags options and resulting
              files that could be requested, (can be used to get a list of processing options  in
              relation  to  a  file,  together  with  information  on  the  output  that would be
              produced), -u provides url output mapping for those flags requested for processing.
              The default assumes sisu_webrick is running and provides webrick url mappings where
              appropriate, but these can be switched to file system paths in sisurc.yml. Alias -U

       -V     on its own, provides SiSU version and environment information (sisu --help env)

       -V [filename/wildcard]
              even more verbose than the -v flag.

       -v     on its own, provides SiSU version information

       -v [filename/wildcard]
              see --verbose

       --v3 [filename/wildcard]
              invokes the sisu v3 document parser/generator. You may run sisu3 instead.

       --v4 [filename/wildcard]
              invokes the sisu v4 document parser/generator. This is the default and is  normally
              omitted.

       --verbose [filename/wildcard]
              provides  verbose  output  of  what is being generated, where output is placed (and
              error messages if any), as with -u flag provides a url mapping of files created for
              each of the processing flag requests. Alias -v

       -W     see --webrick

       -w [filename/wildcard]
              see --concordance

       --webrick
              starts  ruby'  s  webrick  webserver points at sisu output directories, the default
              port is set to 8081 and can be changed in the resource configuration files.   [tip:
              the  webrick  server requires link suffixes, so html output should be created using
              the -h option rather than -H ; also, note -F webrick ]. Alias -W

       --wordmap [filename/wildcard]
              see --concordance

       --xhtml [filename/wildcard]
              produces xhtml/ XML output for browser viewing (sax parsing). Alias -b

       --xml-dom [filename/wildcard]
              produces XML output with deep document structure, in the nature of dom. Alias -X

       --xml-sax [filename/wildcard]
              produces XML output shallow structure (sax parsing). Alias -x

       -X [filename/wildcard]
              see --xml-dom

       -x [filename/wildcard]
              see --xml-sax

       -Y [filename/wildcard]
              produces a short sitemap entry for the document,  based  on  html  output  and  the
              sisu_manifest. --sitemaps generates/updates the sitemap index of existing sitemaps.
              (Experimental, [g,y,m announcement this week])

       -y [filename/wildcard]
              produces an html summary of output generated (hyperlinked to content) and  document
              specific  metadata  (sisu_manifest.html).  This step is assumed for most processing
              flags.

       -Z [filename/wildcard]
              see --zap

       --zap [filename/wildcard]
              Zap, if used with other processing flags deletes output files of the type about  to
              be  processed,  prior  to  processing. If -Z is used as the lone processing related
              flag (or in conjunction with a combination of -[mMvVq]), will  remove  the  related
              document output directory. Alias -Z

COMMAND LINE MODIFIERS

       --no-ocn
              [with  --html  --pdf  or  --epub]  switches off object citation numbering.  Produce
              output without identifying numbers in margins of html or LaTeX /pdf output.

       --no-annotate
              strips output text of editor endnotes[^*1] denoted by asterisk or dagger/plus sign

       --no-asterisk
              strips output text of editor endnotes[^*2] denoted by asterisk sign

       --no-dagger
              strips output text of editor endnotes[^+1] denoted by dagger/plus sign

DATABASE COMMANDS

        dbi - database interface

        -D or --pgsql set for PostgreSQL -d or --sqlite default set for SQLite -d  is  modifiable
       with --db=[database type (PgSQL or SQLite ) ]

       --pg -v --createall
              initial  step,  creates required relations (tables, indexes) in existing PostgreSQL
              database (a database should be created manually and given the same name as  working
              directory,  as  requested)  (rb.dbi) [ -dv --createall SQLite equivalent] it may be
              necessary to run sisu -Dv --createdb  initially  NOTE:  at  the  present  time  for
              PostgreSQL  it  may be necessary to manually create the database. The command would
              be directory name (without path)]. Please use only alphanumerics and underscores.

       --pg -v --import
              [filename/wildcard] imports data specified to PostgreSQL db (rb.dbi) [ -dv --import
              SQLite equivalent]

       --pg -v --update
              [filename/wildcard]  updates/imports specified data to PostgreSQL db (rb.dbi) [ -dv
              --update SQLite equivalent]

       --pg --remove
              [filename/wildcard] removes specified data to PostgreSQL db (rb.dbi) [ -d  --remove
              SQLite equivalent]

       --pg --dropall
              kills  data" and drops ( PostgreSQL or SQLite ) db, tables & indexes [ -d --dropall
              SQLite equivalent]

               The -v is for verbose output.

SHORTCUTS, SHORTHAND FOR MULTIPLE FLAGS

       --update [filename/wildcard]
              Checks existing file output and runs the flags  required  to  update  this  output.
              This  means  that  if only html and pdf output was requested on previous runs, only
              the -hp files will be applied, and only these will be generated this time, together
              with  the  summary.  This can be very convenient, if you offer different outputs of
              different files, and just want to do the same again.

       -0 to -5 [filename or wildcard]
              Default  shorthand  mappings   (for   v3,   note   that   the   defaults   can   be
              changed/configured in the sisurc.yml file):

       -0     -NQhewpotbxXyYv  [this is the default action run when no options are give, i.e.  on
              'sisu [filename]']

       -1     -Qhewpoty

       -2     -NQhewpotbxXy

       -3     -NQhewpotbxXyY

       -4     -NQhewpotbxXDyY --update

       -5     -NQhewpotbxXDyYv --update

               add -v for verbose mode and -c to toggle color state, e.g. sisu -2vc [filename  or
              wildcard]

               consider -u for appended url info or -v for verbose output

COMMAND LINE WITH FLAGS - BATCH PROCESSING

         In  the data directory run sisu -mh filename or wildcard eg. "sisu -h cisg.sst" or "sisu
       -h *.{sst,ssm}" to produce html version of all documents.

        Running sisu (alone without any flags, filenames or wildcards) brings up the  interactive
       help, as does any sisu command that is not recognised. Enter to escape.

HELP

SISU MANUAL

        The most up to date information on sisu should be contained in the sisu_manual, available
       at:

        <http://sisudoc.org/sisu/sisu_manual/>

        The manual can be generated from source,  found  respectively,  either  within  the  SiSU
       tarball or installed locally at:

        /usr/share/doc/sisu/markup-samples/sisu_manual

        move to the respective directory and type e.g.:

        sisu sisu_manual.ssm

SISU MAN PAGES

        If SiSU is installed on your system usual man commands should be available, try:

        man sisu

         Most  SiSU man pages are generated directly from sisu documents that are used to prepare
       the sisu manual, the sources files for which are located within the SiSU tarball at:

         ./data/doc/sisu/markup-samples/sisu_manual

        Once installed, directory equivalent to:

         /usr/share/doc/sisu/markup-samples/sisu_manual

        Available man pages are converted back to html using man2html:

         /usr/share/doc/sisu/html/

         ./data/doc/sisu/html

        An online version of the sisu man page is available here:

        * various sisu man pages <http://www.jus.uio.no/sisu/man/> [^9]

        * sisu.1 <http://www.jus.uio.no/sisu/man/sisu.1.html> [^10]

SISU BUILT-IN INTERACTIVE HELP, [DISCONTINUED]

        This fell out of date and has been discontinued.

INTRODUCTION TO SISU MARKUP[^11]

SUMMARY

        SiSU source documents are plaintext ( UTF-8 )[^12] files

        All paragraphs are separated by an empty line.

        Markup is comprised of:

        * at the top of a document, the document header made up of semantic meta-data  about  the
       document  and  if  desired  additional  processing  instructions  (such  an instruction to
       automatically number headings from a particular level down)

        * followed  by  the  prepared  substantive  text  of  which  the  most  important  single
       characteristic is the markup of different heading levels, which define the primary outline
       of the document structure. Markup of substantive text includes:

         * heading levels defines document structure

         * text basic attributes, italics, bold etc.

         * grouped text (objects), which are to be treated differently, such as code
         blocks or poems.

         * footnotes/endnotes

        * linked text and images

        * paragraph actions, such as indent, bulleted, numbered-lists, etc.

        Some interactive help on markup is available, by typing sisu and selecting markup or sisu
       --help markup

        To check the markup in a file:

        sisu --identify [filename].sst

        For brief descriptive summary of markup history

        sisu --query-history

        or if for a particular version:

        sisu --query-0.38

MARKUP EXAMPLES

ONLINE

         Online  markup examples are available together with the respective outputs produced from
       <http://www.jus.uio.no/sisu/SiSU/examples.html>                  or                   from
       <http://www.jus.uio.no/sisu/sisu_examples/>

         There  is  of course this document, which provides a cursory overview of sisu markup and
       the respective output produced: <http://www.jus.uio.no/sisu/sisu_markup/>

        an alternative presentation of markup syntax: /usr/share/doc/sisu/on_markup.txt.gz

INSTALLED

        With SiSU installed sample skins may be found in: /usr/share/doc/sisu/markup-samples  (or
       equivalent   directory)   and   if   sisu   -markup-samples   is   installed  also  under:
       /usr/share/doc/sisu/markup-samples-non-free

MARKUP OF HEADERS

        Headers contain either: semantic meta-data about a document, which can  be  used  by  any
       output module of the program, or; processing instructions.

         Note: the first line of a document may include information on the markup version used in
       the form of a comment. Comments are a percentage mark at the start of a paragraph (and  as
       the first character in a line of text) followed by a space and the comment:

       % this would be a comment

SAMPLE HEADER

         This  current  document is loaded by a master document that has a header similar to this
       one:

       % SiSU master 4.0

       @title: SiSU
         :subtitle: Manual

       @creator:
         :author: Amissah, Ralph

       @publisher: [publisher name]

       @rights: Copyright (C) Ralph Amissah 2007, part of SiSU documentation, License GPL 3

       @classify:
         :topic_register: SiSU:manual;electronic documents:SiSU:manual
         :subject: ebook, epublishing, electronic book, electronic publishing,
           electronic document, electronic citation, data structure,
            citation systems, search

       % used_by: manual

       @date:
         :published: 2008-05-22
         :created: 2002-08-28
         :issued: 2002-08-28
         :available: 2002-08-28
         :modified: 2010-03-03

       @make:
         :num_top: 1
         :breaks: new=C; break=1
         :bold: /Gnu|Debian|Ruby|SiSU/
         :home_button_text: {SiSU}http://sisudoc.org; {git}http://git.sisudoc.org
         :footer: {SiSU}http://sisudoc.org; {git}http://git.sisudoc.org
         :manpage: name=sisu - documents: markup, structuring, publishing in multiple standard formats, and search;
            synopsis=sisu [-abcDdeFhIiMmNnopqRrSsTtUuVvwXxYyZz0-9] [filename/wildcard ]
            . sisu [-Ddcv] [instruction]
            . sisu [-CcFLSVvW]
            . sisu --v4 [operations]
            . sisu --v3 [operations]

       @links:
         { SiSU Homepage }http://www.sisudoc.org/
         { SiSU Manual }http://www.sisudoc.org/sisu/sisu_manual/
         { Book Samples & Markup Examples }http://www.jus.uio.no/sisu/SiSU/examples.html
         { SiSU Download }http://www.jus.uio.no/sisu/SiSU/download.html
         { SiSU Changelog }http://www.jus.uio.no/sisu/SiSU/changelog.html
         { SiSU Git repo }http://git.sisudoc.org/?p=code/sisu.git;a=summary
         { SiSU List Archives }http://lists.sisudoc.org/pipermail/sisu/
         { SiSU @ Debian }http://packages.qa.debian.org/s/sisu.html
         { SiSU Project @ Debian }http://qa.debian.org/developer.php?login=sisu@lists.sisudoc.org
         { SiSU @ Wikipedia }http://en.wikipedia.org/wiki/SiSU

AVAILABLE HEADERS

        Header tags appear at the beginning of a document and provide  meta  information  on  the
       document  (such as the Dublin Core ) , or information as to how the document as a whole is
       to be processed. All header instructions take the form @headername: or on  the  next  line
       and indented by once space :subheadername: All Dublin Core meta tags are available

        @indentifier: information or instructions

         where  the  "identifier"  is  a  tag recognised by the program, and the "information" or
       "instructions" belong to the tag/identifier specified

        Note: a header where used should only be used once; all headers apart  from  @title:  are
       optional; the @structure: header is used to describe document structure, and can be useful
       to know.

        This is a sample header

       % SiSU 2.0 [declared file-type identifier with markup version]

       @title: [title text] [this header is the only one that is mandatory]
         :subtitle: [subtitle if any]
         :language: English

       @creator:
         :author: [Lastname, First names]
         :illustrator: [Lastname, First names]
         :translator: [Lastname, First names]
         :prepared_by: [Lastname, First names]

       @date:
         :published: [year or yyyy-mm-dd]
         :created: [year or yyyy-mm-dd]
         :issued: [year or yyyy-mm-dd]
         :available: [year or yyyy-mm-dd]
         :modified: [year or yyyy-mm-dd]
         :valid: [year or yyyy-mm-dd]
         :added_to_site: [year or yyyy-mm-dd]
         :translated: [year or yyyy-mm-dd]

       @rights:
         :copyright: Copyright (C) [Year and Holder]
         :license: [Use License granted]
         :text: [Year and Holder]
         :translation: [Name, Year]
         :illustrations: [Name, Year]

       @classify:
         :topic_register: SiSU:markup sample:book;book:novel:fantasy
         :type:
         :subject:
         :description:
         :keywords:
         :abstract:
         :loc: [Library of Congress classification]
         :dewey: [Dewey classification

       @identify:
         :isbn: [ISBN]
         :oclc:

       @links: { SiSU }http://www.sisudoc.org
         { FSF }http://www.fsf.org

       @make:
         :num_top: 1
         :headings: [text to match for each level
           (e.g. PART; Chapter; Section; Article; or another: none; BOOK|FIRST|SECOND; none; CHAPTER;)
         :breaks: new=:C; break=1
         :promo: sisu, ruby, sisu_search_libre, open_society
         :bold: [regular expression of words/phrases to be made bold]
         :italics: [regular expression of words/phrases to italicise]
         :home_button_text: {SiSU}http://sisudoc.org; {git}http://git.sisudoc.org
         :footer: {SiSU}http://sisudoc.org; {git}http://git.sisudoc.org

       @original:
         :language: [language]

       @notes:
         :comment:
         :prefix: [prefix is placed just after table of contents]

MARKUP OF SUBSTANTIVE TEXT

HEADING LEVELS

        Heading levels are :A~ ,:B~ ,:C~ ,1~ ,2~ ,3~ ... :A - :C being part /  section  headings,
       followed  by other heading levels, and 1 -6 being headings followed by substantive text or
       sub-headings. :A~ usually the title :A~? conditional level 1 heading (used where a  stand-
       alone document may be imported into another)

         :A~  [heading  text]  Top  level  heading [this usually has similar content to the title
       @title: ] NOTE: the heading levels described here are in 0.38 notation, see heading

        :B~ [heading text] Second level heading [this is a heading level divider]

        :C~ [heading text] Third level heading [this is a heading level divider]

        1~ [heading text] Top level heading preceding substantive text of document or sub-heading
       2,  the heading level that would normally be marked 1. or 2. or 3. etc. in a document, and
       the level on which sisu by default would break html output into named segments, names  are
       provided   automatically   if  none  are  given  (a  number),  otherwise  takes  the  form
       1~my_filename_for_this_segment

        2~ [heading text] Second level heading preceding substantive text  of  document  or  sub-
       heading  3 , the heading level that would normally be marked 1.1 or 1.2 or 1.3 or 2.1 etc.
       in a document.

        3~ [heading text] Third level heading preceding substantive text of document, that  would
       normally be marked 1.1.1 or 1.1.2 or 1.2.1 or 2.1.1 etc. in a document

       1~filename level 1 heading,

       % the primary division such as Chapter that is followed by substantive text, and may be further subdivided (this is the level on which by default html segments are made)

FONT ATTRIBUTES

        markup example:
       normal text,  *{emphasis}*, !{bold text}!, /{italics}/, _{underscore}_, "{citation}",
       ^{superscript}^, ,{subscript},, +{inserted text}+, -{strikethrough}-, #{monospace}#

       normal text

       *{emphasis}* [note: can be configured to be represented by bold, italics or underscore]

       !{bold text}!

       /{italics}/

       _{underscore}_

       "{citation}"

       ^{superscript}^

       ,{subscript},

       +{inserted text}+

       -{strikethrough}-

       #{monospace}#

        resulting output:

         normal  text,  emphasis,  bold  text  ,  italics, underscore, "citation", ^superscript^,
       [subscript], ++inserted text++, --strikethrough--, monospace

        normal text

        emphasis [note: can be configured to be represented by bold, italics or underscore]

        bold text

        italics

        underscore

        "citation"

        ^superscript^

        [subscript]

        ++inserted text++


        --strikethrough--
        monospace

INDENTATION AND BULLETS

        markup example:
       ordinary paragraph

       _1 indent paragraph one step

       _2 indent paragraph two steps

       _9 indent paragraph nine steps

        resulting output:

        ordinary paragraph

         indent paragraph one step

           indent paragraph two steps

                         indent paragraph nine steps

        markup example:
       _* bullet text

       _1* bullet text, first indent

       _2* bullet text, two step indent

        resulting output:

        * bullet text

         * bullet text, first indent

           * bullet text, two step indent

        Numbered List (not to be confused with headings/titles, (document structure))

        markup example:
       # numbered list                numbered list 1., 2., 3, etc.

       _# numbered list numbered list indented a., b., c., d., etc.

HANGING INDENTS

        markup example:
       _0_1 first line no indent,
       rest of paragraph indented one step

       _1_0 first line indented,
       rest of paragraph no indent

       in each case level may be 0-9

        resulting output:

        first line no indent, rest of paragraph indented one step; first line no
         indent, rest of paragraph indented one step; first line no indent, rest of
         paragraph indented one step; first line no indent, rest of paragraph indented
         one step; first line no indent, rest of paragraph indented one step; first
         line no indent, rest of paragraph indented one step; first line no indent,
         rest of paragraph indented one step; first line no indent, rest of paragraph
         indented one step; first line no indent, rest of paragraph indented one step;

        A regular paragraph.

        first line indented, rest of paragraph no indent first line indented, rest  of  paragraph
       no  indent  first  line indented, rest of paragraph no indent first line indented, rest of
       paragraph no indent first line indented, rest of paragraph no indent first line  indented,
       rest  of  paragraph  no indent first line indented, rest of paragraph no indent first line
       indented, rest of paragraph no indent first line indented, rest  of  paragraph  no  indent
       first line indented, rest of paragraph no indent first line indented, rest of paragraph no
       indent

        in each case level may be 0-9

        live-build
         A collection of scripts used to build customized Debian

         Livesystems.
         .I live-build
         was formerly known as live-helper, and even earlier known as live-package.

        live-build

         A collection of scripts used to build customized Debian
         Livesystems.  live-build
         was formerly known as live-helper, and even earlier known as live-package.

FOOTNOTES / ENDNOTES

        Footnotes and endnotes are marked up at the location where they would be indicated within
       a  text.  They are automatically numbered. The output type determines whether footnotes or
       endnotes will be produced

        markup example:

       ~{ a footnote or endnote }~

        resulting output:

        [^13]

        markup example:
       normal text~{ self contained endnote marker & endnote in one }~ continues

        resulting output:

        normal text[^14] continues

        markup example:
       normal text ~{* unnumbered asterisk footnote/endnote, insert multiple asterisks if required }~ continues

       normal text ~{** another unnumbered asterisk footnote/endnote }~ continues

        resulting output:

        normal text [^*] continues

        normal text [^**] continues

        markup example:

       normal text ~[* editors notes, numbered asterisk footnote/endnote series ]~ continues

       normal text ~[+ editors notes, numbered plus symbol footnote/endnote series ]~ continues

        resulting output:

        normal text [^*3] continues

        normal text [^+2] continues

        Alternative endnote pair notation for footnotes/endnotes:
       % note the endnote marker "~^"

       normal text~^ continues

       ^~ endnote text following the paragraph in which the marker occurs

        the standard and pair notation cannot be mixed in the same document

LINKS

NAKED URLS WITHIN TEXT, DEALING WITH URLS

        urls found within text are marked up automatically. A url within  text  is  automatically
       hyperlinked  to  itself  and  by  default  decorated  with  angled braces, unless they are
       contained within a code block (in which case they are passed as normal text),  or  escaped
       by a preceding underscore (in which case the decoration is omitted).

        markup example:
       normal text http://www.sisudoc.org/ continues

        resulting output:

        normal text <http://www.sisudoc.org/> continues

        An escaped url without decoration

        markup example:
       normal text _http://www.sisudoc.org/ continues

       deb _http://www.jus.uio.no/sisu/archive unstable main non-free

        resulting output:

        normal text <_http://www.sisudoc.org/> continues

        deb <_http://www.jus.uio.no/sisu/archive> unstable main non-free

         where a code block is used there is neither decoration nor hyperlinking, code blocks are
       discussed later in this document

        resulting output:
       deb http://www.jus.uio.no/sisu/archive unstable main non-free
       deb-src http://www.jus.uio.no/sisu/archive unstable main non-free

LINKING TEXT

        To link text or an image to a url the markup is as follows

        markup example:
       about { SiSU }http://url.org markup

        resulting output:

        aboutSiSU <http://www.sisudoc.org/> markup

        A shortcut notation is available so the url link may also be provided automatically as  a
       footnote

        markup example:
       about {~^ SiSU }http://url.org markup

        resulting output:

        aboutSiSU <http://www.sisudoc.org/> [^15] markup

        Internal document links to a tagged location, including an ocn

        markup example:
       about { text links }#link_text

        resulting output:

        about ⌠text links⌡⌈link_text⌋

        Shared document collection link

        markup example:
       about { SiSU book markup examples }:SiSU/examples.html

        resulting output:

        about ⌠ SiSU book markup examples⌡⌈:SiSU/examples.html⌋

LINKING IMAGES

        markup example:
       { tux.png 64x80 }image

       % various url linked images

       {tux.png 64x80 "a better way" }http://www.sisudoc.org/

       {GnuDebianLinuxRubyBetterWay.png 100x101 "Way Better - with Gnu/Linux, Debian and Ruby" }http://www.sisudoc.org/

       {~^ ruby_logo.png "Ruby" }http://www.ruby-lang.org/en/

        resulting output:

        [ tux.png ]

        tux.png 64x80 "Gnu/Linux - a better way" <http://www.sisudoc.org/>

         GnuDebianLinuxRubyBetterWay.png  100x101  "Way Better - with Gnu/Linux, Debian and Ruby"
       <http://www.sisudoc.org/>

        ruby_logo.png 70x90 "Ruby" <http://www.ruby-lang.org/en/> [^16]

        linked url footnote shortcut
       {~^ [text to link] }http://url.org

       % maps to: { [text to link] }http://url.org ~{ http://url.org }~

       % which produces hyper-linked text within a document/paragraph, with an endnote providing the url for the text location used in the hyperlink

       text marker *~name

        note at a heading level the same is automatically achieved by providing names to headings
       1,  2  and  3 i.e. 2~[name] and 3~[name] or in the case of auto-heading numbering, without
       further intervention.

LINK SHORTCUT FOR MULTIPLE VERSIONS OF A SISU DOCUMENT IN THE SAME DIRECTORY

       TREE

        markup example:
       !_ /{"Viral Spiral"}/, David Bollier

       { "Viral Spiral", David Bollier [3sS]}viral_spiral.david_bollier.sst

           Viral    Spiral    ,    David    Bollier     "Viral     Spiral",     David     Bollier
       <http://corundum/sisu_manual/en/manifest/viral_spiral.david_bollier.html>
            document                                                                     manifest
       <http://corundum/sisu_manual/en/manifest/viral_spiral.david_bollier.html>
             ⌠html,                                                                     segmented
       text⌡「http://corundum/sisu_manual/en/html/viral_spiral.david_bollier.html」
             ⌠html,                     scroll,                    document                    in
       one⌡「http://corundum/sisu_manual/en/html/viral_spiral.david_bollier.html」
             ⌠epub⌡「http://corundum/sisu_manual/en/epub/viral_spiral.david_bollier.epub」
             ⌠pdf,
       landscape⌡「http://corundum/sisu_manual/en/pdf/viral_spiral.david_bollier.pdf」
             ⌠pdf, portrait⌡「http://corundum/sisu_manual/en/pdf/viral_spiral.david_bollier.pdf」
             ⌠odf:                     odt,                     open                     document
       text⌡「http://corundum/sisu_manual/en/odt/viral_spiral.david_bollier.odt」
             ⌠xhtml
       scroll⌡「http://corundum/sisu_manual/en/xhtml/viral_spiral.david_bollier.xhtml」
             ⌠xml, sax⌡「http://corundum/sisu_manual/en/xml/viral_spiral.david_bollier.xml」
             ⌠xml, dom⌡「http://corundum/sisu_manual/en/xml/viral_spiral.david_bollier.xml」
             ⌠concordance⌡「http://corundum/sisu_manual/en/html/viral_spiral.david_bollier.html」
             ⌠dcc,                  document                  content                 certificate
       (digests)⌡「http://corundum/sisu_manual/en/digest/viral_spiral.david_bollier.txt」
             ⌠markup                                                                       source
       text⌡「http://corundum/sisu_manual/en/src/viral_spiral.david_bollier.sst」
             ⌠markup                                source                               (zipped)
       pod⌡「http://corundum/sisu_manual/en/pod/viral_spiral.david_bollier.sst.zip

GROUPED TEXT

TABLES

        Tables may be prepared in two either of two forms

        markup example:
       table{ c3; 40; 30; 30;

       This is a table
       this would become column two of row one
       column three of row one is here

       And here begins another row
       column two of row two
       column three of row two, and so on

       }table

        resulting output:

       This is a table|this would become column two  of  row  one|column  three  of  row  one  is
       here』And  here  begins  another row|column two of row two|column three of row two, and so
       on』

        a second form may be easier to work with in cases where there is not much information  in
       each column

        markup example: [^18]
       !_ Table 3.1: Contributors to Wikipedia, January 2001 - June 2005

       {table~h 24; 12; 12; 12; 12; 12; 12;}
                                       |Jan. 2001|Jan. 2002|Jan. 2003|Jan. 2004|July 2004|June 2006
       Contributors*                   |       10|      472|    2,188|    9,653|   25,011|   48,721
       Active contributors**           |        9|      212|      846|    3,228|    8,442|   16,945
       Very active contributors***     |        0|       31|      190|      692|    1,639|    3,016
       No. of English language articles|       25|   16,000|  101,000|  190,000|  320,000|  630,000
       No. of articles, all languages  |       25|   19,000|  138,000|  490,000|  862,000|1,600,000

       * Contributed at least ten times; ** at least 5 times in last month; *** more than 100 times in last month.

        resulting output:

         Table 3.1: Contributors to Wikipedia, January 2001 - June 2005 |Jan. 2001|Jan. 2002|Jan.
       2003|Jan. 2004|July 2004|June 2006』Contributors*|10|472|2,188|9,653|25,011|48,721』Active
       contributors**|9|212|846|3,228|8,442|16,945』Very                                   active
       contributors***|0|31|190|692|1,639|3,016』No.         of         English          language
       articles|25|16,000|101,000|190,000|320,000|630,000』No.       of       articles,       all
       languages|25|19,000|138,000|490,000|862,000|1,600,000』

        * Contributed at least ten times; ** at least 5 times in last month; ***  more  than  100
       times in last month.

POEM

        basic markup:
       poem{

         Your poem here

       }poem

       Each verse in a poem is given an object number.

        markup example:
       poem{

                           `Fury said to a
                          mouse, That he
                        met in the
                      house,
                   "Let us
                     both go to
                       law:  I will
                         prosecute
                           YOU.  --Come,
                              I'll take no
                               denial; We
                            must have a
                        trial:  For
                     really this
                  morning I've
                 nothing
                to do."
                  Said the
                    mouse to the
                      cur, "Such
                        a trial,
                          dear Sir,
                                With
                            no jury
                         or judge,
                       would be
                     wasting
                    our
                     breath."
                      "I'll be
                        judge, I'll
                          be jury,"
                                Said
                           cunning
                             old Fury:
                            "I'll
                             try the
                                whole
                                 cause,
                                    and
                               condemn
                              you
                             to
                              death."'

       }poem

        resulting output:
                           `Fury said to a
                          mouse, That he
                        met in the
                      house,
                   "Let us
                     both go to
                       law:  I will
                         prosecute
                           YOU.  --Come,
                              I'll take no
                               denial; We
                            must have a
                        trial:  For
                     really this
                  morning I've
                 nothing
                to do."
                  Said the
                    mouse to the
                      cur, "Such
                        a trial,
                          dear Sir,
                                With
                            no jury
                         or judge,
                       would be
                     wasting
                    our
                     breath."
                      "I'll be
                        judge, I'll
                          be jury,"
                                Said
                           cunning
                             old Fury:
                            "I'll
                             try the
                                whole
                                 cause,
                                    and
                               condemn
                              you
                             to
                              death."'

GROUP

        basic markup:
       group{

         Your grouped text here

       }group

       A group is treated as an object and given a single object number.

        markup example:
       group{

                           `Fury said to a
                          mouse, That he
                        met in the
                      house,
                   "Let us
                     both go to
                       law:  I will
                         prosecute
                           YOU.  --Come,
                              I'll take no
                               denial; We
                            must have a
                        trial:  For
                     really this
                  morning I've
                 nothing
                to do."
                  Said the
                    mouse to the
                      cur, "Such
                        a trial,
                          dear Sir,
                                With
                            no jury
                         or judge,
                       would be
                     wasting
                    our
                     breath."
                      "I'll be
                        judge, I'll
                          be jury,"
                                Said
                           cunning
                             old Fury:
                            "I'll
                             try the
                                whole
                                 cause,
                                    and
                               condemn
                              you
                             to
                              death."'

       }group

        resulting output:
                           `Fury said to a
                          mouse, That he
                        met in the
                      house,
                   "Let us
                     both go to
                       law:  I will
                         prosecute
                           YOU.  --Come,
                              I'll take no
                               denial; We
                            must have a
                        trial:  For
                     really this
                  morning I've
                 nothing
                to do."
                  Said the
                    mouse to the
                      cur, "Such
                        a trial,
                          dear Sir,
                                With
                            no jury
                         or judge,
                       would be
                     wasting
                    our
                     breath."
                      "I'll be
                        judge, I'll
                          be jury,"
                                Said
                           cunning
                             old Fury:
                            "I'll
                             try the
                                whole
                                 cause,
                                    and
                               condemn
                              you
                             to
                              death."'

CODE

         Code  tags  code{  ... }code (used as with other group tags described above) are used to
       escape regular sisu markup, and have been used extensively within this document to provide
       examples  of  SiSU  markup. You cannot however use code tags to escape code tags. They are
       however used in the same way as group or poem tags.

        A code-block is treated as an object and given a single  object  number.  [an  option  to
       number each line of code may be considered at some later time]

        use of code tags instead of poem compared, resulting output:
                           `Fury said to a
                          mouse, That he
                        met in the
                      house,
                   "Let us
                     both go to
                       law:  I will
                         prosecute
                           YOU.  --Come,
                              I'll take no
                               denial; We
                            must have a
                        trial:  For
                     really this
                  morning I've
                 nothing
                to do."
                  Said the
                    mouse to the
                      cur, "Such
                        a trial,
                          dear Sir,
                                With
                            no jury
                         or judge,
                       would be
                     wasting
                    our
                     breath."
                      "I'll be
                        judge, I'll
                          be jury,"
                                Said
                           cunning
                             old Fury:
                            "I'll
                             try the
                                whole
                                 cause,
                                    and
                               condemn
                              you
                             to
                              death."'

        From SiSU 2.7.7 on you can number codeblocks by placing a hash after the opening code tag
       code{# as demonstrated here:

       1  |                    `Fury said to a
       2  |                   mouse, That he
       3  |                 met in the
       4  |               house,
       5  |            "Let us
       6  |              both go to
       7  |                law:  I will
       8  |                  prosecute
       9  |                    YOU.  --Come,
       10 |                       I'll take no
       11 |                        denial; We
       12 |                     must have a
       13 |                 trial:  For
       14 |              really this
       15 |           morning I've
       16 |          nothing
       17 |         to do."
       18 |           Said the
       19 |             mouse to the
       20 |               cur, "Such
       21 |                 a trial,
       22 |                   dear Sir,
       23 |                         With
       24 |                     no jury
       25 |                  or judge,
       26 |                would be
       27 |              wasting
       28 |             our
       29 |              breath."
       30 |               "I'll be
       31 |                 judge, I'll
       32 |                   be jury,"
       33 |                         Said
       34 |                    cunning
       35 |                      old Fury:
       36 |                     "I'll
       37 |                      try the
       38 |                         whole
       39 |                          cause,
       40 |                             and
       41 |                        condemn
       42 |                       you
       43 |                      to
       44 |                       death."'

ADDITIONAL BREAKS - LINEBREAKS WITHIN OBJECTS, COLUMN AND PAGE-BREAKS

LINE-BREAKS

        To break a line within a "paragraph object", two backslashes \\ with a space before and a
       space or newline after them may be used.

       To break a line within a "paragraph object",
       two backslashes \\ with a space before
       and a space or newline after them \\
       may be used.

         The  html  break  br  enclosed  in  angle brackets (though undocumented) is available in
       versions prior to 3.0.13 and 2.9.7 (it remains  available  for  the  time  being,  but  is
       depreciated).

        To draw a dividing line dividing paragraphs, see the section on page breaks.

PAGE BREAKS

         Page  breaks are only relevant and honored in some output formats. A page break or a new
       page may be inserted manually using the following markup on a line on its own:

        page new =\= or breaks the page, starts a new page.

        page break -\- or breaks a column, starts a new column, if using columns, else breaks the
       page, starts a new page.

        page break line across page -..- draws a dividing line, dividing paragraphs

        page break:
       -\\-

       or
       <:pb>

        page (break) new:
       =\\=
       or
       <:pn>

        page (break) line across page (dividing paragraphs):
       -..-

BOOK INDEX

         To  make  an index append to paragraph the book index term relates to it, using an equal
       sign and curly braces.

        Currently two levels are provided, a main term and if needed a sub-term.   Sub-terms  are
       separated from the main term by a colon.
         Paragraph containing main term and sub-term.
         ={Main term:sub-term}

         The  index  syntax  starts  on a new line, but there should not be an empty line between
       paragraph and index markup.

        The structure of the resulting index would be:

         Main term, 1
           sub-term, 1

        Several terms may relate to a paragraph, they are separated by a semicolon. If  the  term
       refers to more than one paragraph, indicate the number of paragraphs.

         Paragraph containing main term, second term and sub-term.
         ={first term; second term: sub-term}

        The structure of the resulting index would be:
         First term, 1,
         Second term, 1,
           sub-term, 1

         If multiple sub-terms appear under one paragraph, they are separated under the main term
       heading from each other by a pipe symbol.
         Paragraph containing main term, second term and sub-term.
         ={Main term:
             sub-term+2|second sub-term;
           Another term
          }

         A paragraph that continues discussion of the first sub-term

        The plus one in the example provided indicates the first sub-term  spans  one  additional
       paragraph. The logical structure of the resulting index would be:
         Main term, 1,
           sub-term, 1-3,
           second sub-term, 1,
         Another term, 1

COMPOSITE DOCUMENTS MARKUP

         It  is  possible  to  build a document by creating a master document that requires other
       documents. The documents required may  be  complete  documents  that  could  be  generated
       independently,  or they could be markup snippets, prepared so as to be easily available to
       be placed within another text. If the calling document is a master  document  (built  from
       other  documents),  it should be named with the suffix .ssm Within this document you would
       provide information on the other documents that should be included within the text.  These
       may  be  other documents that would be processed in a regular way, or markup bits prepared
       only  for  inclusion  within  a  master  document  .sst  regular  markup  file,  or   .ssi
       (insert/information)  A  secondary  file  of  the  composite  document  is  built prior to
       processing with the same prefix and the suffix ._sst

        basic markup for importing a document into a master document

       << filename1.sst

       << filename2.ssi

        The form described above should be relied on. Within the Vim editor  it  results  in  the
       text thus linked becoming hyperlinked to the document it is calling in which is convenient
       for editing.

SUBSTITUTIONS


       markup example:
       The current Debian is ${debian_stable} the next debian will be ${debian_testing}

       Configure substitution in _sisu/sisu_document_make

       @make:
       :substitute: /${debian_stable}/,'*{Wheezy}*' /${debian_testing}/,'*{Jessie}*'

       resulting output:

        The current Debian is Wheezy the next debian will be Jessie

        Configure substitution in _sisu/sisu_document_make

SISU FILETYPES

        SiSU has plaintext and binary filetypes, and can process either type of document.

.SST .SSM .SSI MARKED UP PLAIN TEXT

       SiSU   documents are prepared as plain-text (utf-8) files with SiSU markup. They may  make
              reference  to  and  contain images (for example), which are stored in the directory
              beneath them _sisu/image. 〔b¤SiSU plaintext markup files are of three  types  that
              may  be  distinguished  by  the  file  extension  used:  regular  text .sst; master
              documents, composite documents that  incorporate  other  text,  which  can  be  any
              regular  text  or  text  insert; and inserts the contents of which are like regular
              text except these are marked .ssi and are not processed.

               SiSU processing can be done directly  against  a  sisu  documents;  which  may  be
              located locally or on a remote server for which a url is provided.

               SiSU source markup can be shared with the command:

               sisu -s [filename]

SISU TEXT - REGULAR FILES (.SST)

        The most common form of document in SiSU, see the section on SiSU markup.

SISU MASTER FILES (.SSM)

         Composite  documents  which incorporate other SiSU documents which may be either regular
       SiSU text .sst which may be generated independently, or inserts prepared  solely  for  the
       purpose of being incorporated into one or more master documents.

         The  mechanism  by which master files incorporate other documents is described as one of
       the headings under under SiSU markup in the SiSU manual.

        Note: Master documents may be prepared  in  a  similar  way  to  regular  documents,  and
       processing  will occur normally if a .sst file is renamed .ssm without requiring any other
       documents; the .ssm marker flags that the document may contain other documents.

        Note: a secondary file of the composite document is built prior to  processing  with  the
       same prefix and the suffix ._sst [^19]

SISU INSERT FILES (.SSI)

         Inserts  are documents prepared solely for the purpose of being incorporated into one or
       more master documents. They resemble regular SiSU text files except they  are  ignored  by
       the  SiSU  processor.  Making a file a .ssi file is a quick and convenient way of flagging
       that it is not intended that the file should be processed on its own.

SISUPOD, ZIPPED BINARY CONTAINER (SISUPOD.ZIP, .SSP)

        A sisupod is a zipped SiSU text file or set of SiSU text files and any associated  images
       that they contain (this will be extended to include sound and multimedia-files)

       SiSU   plaintext  files  rely on a recognised directory structure to find contents such as
              images associated with documents, but all images  for  example  for  all  documents
              contained  in a directory are located in the sub-directory _sisu/image. Without the
              ability to create a sisupod it can be inconvenient to manually identify  all  other
              files  associated  with  a document. A sisupod automatically bundles all associated
              files with the document that is turned into a pod.

               The structure of the sisupod is such that it may  for  example  contain  a  single
              document  and its associated images; a master document and its associated documents
              and anything else; or the zipped contents of a whole  directory  of  prepared  SiSU
              documents.

               The command to create a sisupod is:

               sisu -S [filename]

               Alternatively, make a pod of the contents of a whole directory:

               sisu -S

                SiSU  processing  can  be  done  directly against a sisupod; which may be located
              locally or on a remote server for which a url is provided.

               <http://www.sisudoc.org/sisu/sisu_commands>

               <http://www.sisudoc.org/sisu/sisu_manual>

CONFIGURATION

CONFIGURATION FILES

CONFIG.YML

        SiSU configration parameters are adjusted in the configuration file, which can be used to
       override the defaults set. This includes such things as which directory interim processing
       should be done in and where the generated output should be placed.

        The SiSU configuration file is a yaml file, which means indentation is significant.

        SiSU resource configuration is determined by looking  at  the  following  files  if  they
       exist:

         ./_sisu/v4/sisurc.yml

         ./_sisu/sisurc.yml

         ~/.sisu/v4/sisurc.yml

         ~/.sisu/sisurc.yml

         /etc/sisu/v4/sisurc.yml

         /etc/sisu/sisurc.yml

        The search is in the order listed, and the first one found is used.

         In  the  absence  of  instructions in any of these it falls back to the internal program
       defaults.

        Configuration determines the output and processing directories and  the  database  access
       details.

        If SiSU is installed a sample sisurc.yml may be found in /etc/sisu/sisurc.yml

SISU_DOCUMENT_MAKE

         Most  sisu document headers relate to metadata, the exception is the @make: header which
       provides processing related information. The default contents of the @make header  may  be
       set by placing them in a file sisu_document_make.

        The search order is as for resource configuration:

         ./_sisu/v4/sisu_document_make

         ./_sisu/sisu_document_make

         ~/.sisu/v4/sisu_document_make

         ~/.sisu/sisu_document_make

         /etc/sisu/v4/sisu_document_make

         /etc/sisu/sisu_document_make

         A  sample  sisu_document_make  can be found in the _sisu/ directory under along with the
       provided sisu markup samples.

CSS - CASCADING STYLE SHEETS (FOR HTML, XHTML AND XML)

        CSS files to modify the appearance of SiSU html, XHTML  or  XML  may  be  placed  in  the
       configuration  directory:  ./_sisu/css  ;  ~/.sisu/css or; /etc/sisu/css and these will be
       copied to the output directories with the command sisu -CC.

        The basic CSS file for html output is html. css, placing a file of that name in directory
       _sisu/css or equivalent will result in the default file of that name being overwritten.

        HTML: html. css

        XML DOM: dom.css

        XML SAX: sax.css

        XHTML: xhtml. css

        The default homepage may use homepage.css or html. css

         Under  consideration  is  to permit the placement of a CSS file with a different name in
       directory _sisu/css directory or equivalent.[^20]

ORGANISING CONTENT - DIRECTORY STRUCTURE AND MAPPING

        SiSU v3 has new options for the source directory tree, and output directory structures of
       which there are 3 alternatives.

DOCUMENT SOURCE DIRECTORY

         The  document  source  directory  is the directory in which sisu processing commands are
       given. It contains the sisu source files (.sst .ssm .ssi), or (for sisu  v3  may  contain)
       subdirectories  with  language  codes  which contain the sisu source files, so all English
       files would go in subdirectory en/, French in fr/, Spanish in es/ and  so  on.  ISO  639-1
       codes  are  used  (as  varied  by  po4a). A list of available languages (and possible sub-
       directory names) can be obtained with the command "sisu --help lang" The list of languages
       is limited to langagues supported by XeTeX polyglosia.

GENERAL DIRECTORIES

       % files stored at this level e.g. sisu_manual.sst or
       % for sisu v3 may be under language sub-directories
       % e.g.

        ./subject_name/en

        ./subject_name/fr

        ./subject_name/es

        ./subject_name/_sisu

        ./subject_name/_sisu/css

        ./subject_name/_sisu/image

DOCUMENT OUTPUT DIRECTORY STRUCTURES

OUTPUT DIRECTORY ROOT

         The  output  directory  root  can  be  set  in  the  sisurc.yml  file.  Under  the root,
       subdirectories are made for each directory in which a document set resides. If you have  a
       directory  named  poems  or  conventions,  that directory will be created under the output
       directory root and the output for all documents contained in the directory of a particular
       name  will  be generated to subdirectories beneath that directory (poem or conventions). A
       document will be placed in a subdirectory of the  same  name  as  the  document  with  the
       filetype identifier stripped (.sst .ssm)

        The last part of a directory path, representing the sub-directory in which a document set
       resides, is the directory name that will be  used  for  the  output  directory.  This  has
       implications  for the organisation of document collections as it could make sense to place
       documents of a particular subject, or type  within  a  directory  identifying  them.  This
       grouping  as  suggested  could  be  by subject (sales_law, english_literature); or just as
       conveniently by some other classification (X University). The mapping  means  it  is  also
       possible  to  place  in  the  same  output directory documents that are for organisational
       purposes kept separately, for example documents  on  a  given  subject  of  two  different
       institutions  may be kept in two different directories of the same name, under a directory
       named after each institution, and these would be output  to  the  same  output  directory.
       Skins  could  be  associated  with  each  institution  on  a directory basis and resulting
       documents will take on the appropriate different appearance.

ALTERNATIVE OUTPUT STRUCTURES

        There are 3 possibile output structures described as being, by language, by  filetype  or
       by filename, the selection is made in sisurc.yml

       #% output_dir_structure_by: language; filetype; or filename
       output_dir_structure_by: language   #(language & filetype, preferred?)
       #output_dir_structure_by: filetype
       #output_dir_structure_by: filename  #(default, closest to original v1 & v2)

BY LANGUAGE

        The by language directory structure places output files

        The by language directory structure separates output files by language code (all files of
       a given language), and within the language directory by filetype.

        Its selection is configured in sisurc.yml

        output_dir_structure_by: language

           |-- en
           |-- epub
           |-- hashes
           |-- html
           | |-- viral_spiral.david_bollier
           | |-- manifest
           | |-- qrcode
           | |-- odt
           | |-- pdf
           | |-- sitemaps
           | |-- txt
           | |-- xhtml
           | `-- xml
           |-- po4a
           | `-- live-manual
           |     |-- po
           |     |-- fr
           |     `-- pot
           `-- _sisu
               |-- css
               |-- image
               |-- image_sys -> ../../_sisu/image_sys
               `-- xml
                   |-- rnc
                   |-- rng
                   `-- xsd

        #by: language subject_dir/en/manifest/filename.html

BY FILETYPE

        The by filetype directory structure separates output files by filetype, all html files in
       one directory pdfs in another and so on. Filenames are given a language extension.

        Its selection is configured in sisurc.yml

        output_dir_structure_by: filetype

           |-- epub
           |-- hashes
           |-- html
           |-- viral_spiral.david_bollier
           |-- manifest
           |-- qrcode
           |-- odt
           |-- pdf
           |-- po4a
           |-- live-manual
           |     |-- po
           |     |-- fr
           |     `-- pot
           |-- _sisu
           | |-- css
           | |-- image
           | |-- image_sys -> ../../_sisu/image_sys
           | `-- xml
           |     |-- rnc
           |     |-- rng
           |     `-- xsd
           |-- sitemaps
           |-- txt
           |-- xhtml
           `-- xml

        #by: filetype subject_dir/html/filename/manifest.en.html

BY FILENAME

         The  by  filename  directory  structure  places  most  output  of a particular file (the
       different filetypes) in a common directory.

        Its selection is configured in sisurc.yml

        output_dir_structure_by: filename

           |-- epub
           |-- po4a
           |-- live-manual
           |     |-- po
           |     |-- fr
           |     `-- pot
           |-- _sisu
           | |-- css
           | |-- image
           | |-- image_sys -> ../../_sisu/image_sys
           | `-- xml
           |     |-- rnc
           |     |-- rng
           |     `-- xsd
           |-- sitemaps
           |-- src
           |-- pod
           `-- viral_spiral.david_bollier

        #by: filename subject_dir/filename/manifest.en.html

REMOTE DIRECTORIES

        ./subject_name/

       % containing sub_directories named after the generated files from which they are made

        ./subject_name/src

       % contains shared source files text and binary e.g. sisu_manual.sst and sisu_manual.sst.zip

        ./subject_name/_sisu

       % configuration file e.g. sisurc.yml

        ./subject_name/_sisu/skin

       % skins in various skin directories doc, dir, site, yml

        ./subject_name/_sisu/css

        ./subject_name/_sisu/image

       % images for documents contained in this directory

        ./subject_name/_sisu/mm

SISUPOD

        ./sisupod/

       % files stored at this level e.g. sisu_manual.sst

        ./sisupod/_sisu

       % configuration file e.g. sisurc.yml

        ./sisupod/_sisu/skin

       % skins in various skin directories doc, dir, site, yml

        ./sisupod/_sisu/css

        ./sisupod/_sisu/image

       % images for documents contained in this directory

        ./sisupod/_sisu/mm

ORGANISING CONTENT

HOMEPAGES

        SiSU is about the ability to auto-generate documents. Home pages are regarded  as  custom
       built  items, and are not created by SiSU.  More accurately, SiSU has a default home page,
       which will not be appropriate for use with other sites, and the means to provide your  own
       home page instead in one of two ways as part of a site's configuration, these being:

         1.  through  placing your home page and other custom built documents in the subdirectory
       _sisu/home/ (this probably being the easier and more convenient option)

        2. through providing what you want as the home page in a skin,

        Document sets are contained in directories, usually organised by site  or  subject.  Each
       directory  can/should  have  its  own homepage. See the section on directory structure and
       organisation of content.

HOME PAGE AND OTHER CUSTOM BUILT PAGES IN A SUB-DIRECTORY

        Custom built pages,  including  the  home  page  index.html  may  be  placed  within  the
       configuration  directory  _sisu/home/  in  any  of  the locations that is searched for the
       configuration directory, namely ./_sisu ; ~/_sisu ; /etc/sisu From there they  are  copied
       to the root of the output directory with the command:

        sisu -CC

MARKUP AND OUTPUT EXAMPLES

MARKUP EXAMPLES

        Current markup examples and document output samples are provided off <http://sisudoc.org>
       or <http://www.jus.uio.no/sisu> and in  the  sisu  -markup-sample  package  available  off
       <http://sources.sisudoc.org>

         For some documents hardly any markup at all is required at all, other than a header, and
       an indication that the levels to be taken into account by the program  in  generating  its
       output are.

SISU MARKUP SAMPLES

         A  few  additional  sample  books  prepared as sisu markup samples, output formats to be
       generated using SiSU are contained  in  a  separate  package  sisu  -markup-samples.  sisu
       -markup-samples  contains  books (prepared using sisu markup), that were released by their
       authors various licenses mostly different Creative Commons licences  that  do  not  permit
       inclusion in the Debian Project as they have requirements that do not meet the Debian Free
       Software Guidelines for various reasons, most commonly that they require that the original
       substantive  text  remain  unchanged,  and  sometimes  that  the  works  be used only non-
       commercially.

        Accelerando, Charles Stross (2005) accelerando.charles_stross.sst

             Alice's      Adventures      in      Wonderland,      Lewis      Carroll      (1865)
       alices_adventures_in_wonderland.lewis_carroll.sst

        CONTENT, Cory Doctorow (2008) content.cory_doctorow.sst

               Democratizing        Innovation,        Eric        von        Hippel       (2005)
       democratizing_innovation.eric_von_hippel.sst

           Down    and    Out    in    the    Magic     Kingdom,     Cory     Doctorow     (2003)
       down_and_out_in_the_magic_kingdom.cory_doctorow.sst

        For the Win, Cory Doctorow (2010) for_the_win.cory_doctorow.sst

         Free  as  in Freedom - Richard Stallman's Crusade for Free Software, Sam Williams (2002)
       free_as_in_freedom.richard_stallman_crusade_for_free_software.sam_williams.sst

        Free as in Freedom 2.0 - Richard Stallman and the Free Software Revolution, Sam  Williams
       (2002),               Richard               M.               Stallman               (2010)
       free_as_in_freedom_2.richard_stallman_and_the_free_software_revolution.sam_williams.richard_stallman.sst

        Free Culture - How Big Media Uses Technology and the Law to Lock Down Culture and Control
       Creativity, Lawrence Lessig (2004) free_culture.lawrence_lessig.sst

        Free For All - How Linux and the Free Software Movement Undercut the  High  Tech  Titans,
       Peter Wayner (2002) free_for_all.peter_wayner.sst

        GNU GENERAL PUBLIC LICENSE v2, Free Software Foundation (1991) gpl2.fsf.sst

        GNU GENERAL PUBLIC LICENSE v3, Free Software Foundation (2007) gpl3.fsf.sst

        Gulliver's Travels, Jonathan Swift (1726 / 1735) gullivers_travels.jonathan_swift.sst

        Little Brother, Cory Doctorow (2008) little_brother.cory_doctorow.sst

             The      Cathederal      and      the      Bazaar,      Eric      Raymond     (2000)
       the_cathedral_and_the_bazaar.eric_s_raymond.sst

         The  Public  Domain  -  Enclosing  the  Commons  of  the  Mind,   James   Boyle   (2008)
       the_public_domain.james_boyle.sst

         The  Wealth  of  Networks - How Social Production Transforms Markets and Freedom, Yochai
       Benkler (2006) the_wealth_of_networks.yochai_benkler.sst

              Through       the       Looking       Glass,       Lewis       Carroll       (1871)
       through_the_looking_glass.lewis_carroll.sst

         Two  Bits  -  The  Cultural  Significance  of  Free  Software,  Christopher Kelty (2008)
       two_bits.christopher_kelty.sst

           UN    Contracts     for     International     Sale     of     Goods,     UN     (1980)
       un_contracts_international_sale_of_goods_convention_1980.sst

        Viral Spiral, David Bollier (2008) viral_spiral.david_bollier.sst

SISU SEARCH - INTRODUCTION

         SiSU  output  can  easily and conveniently be indexed by a number of standalone indexing
       tools, such as Lucene, Hyperestraier.

        Because the document structure of sites created is clearly defined, and the  text  object
       citation  system  is  available  hypothetically  at  least, for all forms of output, it is
       possible to search the sql database, and either read results from that database,  or  just
       as simply map the results to the html output, which has richer text markup.

         In addition to this SiSU has the ability to populate a relational sql type database with
       documents at an object level, with objects numbers that are shared across different output
       types,  which  make them searchable with that degree of granularity. Basically, your match
       criteria is met by these documents and at these locations within each document, which  can
       be viewed within the database directly or in various output formats.

SQL

POPULATING SQL TYPE DATABASES

         SiSU feeds sisu markupd documents into sql type databases PostgreSQL [^21] and/or SQLite
       [^22] database together with information related to document structure.

        This is one of the more interesting output forms, as  all  the  structural  data  of  the
       documents  are  retained (though can be ignored by the user of the database should they so
       choose). All site texts/documents are (currently) streamed to four tables:

        * one containing semantic (and other) headers, including, title, author,
         subject, (the
         .I Dublin Core.
         ..);

        * another the substantive texts by individual "paragraph" (or object) - along
         with structural information, each paragraph being identifiable by its
         paragraph number (if it has one which almost all of them do), and the
         substantive text of each paragraph quite naturally being searchable (both in
         formatted and clean text versions for searching); and

        * a third containing endnotes cross-referenced back to the paragraph from
         which they are referenced (both in formatted and clean text versions for
         searching).

        * a fourth table with a one to one relation with the headers table contains
         full text versions of output, eg. pdf, html, xml, and
         .I ascii.

        There is of course the possibility to add further structures.

        At this level SiSU loads a relational database with documents chunked into objects, their
       smallest  logical  structurally  constituent  parts,  as  text  objects, with their object
       citation number and all other structural information needed  to  construct  the  document.
       Text is stored (at this text object level) with and without elementary markup tagging, the
       stripped version being so as to facilitate ease of searching.

        Being able to search a relational database at an object  level  with  the  SiSU  citation
       system  is  an  effective  way  of locating content generated by SiSU.  As individual text
       objects of a document stored (and indexed) together with object numbers, and all  versions
       of  the  document have the same numbering, complex searches can be tailored to return just
       the locations of the search results relevant for all available output formats,  with  live
       links  to  the  precise  locations  in  the  database  or  in  html/xml documents; or, the
       structural information provided makes it possible to  search  the  full  contents  of  the
       database  and  have  headings  in which search content appears, or to search only headings
       etc. (as the Dublin Core is incorporated it is easy to make use of that as well).

POSTGRESQL

NAME

        SiSU  -  Structured  information,  Serialized  Units  -  a  document  publishing  system,
       postgresql dependency package

DESCRIPTION

         Information  related  to  using postgresql with sisu (and related to the sisu_postgresql
       dependency package, which is a dummy package to install dependencies needed  for  SiSU  to
       populate a postgresql database, this being part of SiSU - man sisu) .

SYNOPSIS

        sisu -D [instruction] [filename/wildcard if required]

        sisu -D --pg --[instruction] [filename/wildcard if required]

COMMANDS

         Mappings  to  two  databases  are  provided  by default, postgresql and sqlite, the same
       commands are used within sisu to construct and populate databases however  -d  (lowercase)
       denotes  sqlite  and  -D (uppercase) denotes postgresql, alternatively --sqlite or --pgsql
       may be used

        -D or --pgsql may be used interchangeably.

CREATE AND DESTROY DATABASE

       --pgsql --createall
              initial step, creates required relations (tables, indexes) in existing (postgresql)
              database  (a database should be created manually and given the same name as working
              directory, as requested) (rb.dbi)

       sisu -D --createdb
              creates database where no database existed before

       sisu -D --create
              creates database tables where no database tables existed before

       sisu -D --Dropall
              destroys database (including all its content)! kills data and drops tables, indexes
              and database associated with a given directory (and directories of the same name).

       sisu -D --recreate
              destroys existing database and builds a new empty database structure

IMPORT AND REMOVE DOCUMENTS

       sisu -D --import -v [filename/wildcard]
              populates database with the contents of the file. Imports documents(s) specified to
              a postgresql database (at an object level).

       sisu -D --update -v [filename/wildcard]
              updates file contents in database

       sisu -D --remove -v [filename/wildcard]
              removes specified document from postgresql database.

SQLITE

NAME

        SiSU - Structured information, Serialized Units - a document publishing system.

DESCRIPTION

        Information related to using sqlite with sisu (and related to the sisu_sqlite  dependency
       package,  which  is a dummy package to install dependencies needed for SiSU to populate an
       sqlite database, this being part of SiSU - man sisu) .

SYNOPSIS

        sisu -d [instruction] [filename/wildcard if required]

        sisu -d --(sqlite|pg) --[instruction] [filename/wildcard if required]

COMMANDS

        Mappings to two databases are provided  by  default,  postgresql  and  sqlite,  the  same
       commands  are  used within sisu to construct and populate databases however -d (lowercase)
       denotes sqlite and -D (uppercase) denotes postgresql, alternatively  --sqlite  or  --pgsql
       may be used

        -d or --sqlite may be used interchangeably.

CREATE AND DESTROY DATABASE

       --sqlite --createall
              initial  step,  creates  required  relations (tables, indexes) in existing (sqlite)
              database (a database should be created manually and given the same name as  working
              directory, as requested) (rb.dbi)

       sisu -d --createdb
              creates database where no database existed before

       sisu -d --create
              creates database tables where no database tables existed before

       sisu -d --dropall
              destroys database (including all its content)! kills data and drops tables, indexes
              and database associated with a given directory (and directories of the same name).

       sisu -d --recreate
              destroys existing database and builds a new empty database structure

IMPORT AND REMOVE DOCUMENTS

       sisu -d --import -v [filename/wildcard]
              populates database with the contents of the file. Imports documents(s) specified to
              an sqlite database (at an object level).

       sisu -d --update -v [filename/wildcard]
              updates file contents in database

       sisu -d --remove -v [filename/wildcard]
              removes specified document from sqlite database.

INTRODUCTION

SEARCH - DATABASE FRONTEND SAMPLE, UTILISING DATABASE AND SISU FEATURES,

       INCLUDING OBJECT CITATION NUMBERING (BACKEND CURRENTLY POSTGRESQL)

         Sample  search  frontend  <http://search.sisudoc.org>  [^23] A small database and sample
       query front-end (search from) that makes use  of  the  citation  system,  object  citation
       numbering to demonstrates functionality.[^24]

         SiSU can provide information on which documents are matched and at what locations within
       each document the matches are found. These results are relevant across all  outputs  using
       object  citation  numbering, which includes html, XML, EPUB, LaTeX, PDF and indeed the SQL
       database. You can then refer to one of the other outputs or in the SQL database expand the
       text within the matched objects (paragraphs) in the documents matched.

         Note you may set results either for documents matched and object number locations within
       each matched document meeting the search criteria; or display the names of  the  documents
       matched along with the objects (paragraphs) that meet the search criteria.[^25]

       sisu -F --webserv-webrick
              builds a cgi web search frontend for the database created

               The following is feedback on the setup on a machine provided by the help command:

               sisu --help sql
              Postgresql
                user:             ralph
                current db set:   SiSU_sisu
                port:             5432
                dbi connect:      DBI:Pg:database=SiSU_sisu;port=5432
              sqlite
                current db set:   /home/ralph/sisu_www/sisu/sisu_sqlite.db
                dbi connect       DBI:SQLite:/home/ralph/sisu_www/sisu/sisu_sqlite.db

               Note on databases built

               By default, [unless otherwise specified] databases are built on a directory basis,
              from collections of documents within that directory. The name of the directory  you
              choose  to  work  from  is  used as the database name, i.e. if you are working in a
              directory called /home/ralph/ebook the database SiSU_ebook is  used.  [otherwise  a
              manual mapping for the collection is necessary]

SEARCH FORM

       sisu -F
              generates  a  sample  search  form,  which  must  be  copied  to the web-server cgi
              directory

       sisu -F --webserv-webrick
              generates a sample search form for use with  the  webrick  server,  which  must  be
              copied to the web-server cgi directory

       sisu -W
              starts  the  webrick  server  which  should  be available wherever sisu is properly
              installed

               The generated search form must be copied manually to the  webserver  directory  as
              instructed

SISU_WEBRICK

NAME

        SiSU - Structured information, Serialized Units - a document publishing system

SYNOPSIS

        sisu_webrick [port]

        or

        sisu -W [port]

DESCRIPTION

         sisu_webrick is part of SiSU (man sisu) sisu_webrick starts Ruby SiSU output is written,
       providing a list of these directories (assuming SiSU is in use and they exist).

        The default port for sisu_webrick is set to 8081, this may be modified in the yaml  file:
       ~/.sisu/sisurc.yml  a  sample  of  which  is  provided  as /etc/sisu/sisurc.yml (or in the
       equivalent directory on your system).

SUMMARY OF MAN PAGE

        sisu_webrick, may be started on it's own with the command: sisu_webrick [port]  or  using
       the sisu command with the -W flag: sisu -W [port]

        where no port is given and settings are unchanged the default port is 8081

DOCUMENT PROCESSING COMMAND FLAGS

         sisu  -W  [port] starts Ruby Webrick web-server, serving SiSU output directories, on the
       port provided, or if no port is provided  and  the  defaults  have  not  been  changed  in
       ~/.sisu/sisurc.yaml then on port 8081

SUMMARY OF FEATURES

         *  sparse/minimal  markup (clean utf-8 source texts). Documents are prepared in a single
       UTF-8 file using a minimalistic mnemonic syntax. Typical literature, documents  like  "War
       and Peace" require almost no markup, and most of the headers are optional.

         * markup is easily readable/parsable by the human eye, (basic markup is simpler and more
       sparse than the most basic HTML ) , [this may also be converted to XML representations  of
       the same input/source document].

         *  markup  defines  document  structure (this may be done once in a header pattern-match
       description, or for heading levels individually); basic text  attributes  (bold,  italics,
       underscore,  strike-through  etc.)  as  required;  and semantic information related to the
       document (header information, extended beyond the Dublin core and easily further  extended
       as  required);  the  headers  may  also  contain  processing instructions.  SiSU markup is
       primarily an abstraction of document structure and  document  metadata  to  permit  taking
       advantage  of  the  basic  strengths  of  existing  alternative practical standard ways of
       representing documents [be that browser  viewing,  paper  publication,  sql  search  etc.]
       (html, epub, xml, odf, latex, pdf, sql)

          *   for   output  produces  reasonably  elegant  output  of  established  industry  and
       institutionally accepted open  standard  formats.[3]  takes  advantage  of  the  different
       strengths  of  various  standard  formats  for  representing documents, amongst the output
       formats currently supported are:

        * HTML - both as a single scrollable text and a segmented document

        * XHTML

        * EPUB

        * XML - both in sax and dom style xml structures for further development as required

        * ODT - Open Document Format text, the iso standard for document storage

        * LaTeX - used to generate pdf

        * PDF (via LaTeX )

        * SQL - population of an sql database ( PostgreSQL or SQLite )  ,  (at  the  same  object
       level that is used to cite text within a document)

        Also produces: concordance files; document content certificates (md5 or sha256 digests of
       headings, paragraphs, images etc.) and html manifests (and sitemaps of content). (b) takes
       advantage  of  the  strengths  implicit  in  these very different output types, (e.g. PDFs
       produced using typesetting of LaTeX, databases populated with documents at  an  individual
       object/paragraph level, making possible granular search (and related possibilities))

        * ensuring content can be cited in a meaningful way regardless of selected output format.
       Online publishing (and publishing in multiple document formats)  lacks  a  useful  way  of
       citing  text internally within documents (important to academics generally and to lawyers)
       as page numbers are meaningless across browsers and  formats.  sisu  seeks  to  provide  a
       common way of pinpoint the text within a document, (which can be utilized for citation and
       by search engines).  The outputs share a common numbering system that  is  meaningful  (to
       man  and  machine) across all digital outputs whether paper, screen, or database oriented,
       (pdf, HTML, EPUB, xml, sqlite,  postgresql)  ,  this  numbering  system  can  be  used  to
       reference content.

         *  Granular  search  within  documents.   SQL databases are populated at an object level
       (roughly headings, paragraphs, verse, tables) and become searchable with  that  degree  of
       granularity,  the  output  information  provides  the  object/paragraph  numbers which are
       relevant across all generated outputs; it is also possible to look at  just  the  matching
       paragraphs  of  the documents in the database; [output indexing also work well with search
       indexing tools like hyperestraier].

        * long term maintainability of document collections  in  a  world  of  changing  formats,
       having  a  very sparsely marked-up source document base. there is a considerable degree of
       future-proofing, output representations are "upgradeable", and new document formats may be
       added.  e.g.  addition  of  odf  (open  document text) module in 2006, epub in 2009 and in
       future html5 output sometime in future, without modification of existing prepared texts

        * SQL search aside, documents are generated as required and static once generated.

        * documents produced are static files, and may be batch processed, this needs to be  done
       only once but may be repeated for various reasons as desired (updated content, addition of
       new output formats, updated technology document presentations/representations)

        * document source ( plaintext utf-8) if shared on the  net  may  be  used  as  input  and
       processed locally to produce the different document outputs

         *  document  source  may  be  bundled together (automatically) with associated documents
       (multiple language versions or master document with inclusions) and images and sent  as  a
       zip  file  called  a  sisupod,  if shared on the net these too may be processed locally to
       produce the desired document outputs

        * generated document outputs may automatically be posted to remote sites.

        * for basic document generation, the only software dependency is Ruby, and a few standard
       Unix  tools  (this covers plaintext, HTML, EPUB, XML, ODF, LaTeX ) . To use a database you
       of course need that, and to convert the LaTeX generated to pdf,  a  latex  processor  like
       tetex or texlive.

        * as a developers tool it is flexible and extensible

        Syntax highlighting for SiSU markup is available for a number of text editors.

        SiSU is less about document layout than about finding a way with little markup to be able
       to construct an abstract representation of a document that makes it  possible  to  produce
       multiple  representations of it which may be rather different from each other and used for
       different purposes, whether layout and publishing, or search of content

        i.e. to be able to take advantage from this minimal preparation starting point of some of
       the strengths of rather different established ways of representing documents for different
       purposes, whether for search (relational database, or indexed  flat  files  generated  for
       that  purpose  whether  of complete documents, or say of files made up of objects), online
       viewing (e.g. html, xml, pdf) , or paper publication (e.g. pdf) ...

        the solution arrived at is by extracting structural information about the document (about
       headings within the document) and by tracking objects (which are serialized and also given
       hash values) in the manner described. It makes possible  representations  that  are  quite
       different  from  those offered at present. For example objects could be saved individually
       and identified by their hashes, with an index of how the objects relate to each  other  to
       form a document.

       1.     objects   include:   headings,   paragraphs,   verse,   tables,   images,  but  not
              footnotes/endnotes which are numbered separately and tied to the object from  which
              they are referenced.

       2.     i.e. the HTML, PDF, EPUB, ODT outputs are each built individually and optimised for
              that form of presentation, rather than for example the html being a  saved  version
              of the odf, or the pdf being a saved version of the html.

       3.     the different heading levels

       4.     units  of  text,  primarily  paragraphs and headings, also any tables, poems, code-
              blocks

       5.     An open standard format for e-books

       6.     Open Document Format ( ODF ) text

       7.     Specification submitted by Adobe to ISO to become a  full  open  ISO  specification
              <http://www.linux-watch.com/news/NS7542722606.html>

       8.     ISO standard ISO/IEC 26300:2006

       *1.    square brackets

       *2.    square brackets

       +1.    square brackets

       9.     <http://www.jus.uio.no/sisu/man/>

       10.    <http://www.jus.uio.no/sisu/man/sisu.1.html>

       11.    From  sometime  after SiSU 0.58 it should be possible to describe SiSU markup using
              SiSU, which though not an original design goal is useful.

       12.    files should be prepared using UTF-8 character encoding

       13.    a footnote or endnote

       14.    self contained endnote marker & endnote in one

       *.     unnumbered asterisk footnote/endnote, insert multiple asterisks if required

       **.    another unnumbered asterisk footnote/endnote

       *3.    editors notes, numbered asterisk footnote/endnote series

       +2.    editors notes, numbered plus symbol footnote/endnote series

       15.    <http://www.sisudoc.org/>

       16.    <http://www.ruby-lang.org/en/>

       18.    Table     from     the     Wealth     of     Networks     by     Yochai     Benkler
              <http://www.jus.uio.no/sisu/the_wealth_of_networks.yochai_benkler>

       19.    is  not  a regular file to be worked on, and thus less likely that people will have
              "accidents", working on a .ssc file that is overwritten by  subsequent  processing.
              It  may  be  however  that when the resulting file is shared .ssc is an appropriate
              suffix to use.

       20.    SiSU has worked this way in the past, though this was dropped as it was thought the
              complexity  outweighed  the  flexibility,  however, the balance was rather fine and
              this behaviour could be reinstated.

       21.    <http://www.postgresql.org/>                      <http://advocacy.postgresql.org/>
              <http://en.wikipedia.org/wiki/Postgresql>

       22.    <http://www.hwaci.com/sw/sqlite/> <http://en.wikipedia.org/wiki/Sqlite>

       23.    <http://search.sisudoc.org>

       24.    (which  could be extended further with current back-end). As regards scaling of the
              database, it is as scalable as the database (here Postgresql) and hardware allow.

       25.    of this feature when demonstrated to an IBM software innovations evaluator in  2004
              he  said  to  paraphrase:  this  could be of interest to us. We have large document
              management systems, you can search hundreds of thousands of documents  and  we  can
              tell you which documents meet your search criteria, but there is no way we can tell
              you without opening each document where within each your matches are found.

       SEE ALSO
                     sisu(1),
                     sisu-epub(1),
                     sisu-harvest(1),
                     sisu-html(1),
                     sisu-odf(1),
                     sisu-pdf(1),
                     sisu-pg(1),
                     sisu-sqlite(1),
                     sisu-txt(1).
                     sisu_vim(7)

       HOMEPAGE
                     More information about SiSU can be  found  at  <http://www.sisudoc.org/>  or
              <http://www.jus.uio.no/sisu/>

       SOURCE
                     <http://sources.sisudoc.org/>

       AUTHOR
                     SiSU is written by Ralph Amissah <ralph@amissah.com>