lunar (1) xhtml2ps.1.gz

Provided by: html2ps_1.0b7-6_all bug

NAME

       html2ps - convert HTML to PostScript

SYNOPSIS

       html2ps  [  -2cdDFghHLnORtTuUv  ]  [  -b  URL  ]  [  -C  string  ]  [  -e  encoding ] [ -f
       file[:file[:...]]  ] [ -i num ] [ -k file ] [ -l lang ] [ -m num ] [ -M num ] [ -N num ] [
       -o file ] [ -r path ] [ -s num ] [ -S string ] [ -W string ] [ -x num ] [ URL|file ]

DESCRIPTION

       The  program  html2ps converts HTML to PostScript. The HTML code can be retrieved from one
       or more URL:s or local files,  specified  as  parameters  on  the  command  line.   If  no
       parameter is given, html2ps reads from standard input.

       Note:  To  avoid  unnecessary  network  traffic,  one  can  rebuild  an  already generated
       PostScript file with new options. This is done by running html2ps with  the  new  options,
       and with the old PostScript file as input (not applicable for all options).

OPTIONS

       All options have a short (case sensitive), and a long (case insensitive) form.

       -2 --twoup
              Two column (2-up) output. The default is one column per page.

       -b URL --base URL
              Use  URL as a base to expand relative references for in-line images. This is useful
              if you have downloaded a document to a local file.  The URL should then be the  URL
              of the original document.

       -c --check
              Check  the  syntax of the HTML file (using an external syntax checker). The default
              is to not make a syntax check.

       -C string --toc string
              Generate a table of contents (ToC). The value should be a string consisting of  one
              of the letters 'f', 'h', or 't', optionally combined with the letter 'b':

              b      The ToC will be printed first. This requires that Ghostscript is installed.
              f      The ToC will be generated from the links in the converted document.
              h      The  ToC  will  be  generated  from  headings  and  titles  in the converted
                     documents. Note that if the document author  for  some  strange  reason  has
                     chosen  to  use  some  other  means  to represent the headings than the HTML
                     elements H1,...,H6, you are out of luck!
              t      The ToC will be generated from links having the  attribute  rev=TOC  in  the
                     converted document.

       -d --debug
              Generate  debugging  information.  You should always use this option when reporting
              problems with html2ps.

       -D --dsc --DSC
              Generate DSC compliant PostScript. This requires Ghostscript  and  can  take  quite
              some  time  to do. Note that a PostScript file generated with this option cannot be
              used as input to html2ps for reformatting later.

       -e encoding --encoding encoding
              The document encoding. Currently recognized values are ISO-8859-1,  EUC-JP,  SHIFT-
              JIS,  and  ISO-2022-JP  (other  EUC-xx  encodings  may  also  work). The default is
              ISO-8859-1.

       -f file[:file[:...]] --rcfile file[:file[:...]]
              A colon separated list of configuration file names to use instead  of  the  default
              personal  configuration  file  $HOME/.html2psrc.   Definitions  made  in  one  file
              override definitions in previous files (the last  file  in  the  list  has  highest
              precedence).  An  empty  file name (as in ':file', 'file1::file3', or 'file:') will
              expand to the default personal file. The environment variable HTML2PSPATH  is  used
              to  specify  the  directories  where to search for these files. (Note: this is only
              supposed to be used on the command line, not in a configuration file.)

       -F --frame
              Draw a frame around the text on each page. The default is to not draw a frame.

       -g --grayscale
              Convert colour images to grayscale images. Note that the PostScript  file  will  be
              smaller  when  the  images  are  converted to grayscale. The default is to generate
              colour images.

       -h --help
              Show usage information.

       -H --hyphenate
              Hyphenate the text. This requires TeX hyphenation pattern files.

       -i num --scaleimage num
              Scale in-line images with a factor num The default is 1.

       -k file --cookie file
              Enable cookie support, using a netscape formatted  cookie  file  (requires  libwww-
              perl).

       -l lang --language lang
              Specifies the language of the document (overrides an eventual LANG attribute of the
              BODY   element).    The   language   should   be   given   according   to   RFC1766
              (ftp://ftp.nordu.net/rfc/rfc1766.txt)             and            ISO            639
              (http://www.w3.org/WAI/ER/IG/ert/iso639.htm).

       -L --landscape
              Generate code for printing in landscape mode. The default is portrait mode.

       -m num --scalemath num
              Scale mathematical formulas with a factor num The default is 1.

       -M num --mainchapter num
              Specifies the start number for automatic numbering of headings (by setting the seq-
              number parameter), the default is 1.

       -n --number
              Insert page numbers. The default is to not number the pages.

       -N num --startno num
              Specifies the starting page number, the default is 1.

       -o file --output file
              Write the PostScript code to file.  The default is to write to standard output.

       -O --original
              Use  PostScript  original images if they exist. For example, if a document contains
              an image figure.gif, and an encapsulated PostScript file named figure.ps exists  in
              the  same  directory,  that  file will be use instead. This only work for documents
              read as local files. Note: if the PostScript  file  is  large  or  contains  bitmap
              images,  this must be combined with the -D option. In HTML 4.0 this can be achieved
              in a much better way with:

              <OBJECT data="figure.ps" type="application/postscript">
              <OBJECT data="figure.gif" type="image/gif">
              <PRE>[Maybe some ASCII art for text browsers]</PRE>
              </OBJECT>
              </OBJECT>

       -r path --rootdir path
              When a document is read from a local file, this value specifies  a  base  directory
              for  resolving  relative  links  starting  with "/".  Typically, this should be the
              directory where your web server's home page resides.

       -R --xref
              Insert cross references at every link to within the set of converted documents.

       -s num --scaledoc num
              Scale the entire document with a factor num The default is 1.

       -S string --style string
              This option complements/overrides definitions made in the configuration files.  The
              string  must  follow the configuration file syntax. (Note: this is only supposed to
              be used on the command line, not in a configuration file.)

       -t --titlepage
              Generate a title page. The default is to not generate one.

       -T --text
              Text mode, ignore images. The default is to include the images.

       -u --underline
              Underline text that constitutes a hypertext link. The default is to not underline.

       -U --colour
              Produce colour output for text and background,  when  specified.   The  default  is
              black text on white background (mnemonic: coloUr ;-).

       -v --version
              Print information about the current version of html2ps.

       -W string --web string
              Process  a  web of documents by recursively retrieve and convert documents that are
              referenced with hyperlinks. When dealing with remote documents it will of course be
              necessary  to impose restrictions, to avoid downloading the entire web... The value
              should be a string consisting of one of the letters 'a', 'b',  'l',  'r',  or  's',
              optionally  combined  with  a  combination  of the letters 'p', 'L', and a positive
              integer:

              a      Follow all links.
              b      Follow only links to within the same  directory,  or  below,  as  the  start
                     document.
              l      Follow only links specified with "<LINK rel=NEXT>" in the document.
              p      Prompt  for  each  remote  document. This mode will automatically be entered
                     after the first 50 documents.
              r      Follow only relative links.
              s      Follow only links to within the same server as the start document.
              L      With this option, the order in which the documents are  processed  will  be:
                     first  all top level documents, then the documents linked to from these etc.
                     For example, if the document A has links to B and C, and B has a link to  D,
                     the  order  will  be A-B-C-D.  By default, each document will be followed by
                     the first document it links to etc; so the default order for the example  is
                     A-B-D-C.
              #      A  positive  integer giving the number of recursive levels. The default is 4
                     (when the option is present).

       -x num --duplex num
              Generate postscript code for single or double sided printing.   No  default,  valid
              values are:

              0      Single sided.
              1      Double sided.
              2      Double sided, opposite page reversed (tumble mode).

BUGS

       (This is incomplete.)

       The  CELLSPACING  attribute  of  the  TABLE element is not implemented as described in the
       specification; instead the value of the CELLPADDING attribute is  increased  by  half  the
       value of CELLSPACING.

       Rendering   HTML   tables  well  is  a  non-trivial  task.  For  "real"  tables,  that  is
       representation of tabular data, html2ps usually generates  reasonably  good  output.  When
       tables  are  used  for  layout  purposes,  the result varies from good to useless. This is
       because a table cell is never broken across pages. So if a table contains a  cell  with  a
       lot  of content, the entire table may have to be scaled down in size in order to make this
       cell fit on a single page. Sometimes this may even result in unreadable output.

       Page breaks are occasionally done in bad places:  for  example  directly  after  a  (long)
       heading, and before the last line in a paragraph.

ENVIRONMENT

       HTML2PSPATH
              This  variable  specifies  the  directories  to  search for configuration files. It
              should be a colon separated list of directory names. Use a dot '.'  to  denote  the
              current  directory. An empty directory name (as in ':dir', 'dir1::dir3', or 'dir:')
              will expand to the directory where the global configuration file  is.  The  default
              value  is  '.:',  that  is: search the current directory first, and then the global
              one.

FILES

       $HOME/.html2psrc
              User configuration file, see html2psrc(5).

SEE ALSO

       html2psrc(5), perl(1), setlocale(3), strftime(3), weblint(1)

VERSION

       This manpage describes html2ps version 1.0 beta7.

AVAILABILITY

       http://user.it.uu.se/~jan/html2ps.html

AUTHOR

       Jan Karrman (jan@it.uu.se)