Provided by: stx2any_1.56-2.1_all

NAME

       html2stx - convert HTML documents into Stx

SYNOPSIS

       html2stx [ file ]

DESCRIPTION

       html2stx  takes the given file, which should contain an HTML document, and converts it to structured text
       (Stx).  If no file is given, standard input is read instead.

       The program does not attempt to convert every  possibly  convertible  piece  of  markup  into  Stx.   For
       example, <font> tags are simply ignored.  This tends to result in a nice, clean, beautiful document.  (If
       it doesn't, the source document probably does not contain enough information to start with.)
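
       The idea of ignoring presentational tags while keeping their text can be sketched as follows.  This is
       only an illustration built on Python's standard html.parser module, not the code html2stx uses; the
       names TagDropper, IGNORED and strip_ignored are hypothetical, and the output here is still HTML
       rather than Stx.

```python
from html.parser import HTMLParser


class TagDropper(HTMLParser):
    """Pass markup through unchanged, except for tags listed in IGNORED."""

    # Assumption for illustration: only <font> is dropped here.
    IGNORED = {"font"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.parts = []

    def handle_starttag(self, tag, attrs):
        if tag not in self.IGNORED:
            # Re-emit the start tag exactly as it appeared in the input.
            self.parts.append(self.get_starttag_text())

    def handle_startendtag(self, tag, attrs):
        # Self-closing tags such as <br/> are emitted once, not twice.
        if tag not in self.IGNORED:
            self.parts.append(self.get_starttag_text())

    def handle_endtag(self, tag):
        if tag not in self.IGNORED:
            self.parts.append("</%s>" % tag)

    def handle_data(self, data):
        # Text inside an ignored tag is kept; only the tag itself vanishes.
        self.parts.append(data)


def strip_ignored(html):
    """Return html with the ignored tags removed and their contents kept."""
    parser = TagDropper()
    parser.feed(html)
    parser.close()  # flush any text still buffered by the parser
    return "".join(parser.parts)
```

       For instance, strip_ignored('<p><font color="red">Hello</font> world</p>') returns
       '<p>Hello world</p>'.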

OPTIONS

       None.

DIAGNOSTICS

       html2stx is a Python script and will raise an exception if something goes amiss.  In this case, the
       exit status will be non-zero.
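
       A caller can rely on this behaviour to detect failures.  The sketch below is one possible wrapper,
       assuming html2stx is on the PATH and reads the document from standard input as described above; the
       function name convert and its command parameter are hypothetical and exist only for this example.

```python
import subprocess


def convert(html_text, command=("html2stx",)):
    """Pipe html_text to the converter on stdin and return its stdout.

    Raises RuntimeError when the process exits with a non-zero status,
    which is how html2stx signals that something went wrong.
    """
    result = subprocess.run(
        list(command),
        input=html_text,
        capture_output=True,  # collect stdout and stderr instead of printing
        text=True,            # exchange str rather than bytes
    )
    if result.returncode != 0:
        raise RuntimeError(
            "converter exited with status %d: %s"
            % (result.returncode, result.stderr.strip())
        )
    return result.stdout
```

       The command parameter makes it possible to substitute another program for testing; with the default
       value, the call fails with FileNotFoundError if html2stx is not installed.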

SEE ALSO

       stx2any(1), Stx-ref.html

BUGS

           •   The word wrapping algorithm is probably not very clever.

           •   Sometimes there are extra linebreaks in the output.

           •   Probably many others.

AUTHOR

       This manual page was written by Panu A. Kalliokoski.

       html2stx is derived from the html2text utility by Aaron Swartz.  html2text is a utility for converting
       HTML into “Markdown” structured text; the changes required to make it work for Stx were made by Panu
       Kalliokoski.