Provided by: liblatex-tounicode-perl_0.54-2_all

NAME

       LaTeX::ToUnicode::Tables - Character tables for LaTeX::ToUnicode

VERSION

       version 0.54

CONSTANTS

   @LIGATURES
       Standard TeX character sequences (not \commands) which need to be replaced: "---" with U+2014 (em dash),
       etc.  Includes: em dash, en dash, inverted exclamation mark, inverted question mark, left double quote,
       right double quote, left single quote, right single quote. They are replaced in that order.
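
       The order matters because the shorter sequences are prefixes of the longer ones. A minimal sketch in
       Python (the module itself is Perl; the list below is reconstructed from the description above using the
       standard TeX ligatures, and replace_ligatures is a hypothetical helper, not part of the module):

```python
# Illustrative mirror of the ordered list; the real @LIGATURES is a Perl array
# and its exact layout is an assumption here.
LIGATURES = [
    ("---", "\u2014"),  # em dash
    ("--", "\u2013"),   # en dash
    ("!`", "\u00a1"),   # inverted exclamation mark
    ("?`", "\u00bf"),   # inverted question mark
    ("``", "\u201c"),   # left double quote
    ("''", "\u201d"),   # right double quote
    ("`", "\u2018"),    # left single quote
    ("'", "\u2019"),    # right single quote
]


def replace_ligatures(text):
    """Apply each replacement in list order, longest sequences first."""
    for sequence, replacement in LIGATURES:
        text = text.replace(sequence, replacement)
    return text
```

       If the first two entries were swapped, "---" would become an en dash followed by a stray hyphen.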

   %MARKUPS
       Hash where keys are the names of formatting commands like "\tt", without the backslash, namely: "bf cal
       em it rm sc sf sl small tt". Values are the obvious HTML equivalent where one exists, given as the tag
       name without the angle brackets: "b em i tt". Otherwise the value is the empty string.
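
       As a rough illustration of how a bare tag name can be used, the following Python sketch wraps text in the
       tag when one exists and leaves it unchanged otherwise. The dictionary is a partial, assumed excerpt
       (pairings guessed from the names), and wrap_markup with its html flag is hypothetical, not the module's
       interface:

```python
# Assumed excerpt: formatting command (no backslash) -> HTML tag name, or "".
MARKUPS = {"bf": "b", "em": "em", "it": "i", "tt": "tt", "rm": "", "sc": ""}


def wrap_markup(command, text, html=True):
    """Wrap text in the HTML tag for command, if HTML output is wanted."""
    tag = MARKUPS[command]
    if html and tag:
        # The angle brackets are added only at output time.
        return f"<{tag}>{text}</{tag}>"
    # No HTML equivalent (or plain-text output): drop the markup.
    return text
```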

   %ARGUMENT_COMMANDS
       Hash where keys are the names of TeX commands taking arguments that we handle, without the backslash,
       such as "enquote". Each value is a reference to a list of two strings, the first being the text to insert
       before the argument, the second being the text to insert after. For example, for "enquote" the value is
       "["`", "'"]". The inserted text is subject to further replacements.

       Only three such commands are currently handled: "\emph", "\enquote", and "\path".
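
       The before/after pairs can be pictured with a short Python sketch. Only the "enquote" pair is taken from
       the text above; the values for the other two commands are not stated here, so they are omitted. The
       function name and the simple brace matching (no nested braces) are assumptions for illustration:

```python
import re

# Command name (no backslash) -> [text before argument, text after argument].
ARGUMENT_COMMANDS = {"enquote": ["`", "'"]}


def expand_argument_commands(text):
    """Replace \\name{arg} with before + arg + after for each known command."""
    for name, (before, after) in ARGUMENT_COMMANDS.items():
        # Matches \name{...} where the argument contains no braces.
        pattern = re.compile(r"\\" + re.escape(name) + r"\{([^{}]*)\}")
        # A function replacement avoids backslash handling in the template.
        text = pattern.sub(lambda m: before + m.group(1) + after, text)
    return text
```

       The inserted "`" and "'" are then ordinary input for the later ligature step, which is what "subject to
       further replacements" implies.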

   %CONTROL_SYMBOLS
       A hash where the keys are non-alphabetic "\command"s (without the backslash), other than accents and
       special cases. These don't take arguments. Although some of these have Unicode equivalents, such as the
       "\," thin space, it seems better to keep the output as simple as possible; small spacing tweaks in TeX
       aren't usually desirable in plain text or HTML.

       The values are single-quoted strings '\x{...}', not double-quoted literal characters "\x{...}", to ease
       future parsing of the TeX/text/HTML.

       This hash is necessary because TeX's parsing rules for control symbols differ from those for control
       words: no space or other token is needed to terminate a control symbol.
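
       The difference in parsing rules can be sketched as follows. This Python fragment implements the general
       TeX rule (a control word is a backslash plus a run of letters, and spaces after it are skipped; a control
       symbol is a backslash plus exactly one non-letter). The function name and return shape are hypothetical:

```python
import re


def split_control_sequence(text):
    """Return (kind, name, rest) for the control sequence at the start of text."""
    word = re.match(r"\\([A-Za-z]+)", text)
    if word:
        # A control word swallows every following letter, so it needs a
        # non-letter terminator; spaces after it are skipped.
        return "word", word.group(1), text[word.end():].lstrip(" ")
    symbol = re.match(r"\\([^A-Za-z])", text, re.DOTALL)
    if symbol:
        # A control symbol is exactly one character; what follows is untouched.
        return "symbol", symbol.group(1), text[symbol.end():]
    raise ValueError("text does not start with a control sequence")
```

       For example, "\,x" splits cleanly into the symbol "," and the text "x", whereas "\LaTeXis" is read as the
       single control word "LaTeXis".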

   %CONTROL_WORDS
       Keys are names of argument-less commands, such as "\LaTeX" (without the backslash). Values are the
       replacements, often the empty string.

   %SYMBOLS
       Keys are the commands for extended characters, such as "\AA" (without the backslash).

   %ACCENT_SYMBOLS
       Two-level hash of accented characters like "\'{a}". The keys of this hash are the accent symbols (without
       the backslash), such as "`" and "'". The corresponding values are hash references where the keys are the
       base letters and the values are single-quoted '\x{....}' strings.
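
       A two-level lookup of this kind might look like the Python sketch below. The entries are a small assumed
       excerpt, the exact spelling of the stored escapes (zero padding, hex case) is a guess, and both helper
       functions are hypothetical:

```python
import re

# Assumed excerpt: accent symbol -> base letter -> escape text.
# The values are the literal characters \x{....}, not the decoded character.
ACCENT_SYMBOLS = {
    "'": {"a": r"\x{00E1}", "e": r"\x{00E9}"},
    "`": {"a": r"\x{00E0}"},
}


def decode_escape(escape):
    """Turn the text \\x{HHHH} into the character with that code point."""
    match = re.fullmatch(r"\\x\{([0-9A-Fa-f]+)\}", escape)
    if match is None:
        raise ValueError(f"not a \\x{{...}} escape: {escape!r}")
    return chr(int(match.group(1), 16))


def lookup_accent(accent, letter):
    """First level: the accent; second level: the base letter."""
    return decode_escape(ACCENT_SYMBOLS[accent][letter])
```

       Keeping the escape as text until the last step means intermediate passes never see a non-ASCII character.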

   %ACCENT_LETTERS
       Same as %ACCENT_SYMBOLS, except the keys are accents that are alphabetic, such as "\c" (without the
       backslash as always).

       As with control sequences, it's necessary to distinguish symbols and alphabetic commands because of the
       different parsing rules.

   %GERMAN
       Character sequences (not necessarily commands) as defined by the package `german'/`ngerman', e.g. ""a" (a
       with umlaut), ""s" (German sharp s) or ""`" (German left quote). Note the missing backslash.

       The keys of this hash are the literal character sequences.
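
       Because the keys are literal sequences rather than patterns, any regex-based matcher has to escape them.
       A minimal Python sketch, using three assumed entries based on the standard `german' shorthands (the
       dictionary contents and replace_german are illustrative, not the module's data or API):

```python
import re

# Assumed excerpt: literal input sequence -> replacement character.
GERMAN = {
    '"a': "\u00e4",  # a with umlaut
    '"s': "\u00df",  # sharp s
    '"`': "\u201e",  # German left (low) double quote
}

# Escape each key so it is matched literally; try longer keys first.
_GERMAN_PATTERN = re.compile(
    "|".join(re.escape(key) for key in sorted(GERMAN, key=len, reverse=True))
)


def replace_german(text):
    """Replace every known literal sequence in a single pass."""
    return _GERMAN_PATTERN.sub(lambda m: GERMAN[m.group(0)], text)
```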

AUTHOR

       Gerhard Gossen <gerhard.gossen@googlemail.com>, Boris Veytsman <boris@varphi.com>, Karl Berry
       <karl@freefriends.org>

       <https://github.com/borisveytsman/bibtexperllibs>

COPYRIGHT AND LICENSE

       Copyright 2010-2023 Gerhard Gossen, Boris Veytsman, Karl Berry

       This is free software; you can redistribute it and/or modify it under the same terms as the Perl 5
       programming language system itself.