Provided by: manpages-zh_1.5.2-1_all bug

NAME

       Unicode - 16

 (DESCRIPTION)
         ISO  10646    (Universal  Character  Set,  UCS).   UCS ,  (round-trip
       compatibility), UCS ,

       UCS ,: Greek, Cyrillic, Hebrew,Arabic, Armenian,  Gregorian,  Japanese,
       Chinese,   Hiragana,  Katakana,  Korean,  Hangul,  Devangari,  Bengali,
       Gurmukhi, Gujarati, Oriya, Tamil, Telugu, Kannada, alayam,  Thai,  Lao,
       Bopomofo,.,  Tibetian,  Khmer,  Runic, Ethiopian, Hieroglyphics,  Indo-
       European , ,  .1993  ,  .  ,  ,  TeX,  PostScript,  MS-DOS,  Macintosh,
       Videotext, OCR, , , , , .

       UCS   (ISO 10646)  31 , ,  65534  (0x0000-0xfffd,   (Basic Multilingual
       Plane,BMP)), , ( Hieroglyphics) , ,  16  BMP .

        0x0000  0x007f UCS US-ASCII ,  0x0000  0x00ff ISO 8859-1 Latin-1

 (COMBINING CHARACTERS)
        UCS (combining characters).  .  .  UCS , , .  . , Umlaut-A  (  A)  UCS
       0x00c4, " A""": 0x0041 0x0308

 (IMPLEMENTATION LEVELS)
       , ISO 10646 UCS :

        1 (Level 1)
                 Hangul Jamo ( , Hangul ).

        2 (Level 2)
                1,   .    (  Hebrew,  Arabic,  Devangari,  Bengali,  Gurmukhi,
                Gujarati,  Oriya,  Tamil,  Telugo,  Kannada,  Malayalam,  Thai
                Lao).

        3 (Level 3)
                 UCS .

       Unicode   Unicode  1.1  ISO 10646 ,  3 UCS ( Basic Multilingual Plane).
       Unicode 1.1  ISO 10646 .

LINUX UNICODE (UNICODE UNDER LINUX)

        Linux , ,  1 BMP.  , .  linux  C wchar_t  32 UCS4

        UTF-8 ISO 8859-1 wctomb, mbtowc, wprintf wchar_t .

 (PRIVATE AREA)
        BMP , 0xe000  0xf8ff .  Linux ,   0xe000   0xefff  ,   0xf000   0xf8ff
       linux    linux   .H.   Peter  Anvin(<Peter.Anvin@linux.org>,  Yggdrasil
       Computing,Inc)  linux .   Unicode  DEC VT100 , ,  Klingon .

 (LITERATURE)
       * Information technology - Universal Multiple-Octet Coded Character Set
         (UCS)   -   Part   1:  Architecture  and  Basic  Multilingual  Plane.
         International Standard ISO 10646-1,  International  Organization  for
         Standardization, Geneva, 1993.

          UCS , , , .  ,  www.iso.ch.

       * The Unicode Standard - Worldwide Character Encoding Version 1.0.  The
         Unicode Consortium, Addison-Wesley, Reading, MA, 1991.

         Unicode  1.1.4 , 1.0  ftp.unicode.org .  Unicode 2.0  1996 .

       * S. Harbison, G. Steele. C  -  A  Reference  Manual.  Fourth  edition,
         Prentice Hall, Englewood Cliffs, 1995, ISBN 0-13-326224-3.

          C .  1994 ISO C  (ISO/IEC 9899:1990),  C .

 (BUGS)
       ,linux UCS  C .

 (AUTHOR)
       Markus Kuhn <mskuhn@cip.informatik.uni-erlangen.de>

(SEE ALSO)

       utf-8(7) http://www.linuxforum.net/books/UTF-8-Unicode.html

[]

       mapping <mapping@263.net>

[]

       2000/11/06

linuxman:

       http://cmpp.linuxforum.net