Provided by:
manpages-zh_1.5.2-1_all 
NAME
Unicode - 16
(DESCRIPTION)
ISO 10646 (Universal Character Set, UCS). UCS , (round-trip
compatibility), UCS ,
UCS ,: Greek, Cyrillic, Hebrew,Arabic, Armenian, Gregorian, Japanese,
Chinese, Hiragana, Katakana, Korean, Hangul, Devangari, Bengali,
Gurmukhi, Gujarati, Oriya, Tamil, Telugu, Kannada, alayam, Thai, Lao,
Bopomofo,., Tibetian, Khmer, Runic, Ethiopian, Hieroglyphics, Indo-
European , , .1993 , . , , TeX, PostScript, MS-DOS, Macintosh,
Videotext, OCR, , , , , .
UCS (ISO 10646) 31 , , 65534 (0x0000-0xfffd, (Basic Multilingual
Plane,BMP)), , ( Hieroglyphics) , , 16 BMP .
0x0000 0x007f UCS US-ASCII , 0x0000 0x00ff ISO 8859-1 Latin-1
(COMBINING CHARACTERS)
UCS (combining characters). . . UCS , , . . , Umlaut-A ( A) UCS
0x00c4, " A""": 0x0041 0x0308
(IMPLEMENTATION LEVELS)
, ISO 10646 UCS :
1 (Level 1)
Hangul Jamo ( , Hangul ).
2 (Level 2)
1, . ( Hebrew, Arabic, Devangari, Bengali, Gurmukhi,
Gujarati, Oriya, Tamil, Telugo, Kannada, Malayalam, Thai
Lao).
3 (Level 3)
UCS .
Unicode Unicode 1.1 ISO 10646 , 3 UCS ( Basic Multilingual Plane).
Unicode 1.1 ISO 10646 .
LINUX UNICODE (UNICODE UNDER LINUX)
Linux , , 1 BMP. , . linux C wchar_t 32 UCS4
UTF-8 ISO 8859-1 wctomb, mbtowc, wprintf wchar_t .
(PRIVATE AREA)
BMP , 0xe000 0xf8ff . Linux , 0xe000 0xefff , 0xf000 0xf8ff
linux linux .H. Peter Anvin(<Peter.Anvin@linux.org>, Yggdrasil
Computing,Inc) linux . Unicode DEC VT100 , , Klingon .
(LITERATURE)
* Information technology - Universal Multiple-Octet Coded Character Set
(UCS) - Part 1: Architecture and Basic Multilingual Plane.
International Standard ISO 10646-1, International Organization for
Standardization, Geneva, 1993.
UCS , , , . , www.iso.ch.
* The Unicode Standard - Worldwide Character Encoding Version 1.0. The
Unicode Consortium, Addison-Wesley, Reading, MA, 1991.
Unicode 1.1.4 , 1.0 ftp.unicode.org . Unicode 2.0 1996 .
* S. Harbison, G. Steele. C - A Reference Manual. Fourth edition,
Prentice Hall, Englewood Cliffs, 1995, ISBN 0-13-326224-3.
C . 1994 ISO C (ISO/IEC 9899:1990), C .
(BUGS)
,linux UCS C .
(AUTHOR)
Markus Kuhn <mskuhn@cip.informatik.uni-erlangen.de>
(SEE ALSO)
utf-8(7) http://www.linuxforum.net/books/UTF-8-Unicode.html
[]
mapping <mapping@263.net>
[]
2000/11/06
linuxman:
http://cmpp.linuxforum.net