Provided by:
manpages-zh_1.5.2-1_all 
NAME
UTF-8 - ASCII Unicode
The Unicode 16 Unicode UCS-2) 16 '\0''/' C ASCII UNIX 16 UCS-2
Unicode ISO 10646 Universal Character Set (UCS), Unicode 31 32
UCS-4 UCS-4 UTF-8 Unicode UCS UTF-8 UNIX Unicode
UTF-8
* UCS 0x00000000 0x0000007f US-ASCII 0x00 0x7f ASCII 7 ASCII
ASCII UTF-8.
* 0x7f UCS 0x80 0fd ASCII '\0''\[u2019]
* UCS-4
* 2^32 UCS UTF-8
* 0xfe 0xff UTF-8
* ASCII UCS 0xc0 0xfd 0x80 0xbf
* UTF-8 UCS 6 Unicode 3 Linux 16 Unicode UCS Linux UTF-8
UCS
0x00000000 - 0x0000007F:
0xxxxxxx
0x00000080 - 0x000007FF:
110xxxxx 10xxxxxx
0x00000800 - 0x0000FFFF:
1110xxxx 10xxxxxx 10xxxxxx
0x00010000 - 0x001FFFFF:
11110xxx 10xxxxxx 10xxxxxx 10xxxxxx
0x00200000 - 0x03FFFFFF:
111110xx 10xxxxxx 10xxxxxx 10xxxxxx 10xxxxxx
0x04000000 - 0x7FFFFFFF:
1111110x 10xxxxxx 10xxxxxx 10xxxxxx 10xxxxxx 10xxxxxx
xxx
Unicode 0xa9 = 1010 1001 () UTF-8
11000010 10101001 = 0xc2 0xa9
0x2260 = 0010 0010 0110 0000 ("")
11100010 10001001 10100000 = 0xe2 0x89 0xa0
ISO 10646, Unicode 1.1, XPG4, Plan 9.
Markus Kuhn
unicode(7)
[]
billpan <billpan@yeah.net>
[]
2000/11/09
linuxman:
http://cmpp.linuxforum.net