Unicode Character Encoding Table (Repost)
Copyright notice: this is an original article by the blogger and may not be reproduced without the blogger's permission. blog.csdn/zhenyu5211314/article/details/51537778

Decimal         Hexadecimal     Count  Block name (English)
Start   End     Start   End
0       127     0000    007F    128    C0 Control and Basic Latin
128     255     0080    00FF    128    C1 Control and Latin 1 Supplement
256     383     0100    017F    128    Latin Extended-A
384     591     0180    024F    208    Latin Extended-B
592     687     0250    02AF    96     IPA Extensions
688     767     02B0    02FF    80     Spacing Modifiers
768     879     0300    036F    112    Combining Diacritical Marks
880     1023    0370    03FF    144    Greek and Coptic
1024    1279    0400    04FF    256    Cyrillic
1280    1327    0500    052F    48     Cyrillic Supplement
1328    1423    0530    058F    96     Armenian
1424    1535    0590    05FF    112    Hebrew
1536    1791    0600    06FF    256    Arabic
1792    1871    0700    074F    80     Syriac
1872    1919    0750    077F    48     Arabic Supplement
1920    1983    0780    07BF    64     Thaana
1984    2047    07C0    07FF    64     N'Ko
2048    2143    0800    085F    96     Avestan and Pahlavi
2144    2175    0860    087F    32     Mandaic
2176    2223    0880    08AF    48     Samaritan
2304    2431    0900    097F    128    Devanagari
2432    2559    0980    09FF    128    Bengali
2560    2687    0A00    0A7F    128    Gurmukhi
2688    2815    0A80    0AFF    128    Gujarati
2816    2943    0B00    0B7F    128    Oriya
2944    3071    0B80    0BFF    128    Tamil
3072    3199    0C00    0C7F    128    Telugu
3200    3327    0C80    0CFF    128    Kannada
3328    3455    0D00    0D7F    128    Malayalam
3456    3583    0D80    0DFF    128    Sinhala
3584    3711    0E00    0E7F    128    Thai
3712    3839    0E80    0EFF    128    Lao
3840    4095    0F00    0FFF    256    Tibetan
4096    4255    1000    109F    160    Myanmar
4256    4351    10A0    10FF    96     Georgian
4352    4607    1100    11FF    256    Hangul Jamo
4608    4991    1200    137F    384    Ethiopic
4992    5023    1380    139F    32     Ethiopic Supplement
5024    5119    13A0    13FF    96     Cherokee
5120    5759    1400    167F    640    Unified Canadian Aboriginal Syllabics
5760    5791    1680    169F    32     Ogham
5792    5887    16A0    16FF    96     Runic
5888    5919    1700    171F    32     Tagalog
5920    5951    1720    173F    32     Hanunóo
5952    5983    1740    175F    32     Buhid
5984    6015    1760    177F    32     Tagbanwa
6016    6143    1780    17FF    128    Khmer
6144    6319    1800    18AF    176    Mongolian
6320    6399    18B0    18FF    80     Cham
6400    6479    1900    194F    80     Limbu
6480    6527    1950    197F    48     Tai Le
6528    6623    1980    19DF    96     New Tai Lue
6624    6655    19E0    19FF    32     Khmer Symbols
6656    6687    1A00    1A1F    32     Buginese
6688    6751    1A20    1A5F    64     Batak
6784    6895    1A80    1AEF    112    Lanna
6912    7039    1B00    1B7F    128    Balinese
7040    7088    1B80    1BB0    49     Sundanese
7104    7167    1BC0    1BFF    64     Pahawh Hmong
7168    7247    1C00    1C4F    80     Lepcha
7248    7295    1C50    1C7F    48     Ol Chiki
7296    7391    1C80    1CDF    96     Meithei/Manipuri
7424    7551    1D00    1D7F    128    Phonetic Extensions
7552    7615    1D80    1DBF    64     Phonetic Extensions Supplement
7616    7679    1DC0    1DFF    64     Combining Diacritical Marks Supplement
7680    7935    1E00    1EFF    256    Latin Extended Additional
7936    8191    1F00    1FFF    256    Greek Extended
8192    8303    2000    206F    112    General Punctuation
8304    8351    2070    209F    48     Superscripts and Subscripts
8352    8399    20A0    20CF    48     Currency Symbols
8400    8447    20D0    20FF    48     Combining Diacritical Marks for Symbols
8448    8527    2100    214F    80     Letterlike Symbols
8528    8591    2150    218F    64     Number Form
8592    8703    2190    21FF    112    Arrows
8704    8959    2200    22FF    256    Mathematical Operator
8960    9215    2300    23FF    256    Miscellaneous Technical
9216    9279    2400    243F    64     Control Pictures
9280    9311    2440    245F    32     Optical Character Recognition
9312    9471    2460    24FF    160    Enclosed Alphanumerics
9472    9599    2500    257F    128    Box Drawing
9600    9631    2580    259F    32     Block Element
9632    9727    25A0    25FF    96     Geometric Shapes
9728    9983    2600    26FF    256    Miscellaneous Symbols
9984    10175   2700    27BF    192    Dingbats
10176   10223   27C0    27EF    48     Miscellaneous Mathematical Symbols-A
10224   10239   27F0    27FF    16     Supplemental Arrows-A
10240   10495   2800    28FF    256    Braille Patterns
10496   10623   2900    297F    128    Supplemental Arrows-B
10624   10751   2980    29FF    128    Miscellaneous Mathematical Symbols-B
10752   11007   2A00    2AFF    256    Supplemental Mathematical Operator
11008   11263   2B00    2BFF    256    Miscellaneous Symbols and Arrows
11264   11359   2C00    2C5F    96     Glagolitic
11360   11391   2C60    2C7F    32     Latin Extended-C
11392   11519   2C80    2CFF    128    Coptic
11520   11567   2D00    2D2F    48     Georgian Supplement
11568   11647   2D30    2D7F    80     Tifinagh
11648   11743   2D80    2DDF    96     Ethiopic Extended
11776   11903   2E00    2E7F    128    Supplemental Punctuation
11904   12031   2E80    2EFF    128    CJK Radicals Supplement
12032   12255   2F00    2FDF    224    Kangxi Radicals
12272   12287   2FF0    2FFF    16     Ideographic Description Characters
12288   12351   3000    303F    64     CJK Symbols and Punctuation
12352   12447   3040    309F    96     Hiragana
12448   12543   30A0    30FF    96     Katakana
12544   12591   3100    312F    48     Bopomofo
12592   12687   3130    318F    96     Hangul Compatibility Jamo
12688   12703   3190    319F    16     Kanbun
12704   12735   31A0    31BF    32     Bopomofo Extended
12736   12783   31C0    31EF    48     CJK Strokes
12784   12799   31F0    31FF    16     Katakana Phonetic Extensions
12800   13055   3200    32FF    256    Enclosed CJK Letters and Months
13056   13311   3300    33FF    256    CJK Compatibility
13312   19903   3400    4DBF    6592   CJK Unified Ideographs Extension A
19904   19967   4DC0    4DFF    64     Yijing Hexagrams Symbols
19968   40895   4E00    9FBF    20928  CJK Unified Ideographs
40960   42127   A000    A48F    1168   Yi Syllables
42128   42191   A490    A4CF    64     Yi Radicals
42240   42527   A500    A61F    288    Vai
42592   42751   A660    A6FF    160    Unified Canadian Aboriginal Syllabics Supplement
42752   42783   A700    A71F    32     Modifier Tone Letters
42784   43007   A720    A7FF    224    Latin Extended-D
43008   43055   A800    A82F    48     Syloti Nagri
43072   43135   A840    A87F    64     Phags-pa
43136   43231   A880    A8DF    96     Saurashtra
43264   43391   A900    A97F    128    Javanese
43392   43487   A980    A9DF    96     Chakma
43520   43583   AA00    AA3F    64     Varang Kshiti
43584   43631   AA40    AA6F    48     Sorang Sompeng
43648   43743   AA80    AADF    96     Newari
43776   43871   AB00    AB5F    96     Việt Thái
43904   43936   AB80    ABA0    33     Kayah Li
44032   55215   AC00    D7AF    11184  Hangul Syllables
55296   56319   D800    DBFF    1024   High-half zone of UTF-16
56320   57343   DC00    DFFF    1024   Low-half zone of UTF-16
57344   63743   E000    F8FF    6400   Private Use Zone
63744   64255   F900    FAFF    512    CJK Compatibility Ideographs
64256   64335   FB00    FB4F    80     Alphabetic Presentation Form
64336   65023   FB50    FDFF    688    Arabic Presentation Form-A
65024   65039   FE00    FE0F    16     Variation Selector
65040   65055   FE10    FE1F    16     Vertical Forms
65056   65071   FE20    FE2F    16     Combining Half Marks
65072   65103   FE30    FE4F    32     CJK Compatibility Forms
65104   65135   FE50    FE6F    32     Small Form Variants
65136   65279   FE70    FEFF    144    Arabic Presentation Form-B
65280   65519   FF00    FFEF    240    Halfwidth and Fullwidth Form
65520   65535   FFF0    FFFF    16     Specials
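
Each row of the table relates its columns by simple arithmetic: the hexadecimal bounds are the decimal bounds written in base 16, and the count is end minus start plus one. The following is a minimal sketch that checks this for a handful of rows copied from the table and looks up the block for a code point. The names `ROWS`, `row_is_consistent`, and `find_block` are made up for this illustration, and only four rows are included, so the lookup is not a complete block database.

```python
# Each row is (decimal start, decimal end, hex start, hex end, count, name),
# copied from the table above. Only a few rows are included for illustration.
ROWS = [
    (0, 127, "0000", "007F", 128, "C0 Control and Basic Latin"),
    (880, 1023, "0370", "03FF", 144, "Greek and Coptic"),
    (13312, 19903, "3400", "4DBF", 6592, "CJK Unified Ideographs Extension A"),
    (44032, 55215, "AC00", "D7AF", 11184, "Hangul Syllables"),
]


def row_is_consistent(row):
    """Check that the hex bounds equal the decimal bounds and the count fits."""
    dec_start, dec_end, hex_start, hex_end, count, _name = row
    return (
        int(hex_start, 16) == dec_start
        and int(hex_end, 16) == dec_end
        and dec_end - dec_start + 1 == count
    )


def find_block(code_point, rows=ROWS):
    """Return the block name whose range contains code_point, or None."""
    for dec_start, dec_end, _hs, _he, _count, name in rows:
        if dec_start <= code_point <= dec_end:
            return name
    return None
```

Because `ROWS` is only a sample, `find_block` returns `None` for any code point whose block was not copied in, even if the full table lists it.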
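UTF-8 is loosely similar to Huffman coding in that it is a variable-length encoding: smaller code points get shorter byte sequences. It encodes Unicode code points as follows:
code points 0x00-0x7F use a single byte;
code points 0x80-0x7FF use two bytes;
code points 0x800-0xFFFF use three bytes;
code points 0x10000-0x10FFFF, which lie beyond the table above, use four bytes.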
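
The byte-length rules can be sketched in a few lines of Python. This is an illustration rather than a replacement for the built-in codec: `utf8_length` and `utf8_encode` are hypothetical helper names, the four-byte case follows the standard UTF-8 rule for code points up to 0x10FFFF, and the surrogate range 0xD800-0xDFFF is not rejected here as a strict encoder would.

```python
def utf8_length(code_point):
    """Return how many bytes UTF-8 needs for one code point."""
    if code_point < 0 or code_point > 0x10FFFF:
        raise ValueError("code point out of Unicode range")
    if code_point <= 0x7F:
        return 1
    if code_point <= 0x7FF:
        return 2
    if code_point <= 0xFFFF:
        return 3
    return 4


def utf8_encode(code_point):
    """Encode one code point by hand (surrogates are not checked)."""
    n = utf8_length(code_point)
    if n == 1:
        return bytes([code_point])
    # Lead byte prefix: n high bits set, i.e. 0xC0, 0xE0 or 0xF0.
    lead_prefix = (0xFF << (8 - n)) & 0xFF
    out = []
    # Continuation bytes carry 6 payload bits each, lowest bits first.
    for _ in range(n - 1):
        out.append(0x80 | (code_point & 0x3F))
        code_point >>= 6
    # The remaining high bits go into the lead byte.
    out.append(lead_prefix | code_point)
    return bytes(reversed(out))
```

For any non-surrogate character `ch`, the result of `utf8_encode(ord(ch))` can be compared against Python's own `ch.encode("utf-8")` as a sanity check.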
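The Unicode range usually quoted for Chinese characters (Han ideographs) is 0x4E00 to 0x9FA5, which lies inside the CJK Unified Ideographs block.
This range is not exclusively Chinese: it holds the ideographs shared by Chinese, Japanese, and Korean.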
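
A range check against these two bounds is a common way to detect Han ideographs in text. The sketch below assumes the 0x4E00 to 0x9FA5 range given above. The names `is_han` and `contains_han` are hypothetical, and the check ignores ideographs outside that range, such as those in CJK Unified Ideographs Extension A.

```python
HAN_START = 0x4E00
HAN_END = 0x9FA5


def is_han(ch):
    """True if a single character falls in the 0x4E00-0x9FA5 range."""
    return HAN_START <= ord(ch) <= HAN_END


def contains_han(text):
    """True if any character of text is in the range."""
    return any(is_han(ch) for ch in text)
```

Since the range is shared across Chinese, Japanese, and Korean, `is_han` also returns true for kanji and hanja, while kana and hangul fall outside it.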
