gpt4 book ai didi

unicode - Unicode 中汉字的完整范围是多少?

转载 作者:行者123 更新时间:2023-12-03 05:08:04 29 4
gpt4 key购买 nike

Unicode 为中文字符分配了 U+4E00..U+9FFF。这是全套的一部分,但不是全部。

最佳答案

最终列表可以在 Unicode Character Code Charts 找到;在页面中搜索“CJK”。

East Asian Script ”文档确实提到:

Blocks Containing Han Ideographs

Han ideographic characters are found in five main blocks of the Unicode Standard, asshown in Table 18-1

表 18-1。包含汉表意文字的 block

Block                                   Range       Comment
CJK Unified Ideographs 4E00-9FFF Common
CJK Unified Ideographs Extension A 3400-4DBF Rare
CJK Unified Ideographs Extension B 20000-2A6DF Rare, historic
CJK Unified Ideographs Extension C 2A700–2B73F Rare, historic
CJK Unified Ideographs Extension D 2B740–2B81F Uncommon, some in current use
CJK Unified Ideographs Extension E 2B820–2CEAF Rare, historic
CJK Unified Ideographs Extension F 2CEB0–2EBEF Rare, historic
CJK Unified Ideographs Extension G 30000–3134F Rare, historic
CJK Unified Ideographs Extension H 31350–323AF Rare, historic
CJK Compatibility Ideographs F900-FAFF Duplicates, unifiable variants, corporate characters
CJK Compatibility Ideographs Supplement 2F800-2FA1F Unifiable variants

注意:此表自 Unicode 15.0 起是最新的。 block 范围会随着时间的推移而变化:最新的是 CJK Unified Ideographs .

还有

CJK Radicals / Kangxi Radicals          2F00–2FDF
CJK Radicals Supplement 2E80–2EFF

其中包含可能会进入常规文本的字符,以及

CJK Symbols and Punctuation             3000–303F

另请参阅维基百科:

另请参阅Unihan Database (组织与中日韩统一表意文字属性相关的信息)

关于unicode - Unicode 中汉字的完整范围是多少?,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/1366068/

29 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com