gpt4 book ai didi

unicode - Unicode 拉丁下标字母列表

转载 作者:行者123 更新时间:2023-12-05 02:01:15 28 4
gpt4 key购买 nike

感谢jmcnamara我发现了一种在 xlsxwriter 图表中使用 Unicode 字符的好方法:xlsxwrter: rich text format in chart title

我需要一个所有 Unicode 字符的列表来复制。我找到了一些:

为什么大写下标字母没有字母表?我在哪里可以买到这些?

最佳答案

Unicode 是一个字符集,用于characters 之间的映射。/glyphs到数字。它只处理纯文本,不应该用于格式化文本§。您不能将字母设为 粗体斜体 或将字母移动到 abovebelow 基线Unicode 代码点(参见 Create Unicode subscripts and superscripts with combining glyphs )

似乎表示格式的字符之所以存在,主要是因为它们以前在旧标准中使用过。你可以在 Unicode 标准中找到原因

Q: Why doesn't Unicode have a full set of superscripts and subscripts?

A: The superscripted and subscripted characters encoded in Unicode are either compatibility characters encoded for roundtrip conversion of data from legacy standards, or are actually modifier letters used with particular meanings in technical transcriptional systems such as IPA and UPA. Those characters are not intended for general superscripting or subscripting of arbitrary text strings—for such textual effects, you should use text styles or markup in rich text, instead.

https://www.unicode.org/faq/ligature_digraph.html

兼容性也是为什么上标数字 ²³¹ 经常与其余字符 ⁰⁴⁵⁶⁷⁸⁹ 不同的原因,因为许多字体只包含前一组而不包含后者。而 ¹ 位于 ²³ 之后,因为 ISO 8859-1 就是这样做的

事实上,几乎所有在 Unicode 中看起来很愚蠢的东西都是因为与旧字符集的兼容性。您会发现很多示例,其中有一个不必要的 Unicode 代码点代表一系列字符,例如 Nj、Dž、Ⅷ、㎉、㎓、﷽。同样,还有很多不合理的表情符号,例如the “copyright” ©️, “registered trademark” ®️ and “trademark” ™️ symbols。 .人们以前在其他一些字符集中使用过它们,因此 Unicode 必须做同样的事情才能成功地与它们相互转换。


§ 有关 Unicode 格式文本的更多信息:

Rich Text. Also known as styled text. The result of adding information to plain text. Examples of information that can be added include font data, color, formatting information, phonetic annotations, interlinear text, and so on. The Unicode Standard does not address the representation of rich text. It is expected that systems and applications will implement proprietary forms of rich text. Some public forms of rich text are available (for example, ODA, HTML, and SGML). When everything except primary content is removed from rich text, only plain text should remain.

https://unicode.org/glossary/#rich_text (emphasis mine)

Q: What is the difference between “rich text” and “plain text”?

A: Rich text is text with all its formatting information: typeface, point size, weight, kerning, and so on. Plain text is the underlying content stream to which formatting is applied.

One key distinction between the two is that rich text breaks the text up into runs and applies uniform formatting to each run. As such, rich text is inherently stateful. Plain text is not stateful. It should be possible to lose the first half of a block of plain text without any impact on rendering.

Unicode, by design, only deals with plain text. It doesn't provide a generalized solution to rich text issues.

https://www.unicode.org/faq/ligature_digraph.html

关于unicode - Unicode 拉丁下标字母列表,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/66732251/

28 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com