gpt4 book ai didi

unicode - 所有 Unicode 左括号/右括号的列表

转载 作者:行者123 更新时间:2023-12-02 15:18:58 28 4
gpt4 key购买 nike

每个 Unicode 括号字符的列表是什么(包括,例如: {}[]()<> )?搜索 Unicode 字符的好方法是什么?

最佳答案

有一个plain-text database有关 Unicode 联盟提供的每个 Unicode 字符的信息;格式在Unicode Annex #44中描述。 。主要信息包含在UnicodeData.txt中。 。开始和结束标点字符用 Ps 表示。 (标点符号开始)和 Pe (标点符号结束)位于 General_Category 字段(第三个字段,由 ; 分隔)。寻找这些角色,您就会找到您要找的东西。

请注意,并非所有您认为括号中的字符都会列出;例如,引号(包括“«»”)。用 Pi 表示和Pf (开头和结尾的标点符号),因此您可能也想包含这些内容。还有一些字符,例如 <> 、 在某些上下文(例如 HTML/XML)中用作括号,而它们在 UnicodeData.txt 中被视为数学符号 ( Sm )。那些你必须手动找到的;没有预先确定的列表。

这是一个用于获取此信息及其输出的快速 Bash 脚本。我已经包含了括号和引号。 (注意:在某些 Bash 实现上,UTF-8 打印有一个错误,导致它将 U+00AB“«”和 U+00BB“»”打印为“?”,并且某些终端无法呈现所有字符正确。)

while IFS=';' read number name category rest
do
if [[ "$category" =~ Ps|Pe|Pi|Pf ]]
then
printf "%s (U+%s, %s): \u"$number"\n" "$name" "$number" "$category"
fi
done <UnicodeData.txt
LEFT PARENTHESIS (U+0028, Ps): (RIGHT PARENTHESIS (U+0029, Pe): )LEFT SQUARE BRACKET (U+005B, Ps): [RIGHT SQUARE BRACKET (U+005D, Pe): ]LEFT CURLY BRACKET (U+007B, Ps): {RIGHT CURLY BRACKET (U+007D, Pe): }LEFT-POINTING DOUBLE ANGLE QUOTATION MARK (U+00AB, Pi): «RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK (U+00BB, Pf): »TIBETAN MARK GUG RTAGS GYON (U+0F3A, Ps): ༺TIBETAN MARK GUG RTAGS GYAS (U+0F3B, Pe): ༻TIBETAN MARK ANG KHANG GYON (U+0F3C, Ps): ༼TIBETAN MARK ANG KHANG GYAS (U+0F3D, Pe): ༽OGHAM FEATHER MARK (U+169B, Ps): ᚛OGHAM REVERSED FEATHER MARK (U+169C, Pe): ᚜LEFT SINGLE QUOTATION MARK (U+2018, Pi): ‘RIGHT SINGLE QUOTATION MARK (U+2019, Pf): ’SINGLE LOW-9 QUOTATION MARK (U+201A, Ps): ‚SINGLE HIGH-REVERSED-9 QUOTATION MARK (U+201B, Pi): ‛LEFT DOUBLE QUOTATION MARK (U+201C, Pi): “RIGHT DOUBLE QUOTATION MARK (U+201D, Pf): ”DOUBLE LOW-9 QUOTATION MARK (U+201E, Ps): „DOUBLE HIGH-REVERSED-9 QUOTATION MARK (U+201F, Pi): ‟SINGLE LEFT-POINTING ANGLE QUOTATION MARK (U+2039, Pi): ‹SINGLE RIGHT-POINTING ANGLE QUOTATION MARK (U+203A, Pf): ›LEFT SQUARE BRACKET WITH QUILL (U+2045, Ps): ⁅RIGHT SQUARE BRACKET WITH QUILL (U+2046, Pe): ⁆SUPERSCRIPT LEFT PARENTHESIS (U+207D, Ps): ⁽SUPERSCRIPT RIGHT PARENTHESIS (U+207E, Pe): ⁾SUBSCRIPT LEFT PARENTHESIS (U+208D, Ps): ₍SUBSCRIPT RIGHT PARENTHESIS (U+208E, Pe): ₎LEFT-POINTING ANGLE BRACKET (U+2329, Ps): 〈RIGHT-POINTING ANGLE BRACKET (U+232A, Pe): 〉MEDIUM LEFT PARENTHESIS ORNAMENT (U+2768, Ps): ❨MEDIUM RIGHT PARENTHESIS ORNAMENT (U+2769, Pe): ❩MEDIUM FLATTENED LEFT PARENTHESIS ORNAMENT (U+276A, Ps): ❪MEDIUM FLATTENED RIGHT PARENTHESIS ORNAMENT (U+276B, Pe): ❫MEDIUM LEFT-POINTING ANGLE BRACKET ORNAMENT (U+276C, Ps): ❬MEDIUM RIGHT-POINTING ANGLE BRACKET ORNAMENT (U+276D, Pe): ❭HEAVY LEFT-POINTING ANGLE QUOTATION MARK ORNAMENT (U+276E, Ps): ❮HEAVY RIGHT-POINTING ANGLE QUOTATION MARK ORNAMENT (U+276F, Pe): ❯HEAVY LEFT-POINTING ANGLE BRACKET ORNAMENT (U+2770, Ps): ❰HEAVY RIGHT-POINTING ANGLE BRACKET ORNAMENT (U+2771, Pe): ❱LIGHT LEFT TORTOISE SHELL BRACKET ORNAMENT (U+2772, Ps): ❲LIGHT RIGHT TORTOISE SHELL BRACKET ORNAMENT (U+2773, Pe): ❳MEDIUM LEFT CURLY BRACKET ORNAMENT (U+2774, Ps): ❴MEDIUM RIGHT CURLY BRACKET ORNAMENT (U+2775, Pe): ❵LEFT S-SHAPED BAG DELIMITER (U+27C5, Ps): ⟅RIGHT S-SHAPED BAG DELIMITER (U+27C6, Pe): ⟆MATHEMATICAL LEFT WHITE SQUARE BRACKET (U+27E6, Ps): ⟦MATHEMATICAL RIGHT WHITE SQUARE BRACKET (U+27E7, Pe): ⟧MATHEMATICAL LEFT ANGLE BRACKET (U+27E8, Ps): ⟨MATHEMATICAL RIGHT ANGLE BRACKET (U+27E9, Pe): ⟩MATHEMATICAL LEFT DOUBLE ANGLE BRACKET (U+27EA, Ps): ⟪MATHEMATICAL RIGHT DOUBLE ANGLE BRACKET (U+27EB, Pe): ⟫MATHEMATICAL LEFT WHITE TORTOISE SHELL BRACKET (U+27EC, Ps): ⟬MATHEMATICAL RIGHT WHITE TORTOISE SHELL BRACKET (U+27ED, Pe): ⟭MATHEMATICAL LEFT FLATTENED PARENTHESIS (U+27EE, Ps): ⟮MATHEMATICAL RIGHT FLATTENED PARENTHESIS (U+27EF, Pe): ⟯LEFT WHITE CURLY BRACKET (U+2983, Ps): ⦃RIGHT WHITE CURLY BRACKET (U+2984, Pe): ⦄LEFT WHITE PARENTHESIS (U+2985, Ps): ⦅RIGHT WHITE PARENTHESIS (U+2986, Pe): ⦆Z NOTATION LEFT IMAGE BRACKET (U+2987, Ps): ⦇Z NOTATION RIGHT IMAGE BRACKET (U+2988, Pe): ⦈Z NOTATION LEFT BINDING BRACKET (U+2989, Ps): ⦉Z NOTATION RIGHT BINDING BRACKET (U+298A, Pe): ⦊LEFT SQUARE BRACKET WITH UNDERBAR (U+298B, Ps): ⦋RIGHT SQUARE BRACKET WITH UNDERBAR (U+298C, Pe): ⦌LEFT SQUARE BRACKET WITH TICK IN TOP CORNER (U+298D, Ps): ⦍RIGHT SQUARE BRACKET WITH TICK IN BOTTOM CORNER (U+298E, Pe): ⦎LEFT SQUARE BRACKET WITH TICK IN BOTTOM CORNER (U+298F, Ps): ⦏RIGHT SQUARE BRACKET WITH TICK IN TOP CORNER (U+2990, Pe): ⦐LEFT ANGLE BRACKET WITH DOT (U+2991, Ps): ⦑RIGHT ANGLE BRACKET WITH DOT (U+2992, Pe): ⦒LEFT ARC LESS-THAN BRACKET (U+2993, Ps): ⦓RIGHT ARC GREATER-THAN BRACKET (U+2994, Pe): ⦔DOUBLE LEFT ARC GREATER-THAN BRACKET (U+2995, Ps): ⦕DOUBLE RIGHT ARC LESS-THAN BRACKET (U+2996, Pe): ⦖LEFT BLACK TORTOISE SHELL BRACKET (U+2997, Ps): ⦗RIGHT BLACK TORTOISE SHELL BRACKET (U+2998, Pe): ⦘LEFT WIGGLY FENCE (U+29D8, Ps): ⧘RIGHT WIGGLY FENCE (U+29D9, Pe): ⧙LEFT DOUBLE WIGGLY FENCE (U+29DA, Ps): ⧚RIGHT DOUBLE WIGGLY FENCE (U+29DB, Pe): ⧛LEFT-POINTING CURVED ANGLE BRACKET (U+29FC, Ps): ⧼RIGHT-POINTING CURVED ANGLE BRACKET (U+29FD, Pe): ⧽LEFT SUBSTITUTION BRACKET (U+2E02, Pi): ⸂RIGHT SUBSTITUTION BRACKET (U+2E03, Pf): ⸃LEFT DOTTED SUBSTITUTION BRACKET (U+2E04, Pi): ⸄RIGHT DOTTED SUBSTITUTION BRACKET (U+2E05, Pf): ⸅LEFT TRANSPOSITION BRACKET (U+2E09, Pi): ⸉RIGHT TRANSPOSITION BRACKET (U+2E0A, Pf): ⸊LEFT RAISED OMISSION BRACKET (U+2E0C, Pi): ⸌RIGHT RAISED OMISSION BRACKET (U+2E0D, Pf): ⸍LEFT LOW PARAPHRASE BRACKET (U+2E1C, Pi): ⸜RIGHT LOW PARAPHRASE BRACKET (U+2E1D, Pf): ⸝LEFT VERTICAL BAR WITH QUILL (U+2E20, Pi): ⸠RIGHT VERTICAL BAR WITH QUILL (U+2E21, Pf): ⸡TOP LEFT HALF BRACKET (U+2E22, Ps): ⸢TOP RIGHT HALF BRACKET (U+2E23, Pe): ⸣BOTTOM LEFT HALF BRACKET (U+2E24, Ps): ⸤BOTTOM RIGHT HALF BRACKET (U+2E25, Pe): ⸥LEFT SIDEWAYS U BRACKET (U+2E26, Ps): ⸦RIGHT SIDEWAYS U BRACKET (U+2E27, Pe): ⸧LEFT DOUBLE PARENTHESIS (U+2E28, Ps): ⸨RIGHT DOUBLE PARENTHESIS (U+2E29, Pe): ⸩LEFT ANGLE BRACKET (U+3008, Ps): 〈RIGHT ANGLE BRACKET (U+3009, Pe): 〉LEFT DOUBLE ANGLE BRACKET (U+300A, Ps): 《RIGHT DOUBLE ANGLE BRACKET (U+300B, Pe): 》LEFT CORNER BRACKET (U+300C, Ps): 「RIGHT CORNER BRACKET (U+300D, Pe): 」LEFT WHITE CORNER BRACKET (U+300E, Ps): 『RIGHT WHITE CORNER BRACKET (U+300F, Pe): 』LEFT BLACK LENTICULAR BRACKET (U+3010, Ps): 【RIGHT BLACK LENTICULAR BRACKET (U+3011, Pe): 】LEFT TORTOISE SHELL BRACKET (U+3014, Ps): 〔RIGHT TORTOISE SHELL BRACKET (U+3015, Pe): 〕LEFT WHITE LENTICULAR BRACKET (U+3016, Ps): 〖RIGHT WHITE LENTICULAR BRACKET (U+3017, Pe): 〗LEFT WHITE TORTOISE SHELL BRACKET (U+3018, Ps): 〘RIGHT WHITE TORTOISE SHELL BRACKET (U+3019, Pe): 〙LEFT WHITE SQUARE BRACKET (U+301A, Ps): 〚RIGHT WHITE SQUARE BRACKET (U+301B, Pe): 〛REVERSED DOUBLE PRIME QUOTATION MARK (U+301D, Ps): 〝DOUBLE PRIME QUOTATION MARK (U+301E, Pe): 〞LOW DOUBLE PRIME QUOTATION MARK (U+301F, Pe): 〟ORNATE LEFT PARENTHESIS (U+FD3E, Ps): ﴾ORNATE RIGHT PARENTHESIS (U+FD3F, Pe): ﴿PRESENTATION FORM FOR VERTICAL LEFT WHITE LENTICULAR BRACKET (U+FE17, Ps): ︗PRESENTATION FORM FOR VERTICAL RIGHT WHITE LENTICULAR BRAKCET (U+FE18, Pe): ︘PRESENTATION FORM FOR VERTICAL LEFT PARENTHESIS (U+FE35, Ps): ︵PRESENTATION FORM FOR VERTICAL RIGHT PARENTHESIS (U+FE36, Pe): ︶PRESENTATION FORM FOR VERTICAL LEFT CURLY BRACKET (U+FE37, Ps): ︷PRESENTATION FORM FOR VERTICAL RIGHT CURLY BRACKET (U+FE38, Pe): ︸PRESENTATION FORM FOR VERTICAL LEFT TORTOISE SHELL BRACKET (U+FE39, Ps): ︹PRESENTATION FORM FOR VERTICAL RIGHT TORTOISE SHELL BRACKET (U+FE3A, Pe): ︺PRESENTATION FORM FOR VERTICAL LEFT BLACK LENTICULAR BRACKET (U+FE3B, Ps): ︻PRESENTATION FORM FOR VERTICAL RIGHT BLACK LENTICULAR BRACKET (U+FE3C, Pe): ︼PRESENTATION FORM FOR VERTICAL LEFT DOUBLE ANGLE BRACKET (U+FE3D, Ps): ︽PRESENTATION FORM FOR VERTICAL RIGHT DOUBLE ANGLE BRACKET (U+FE3E, Pe): ︾PRESENTATION FORM FOR VERTICAL LEFT ANGLE BRACKET (U+FE3F, Ps): ︿PRESENTATION FORM FOR VERTICAL RIGHT ANGLE BRACKET (U+FE40, Pe): ﹀PRESENTATION FORM FOR VERTICAL LEFT CORNER BRACKET (U+FE41, Ps): ﹁PRESENTATION FORM FOR VERTICAL RIGHT CORNER BRACKET (U+FE42, Pe): ﹂PRESENTATION FORM FOR VERTICAL LEFT WHITE CORNER BRACKET (U+FE43, Ps): ﹃PRESENTATION FORM FOR VERTICAL RIGHT WHITE CORNER BRACKET (U+FE44, Pe): ﹄PRESENTATION FORM FOR VERTICAL LEFT SQUARE BRACKET (U+FE47, Ps): ﹇PRESENTATION FORM FOR VERTICAL RIGHT SQUARE BRACKET (U+FE48, Pe): ﹈SMALL LEFT PARENTHESIS (U+FE59, Ps): ﹙SMALL RIGHT PARENTHESIS (U+FE5A, Pe): ﹚SMALL LEFT CURLY BRACKET (U+FE5B, Ps): ﹛SMALL RIGHT CURLY BRACKET (U+FE5C, Pe): ﹜SMALL LEFT TORTOISE SHELL BRACKET (U+FE5D, Ps): ﹝SMALL RIGHT TORTOISE SHELL BRACKET (U+FE5E, Pe): ﹞FULLWIDTH LEFT PARENTHESIS (U+FF08, Ps): (FULLWIDTH RIGHT PARENTHESIS (U+FF09, Pe): )FULLWIDTH LEFT SQUARE BRACKET (U+FF3B, Ps): [FULLWIDTH RIGHT SQUARE BRACKET (U+FF3D, Pe): ]FULLWIDTH LEFT CURLY BRACKET (U+FF5B, Ps): {FULLWIDTH RIGHT CURLY BRACKET (U+FF5D, Pe): }FULLWIDTH LEFT WHITE PARENTHESIS (U+FF5F, Ps): ⦅FULLWIDTH RIGHT WHITE PARENTHESIS (U+FF60, Pe): ⦆HALFWIDTH LEFT CORNER BRACKET (U+FF62, Ps): 「HALFWIDTH RIGHT CORNER BRACKET (U+FF63, Pe): 」

关于unicode - 所有 Unicode 左括号/右括号的列表,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/33306907/

28 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com