python - shlex.split still not supporting unicode?


Reposted by: 太空狗 | Updated: 2023-10-29 17:45:01


According to the documentation, shlex should support Unicode as of Python 2.7.3. However, when I run the code below, I get: UnicodeEncodeError: 'ascii' codec can't encode characters in position 184-189: ordinal not in range(128)

Am I doing something wrong?

import shlex

command_full = u'software.py -fileA="sequence.fasta" -fileB="新建文本文档.fasta.txt" -output_dir="..." -FORMtitle="tst"'

shlex.split(command_full)

The full error is:

Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/opt/local/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/shlex.py", line 275, in split
    lex = shlex(s, posix=posix)
  File "/opt/local/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/shlex.py", line 25, in __init__
    instream = StringIO(instream)
UnicodeEncodeError: 'ascii' codec can't encode characters in position 44-49: ordinal not in range(128)

This is the output on my Mac, using Python from MacPorts. I get exactly the same error on an Ubuntu machine with the "native" Python 2.7.3.

Best answer

The shlex.split() code wraps both unicode() and str() instances in a StringIO() object, which can only handle Latin-1 bytes (so not the full range of Unicode code points).

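If you still want to use shlex.split(), you will have to encode the string first (UTF-8 should work). What the module's maintainers meant is that unicode() objects are now supported, just not anything outside the Latin-1 range of code points.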
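As a quick check of which codecs can represent the file name from the question, the sketch below tries a few of them. The helper name `encodes_with` is hypothetical and used only for illustration; the `'ascii'` case matches the codec named in the traceback above.

```python
# The non-ASCII file name from the question, written with \u escapes.
name = u'\u65b0\u5efa\u6587\u672c\u6587\u6863.fasta.txt'

def encodes_with(text, codec):
    """Return True if `text` can be encoded with `codec` (hypothetical helper)."""
    try:
        text.encode(codec)
        return True
    except UnicodeEncodeError:
        return False

print(encodes_with(name, 'ascii'))    # the codec named in the traceback
print(encodes_with(name, 'latin-1'))  # CJK characters are outside Latin-1
print(encodes_with(name, 'utf-8'))    # UTF-8 covers every code point in this name
```

For this particular input only UTF-8 succeeds, so UTF-8 is the encoding to pick for the round trip.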

Encoding, splitting, and decoding gives me:

>>> map(lambda s: s.decode('UTF8'), shlex.split(command_full.encode('utf8')))
[u'software.py', u'-fileA=sequence.fasta', u'-fileB=\u65b0\u5efa\u6587\u672c\u6587\u6863.fasta.txt', u'-output_dir=...', u'-FORMtitle=tst']

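A now-closed Python issue tried to address this, but the module is very byte-stream oriented, and no new patch has materialized. For now, encoding to iso-8859-1 or UTF-8 is the best I can come up with for you.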
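The encode, split, decode round trip can be wrapped in a small helper. This is a minimal sketch, not part of shlex: the name `split_unicode` and its `encoding` parameter are hypothetical, and it assumes the input is a text (unicode) string. On Python 3, where shlex.split() accepts text directly, it simply delegates.

```python
import shlex
import sys

def split_unicode(command, encoding="utf-8"):
    """Split a text command line with shlex, keeping non-ASCII characters.

    Hypothetical helper: assumes `command` is a unicode/text string.
    """
    if sys.version_info[0] >= 3:
        # Python 3: shlex.split() works on str (text) directly.
        return shlex.split(command)
    # Python 2: encode to bytes, split, then decode each token back to unicode.
    return [token.decode(encoding)
            for token in shlex.split(command.encode(encoding))]

command_full = u'software.py -fileA="sequence.fasta" -fileB="\u65b0\u5efa\u6587\u672c\u6587\u6863.fasta.txt"'
print(split_unicode(command_full))
```

UTF-8 is the default here because it can represent any code point; iso-8859-1 would only work for input that stays inside the Latin-1 range.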

For "python - shlex.split still not supporting unicode?", we found a similar question on Stack Overflow: https://stackoverflow.com/questions/14218992/


