gpt4 book ai didi

python - 使用非 BMP 字符引发错误会重新启动 shell

转载 作者:行者123 更新时间:2023-12-01 07:59:12 27 4
gpt4 key购买 nike

我正在编写一个 python 模块,旨在在 pygame 中显示和输入表情符号。这意味着我经常使用非 BMP Unicode 字符,显然 python shell 不喜欢。

我制作了一个自定义的类似字符串的对象,通过将表情符号序列存储为单个字符,可以更轻松地处理表情符号字符和序列。然而,虽然我希望 str(self) 返回对象的原始 Unicode 表示形式,但这会在尝试打印时导致问题,或者更糟糕的是,当它包含在错误消息中时。

这是错误消息中包含非 BMP 字符时发生的情况的示例。在 Windows 10 上运行 Python 3.7.3。

>>> raise ValueError('Beware the non-BMP! \U0001f603')
Traceback (most recent call last):
File "<pyshell#0>", line 1, in <module>
raise ValueError('Beware the non-BMP! \U0001f603')
Traceback (most recent call last):
File "<pyshell#0>", line 1, in <module>
raise ValueError('Beware the non-BMP! \U0001f603')
Traceback (most recent call last):
File "D:\Python37\lib\idlelib\run.py", line 474, in runcode
exec(code, self.locals)
File "<pyshell#0>", line 1, in <module>
Traceback (most recent call last):
File "D:\Python37\lib\idlelib\run.py", line 474, in runcode
exec(code, self.locals)
File "<pyshell#0>", line 1, in <module>
ValueError:

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
File "D:\Python37\lib\idlelib\run.py", line 144, in main
ret = method(*args, **kwargs)
File "D:\Python37\lib\idlelib\run.py", line 486, in runcode
print_exception()
File "D:\Python37\lib\idlelib\run.py", line 234, in print_exception
print_exc(typ, val, tb)
File "D:\Python37\lib\idlelib\run.py", line 232, in print_exc
print(line, end='', file=efile)
File "D:\Python37\lib\idlelib\run.py", line 362, in write
return self.shell.write(s, self.tags)
File "D:\Python37\lib\idlelib\rpc.py", line 608, in __call__
value = self.sockio.remotecall(self.oid, self.name, args, kwargs)
File "D:\Python37\lib\idlelib\rpc.py", line 220, in remotecall
return self.asyncreturn(seq)
File "D:\Python37\lib\idlelib\rpc.py", line 251, in asyncreturn
return self.decoderesponse(response)
File "D:\Python37\lib\idlelib\rpc.py", line 271, in decoderesponse
raise what
UnicodeEncodeError: 'UCS-2' codec can't encode characters in position 32-32: Non-BMP character not supported in Tk

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
File "D:\Python37\lib\idlelib\run.py", line 158, in main
print_exception()
File "D:\Python37\lib\idlelib\run.py", line 234, in print_exception
print_exc(typ, val, tb)
File "D:\Python37\lib\idlelib\run.py", line 220, in print_exc
print_exc(type(context), context, context.__traceback__)
File "D:\Python37\lib\idlelib\run.py", line 232, in print_exc
print(line, end='', file=efile)
File "D:\Python37\lib\idlelib\run.py", line 362, in write
return self.shell.write(s, self.tags)
File "D:\Python37\lib\idlelib\rpc.py", line 608, in __call__
value = self.sockio.remotecall(self.oid, self.name, args, kwargs)
File "D:\Python37\lib\idlelib\rpc.py", line 220, in remotecall
return self.asyncreturn(seq)
File "D:\Python37\lib\idlelib\rpc.py", line 251, in asyncreturn
return self.decoderesponse(response)
File "D:\Python37\lib\idlelib\rpc.py", line 271, in decoderesponse
raise what
UnicodeEncodeError: 'UCS-2' codec can't encode characters in position 32-32: Non-BMP character not supported in Tk

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
File "<string>", line 1, in <module>
File "D:\Python37\lib\idlelib\run.py", line 162, in main
traceback.print_exception(type, value, tb, file=sys.__stderr__)
File "D:\Python37\lib\traceback.py", line 105, in print_exception
print(line, file=file, end="")
File "D:\Python37\lib\idlelib\run.py", line 362, in write
return self.shell.write(s, self.tags)
File "D:\Python37\lib\idlelib\rpc.py", line 608, in __call__
value = self.sockio.remotecall(self.oid, self.name, args, kwargs)
File "D:\Python37\lib\idlelib\rpc.py", line 220, in remotecall
return self.asyncreturn(seq)
File "D:\Python37\lib\idlelib\rpc.py", line 251, in asyncreturn
return self.decoderesponse(response)
File "D:\Python37\lib\idlelib\rpc.py", line 271, in decoderesponse
raise what
UnicodeEncodeError: 'UCS-2' codec can't encode characters in position 32-32: Non-BMP character not supported in Tk

=============================== RESTART: Shell ===============================

正如您所看到的,shell 似乎进入了一个无限循环来尝试处理错误,然后重新启动 shell 以防止卡住。有什么方法可以 a) 让 str 以不同的方式为错误处理程序工作,或者 b) 阻止 shell 重新启动,以便正确显示错误?

最佳答案

从snakecharmerb 和 these 汲取灵感two问题,我已经实现了一些代码来检查模块是否在 IDLE 中运行,如果是,则错误处理程序是否正在调用该函数。测试似乎运行良好。我对 IDLE 运行环境进行了以下检查

IN_IDLE = False
for item in ['idlelib.__main__','idlelib.run','idlelib']:
IN_IDLE = IN_IDLE or item in sys.modules

下面是新的 __str__ 函数

    def __str__(self):
""" Return str(self). """
if IN_IDLE:
# Check for caller. If string is being printed, modify
# output to be IDLE-friendly (no non-BMP characters)
callername = sys._getframe(1).f_code.co_name
if callername == '_some_str':
rstr = ''
for char in self.__raw:
if ord(char) > 0xFFFF:
rstr += '\\U'+hex(ord(char))[2:].zfill(8)
else:
rstr += repr(char)[1:-1]
return rstr
else:
return self.__raw
else:
return self.__raw

其中 self.__raw 保存对象的原始文本表示形式。我缓存它是为了提高效率,因为对象是不可变的。

当然,虽然这确实解决了这个问题,但我觉得 python 不应该在发生这种情况时重新启动整个 shell。将在 bugs.python.org 上发布

编辑:在 bugs.python.org 上发布为 issue 36698

关于python - 使用非 BMP 字符引发错误会重新启动 shell,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/55793287/

27 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com