python - How to remove all emoji (Unicode) characters from a string in Python

Reposted by: 行者123 Updated: 2023-12-01 06:47:26

I have the following string:

tweet = "Get $10 worth of AMAL!!\\nThis campaign will be final AirDrop before official release!!\\nhttps://form.run/@airdrop-e\xa0\\n\\nRT please!\\n\\n#amanpuri #AMAL\\n#BTC #XRP #ETH \\n#cryptocurrency  \\n#China #bitcoin \\n#\\xe3\\x82\\xa2\\xe3\\x83\\x9e\\xe3\\x83\\xb3\\xe3\\x83\\x97\\xe3\\x83\\xaa"

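I need to clean it, but I am stuck on removing the symbols at the end of the string, i.e. \\n#\\xe3\\x82\\xa2\\xe3, which are most likely Unicode symbols or emoji, as well as the newlines \\n. Here is what I have done: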

pat1 = r'@[A-Za-z0-9]+' # this is to remove any text with @ (links)
pat2 = r'https?://[A-Za-z0-9./]+'  # this is to remove the urls
pat3 = r'[^a-zA-Z0-9$]' # to remove every other character except a-z & 0-9 & $
combined_pat2 = r'|'.join((r'|'.join((pat1, pat2)),pat3)) # combine pat1, pat2 and pat3 to pass it in the cleaning steps

I get the following output:

get $10 worth of amal   nthis campaign will be final airdrop before official release   n   e  n nrt please  n n amanpuri  amal n btc  xrp  eth  n cryptocurrency   n china  bitcoin  n  xe3 x82 xa2 xe3 x83 x9e xe3 x83 xb3 xe3 x83 x97 xe3 x83 xaa

So I still have all these n and xe3 fragments left over. Can someone suggest a Python regex for this purpose? Thanks in advance.

Best answer

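These are not single characters. They are escape sequences stored as literal text: a backslash followed by n, or by x and two more characters. You can use this regex to match them: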

r'\\(n|x..)'
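To see what this pattern picks up, here is a minimal sketch on a shortened version of the tweet. The `sample` string and the stricter `hex_only` variant are illustrative assumptions, not part of the original answer. Because the literal is written with `\\n`, the string holds a real backslash followed by the letter n, which is why a filter such as pat3 removes the backslash and leaves the n behind.

```python
import re

# Shortened sample: "\\n" in the literal is a backslash plus "n", not a newline.
sample = "#China #bitcoin \\n#\\xe3\\x82\\xa2"

# The answer's pattern: a backslash, then "n", or "x" plus any two characters.
loose = re.compile(r'\\(n|x..)')

# Hypothetical stricter variant: require two hex digits after "x".
hex_only = re.compile(r'\\(?:n|x[0-9A-Fa-f]{2})')

# group(0) is the full matched text, including the leading backslash.
loose_hits = [m.group(0) for m in loose.finditer(sample)]
# With no capturing group, findall returns the full matches directly.
strict_hits = hex_only.findall(sample)

print(loose_hits)
print(strict_hits)
```

On this sample both patterns find the same four sequences. The stricter form only matters if the text may contain a backslash followed by x and two non-hex characters that should be kept.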

If you want to remove them, use:

import re
tweet = re.sub(r'\\(n|x..)', '', tweet)
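One way to combine this with the patterns from the question is sketched below. It assumes the escapes should be removed before the other patterns run, and that the text is lowercased afterwards, as the output shown in the question suggests. The helper name `clean_tweet`, the shortened tweet, and the hex-digit variant of the pattern are hypothetical choices made for illustration.

```python
import re

# Literal escape sequences: backslash + "n", or backslash + "x" + two hex digits.
ESCAPES = re.compile(r'\\(?:n|x[0-9A-Fa-f]{2})')
# Mentions and URLs, using the patterns from the question.
MENTIONS_URLS = re.compile(r'@[A-Za-z0-9]+|https?://[A-Za-z0-9./]+')
# Everything except ASCII letters, digits and "$".
OTHER = re.compile(r'[^a-zA-Z0-9$]')


def clean_tweet(text):
    """Hypothetical helper: strip escapes first, then apply the other patterns."""
    text = ESCAPES.sub(' ', text)        # a space keeps neighbouring words apart
    text = MENTIONS_URLS.sub(' ', text)
    text = OTHER.sub(' ', text)
    return ' '.join(text.lower().split())  # lowercase and collapse whitespace


# Shortened version of the tweet from the question.
tweet = "Get $10 worth of AMAL!!\\nRT please!\\n\\n#BTC #XRP \\n#\\xe3\\x82\\xa2"
print(clean_tweet(tweet))  # get $10 worth of amal rt please btc xrp
```

Replacing each escape with a space rather than an empty string is a design choice: it stops words on either side of a removed \\n from being glued together, and the final split and join removes the extra spaces.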

Regarding python - How to remove all emoji (Unicode) characters from a string in Python, we found a similar question on Stack Overflow: https://stackoverflow.com/questions/59167852/
