python - 从组合(文字 ('@' )+ 'spec' )更改为关键字 ('@spec' )删除空格-6ren

python - 从组合(文字 ('@' )+ 'spec' )更改为关键字 ('@spec' )删除空格

转载作者：太空宇宙更新时间：2023-11-03 17:58:46

27

4

为什么使用 Combine(...) 保留空格，而 Keyword(...) 删除这些空格？

我需要保留匹配标记后面的空格。

测试如下:

from pyparsing import *


def parse(string, refpattern):

    print refpattern.searchString(string)

    pattern = StringStart() \
        + SkipTo(refpattern)('previous') \
        + refpattern('ref') \
        + SkipTo(StringEnd())('rest')

    print pattern.parseString(string)


string = "With @ref to_something"

identifier = Combine(Word(alphas + '_', alphanums + '_') + Optional('.' + Word(alphas)))

pattern_without_space = (CaselessKeyword('@ref') | CaselessKeyword(r'\ref')).setParseAction(lambda s, l, t: ['ref']) \
    + White().suppress() + identifier

pattern_with_space = Combine((Literal('@') | Literal('\\')).suppress() + 'ref') + White().suppress() + identifier

parse(string, pattern_without_space)
parse(string, pattern_with_space)

将输出:

[['ref', 'to_something']]
['With', 'ref', 'to_something', '']
[['ref', 'to_something']]
['With ', 'ref', 'to_something', '']
#     ^ space i need is preserved here

最佳答案

将交替(| 运算符)与 CaselessKeyword 结合使用时会出现此问题。请参阅这些示例:

from pyparsing import *

theString = 'This is @Foo Bar'
identifier = Combine(Word(alphas + '_', alphanums + '_') + Optional('.' + Word(alphas)))
def testParser(p):
  q = StringStart() + SkipTo(p)("previous") + p("body") + SkipTo(StringEnd())("rest")
  return q.parseString(theString)

def test7():
  p0 = (CaselessKeyword('@Foo') | Literal('@qwe')) + White().suppress() + identifier
  p1 = (CaselessKeyword('@Foo') | CaselessKeyword('@qwe')) + White().suppress() + identifier
  p2 = (Literal('@qwe') | CaselessKeyword('@Foo')) + White().suppress() + identifier
  p3 = (CaselessKeyword('@Foo')) + White().suppress() + identifier
  p4 = Combine((Literal('@') | Literal('\\')).suppress() + 'Foo') + White().suppress() + identifier
  print "p0:", testParser(p0)
  print "p1:", testParser(p1)
  print "p2:", testParser(p2)
  print "p3:", testParser(p3)
  print "p4:", testParser(p4)

test7()

输出为:

p0: ['This is', '@Foo', 'Bar', '']
p1: ['This is', '@Foo', 'Bar', '']
p2: ['This is', '@Foo', 'Bar', '']
p3: ['This is ', '@Foo', 'Bar', '']
p4: ['This is ', 'Foo', 'Bar', '']

也许这是一个错误？

更新:这是您定义自己的解析器以匹配 @Foo 或 \Foo 作为关键字的方法:

from pyparsing import *
import string

class FooKeyWord(Token):
  alphas = string.ascii_lowercase + string.ascii_uppercase
  nums       = "0123456789"
  alphanums  = alphas + nums

  def __init__(self):
    super(FooKeyWord,self).__init__()
    self.identChars = alphanums+"_$"
    self.name = "@Foo"
  def parseImpl(self, instring, loc, doActions = True):
    if (instring[loc] in ['@', '\\'] and
         instring.startswith('Foo', loc+1) and
         (loc+4 >= len(instring) or instring[loc+4] not in self.identChars) and
         (loc == 0 or instring[loc-1].upper() not in self.identChars)):
         return loc+4, instring[loc] + 'Foo'
    raise ParseException(instring, loc, self.errmsg, self)

def test8():
  p = FooKeyWord() + White().suppress() + identifier
  q = StringStart() + SkipTo(p)("previous") + p("body") + SkipTo(StringEnd())("rest")
  print "with @Foo:", q.parseString("This is @Foo Bar")
  print "with \\Foo:", q.parseString("This is \\Foo Bar")

输出:

with @Foo: ['This is ', '@Foo', 'Bar', '']
with \Foo: ['This is ', '\\Foo', 'Bar', '']

关于python - 从组合(文字 ('@' )+ 'spec' )更改为关键字 ('@spec' )删除空格，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/28050308/

27

4

0

文章推荐： c# - 对多对象列表进行排序的正确方法？

文章推荐： ruby-on-rails - 如何转换字符串以便在 URL 中正确传递它？

java - 正则表达式 30 个数字 + 空格 + 连字符 + 空格 1 个数字
我有这个代码来查找这个模式:201409250200131738007947036000 - 1，在文本内 final String patternStr = "(\\d{
正则表达式删除方括号/空格
我正在尝试使用正则表达式清除一些用户输入，以删除 [ 和 ] 并删除任何大于 1 个空格的空格。但我似乎无法实现我想要的效果。这是我第一次使用正则表达式，所以我对如何写出来有点困惑。 (preg_re
Java正则表达式匹配单词+空格
我正在尝试构建这个简单的正则表达式来匹配 Java 中的单词+空格，但我在尝试解决它时感到困惑。该网站上有很多类似的示例，但答案大多给出了正则表达式本身，而没有解释它是如何构造的。我正在寻找的是形成
Python删除行之间的输入/空格
好吧，我已经阅读了很多建议如何消除多余空间的帖子，但无论出于何种原因，我似乎无法将这些建议应用到我的系统中，所以我在这里寻求您的帮助。这些是我代码的最后几行: for line in rli
javascript - 如何删除某些空的新行/空格
所以我正在我的测试存储上学习网页抓取，但我不确定如何正确地从“sizes”数组中删除空的新行。 const $ = cheerio.load(body) $('div.lis
javascript - 输入表单中不允许有空白字符/空格
这个问题已经有答案了: How to prevent invalid characters from being typed into input fields (8 个回答) 已关闭 9 年前。是
java - 忽略空格、空格
有人知道如何让扫描仪忽略空间吗？我想输入名字和第二个名字，但扫描仪不让我输入，我想保存全名 String name; System.out.print("Enter name: "); name =
VIM:空格/制表符缩进
这个问题在这里已经有了答案: Make Vim show ALL white spaces as a character (23 个回答) 关闭 8 年前。 VIM(使用 Solarized Dar
java - 流标记器、空格
我想使用 StreamTokenizer 从 java 文件中提取名称。我已将空格设置为逗号 inputTokenizer.whitespaceChars(',', ','); 但是，
Java:读取txt文件并将其保存在字符串数组中但不带反斜杠(空格)？
我正在使用此代码逐行读取 txt 文件。 // Open the file that is the first command line parameter FileInputStream fstre
Java 正则表达式 - 空格
我似乎无法弄清楚我需要的正则表达式。这就是我想要实现的目标: {ANY CHAR} + @javax.persistence.Column(name = "{ANY 30 CHARS}") + {AN
StyleCop 和 = 空格
我正在运行 StyleCop(顺便说一句，如果你想提供高质量的代码，我完全推荐它)... 我有这条线 [System.Xml.Serialization.XmlRootAttribute(Namesp
PhpStorm 在每次保存时删除制表符/空格
我刚刚更新到 PhpStorm 2016，我突然注意到，每次我按 Ctrl + S 保存文件时，它都会删除我在测试这段代码后按下以继续编写的空格/制表符。请帮忙，这对我来说很烦人，因为我在每一行代码
c - 输入名称(空格)
关闭。此题需要details or clarity 。目前不接受答案。想要改进这个问题吗？通过 editing this post 添加详细信息并澄清问题. 已关闭 7 年前。 Improve th
c - 删除c程序中的制表符/空格
已关闭。此问题不符合Stack Overflow guidelines 。目前不接受答案。要求提供代码的问题必须表现出对所解决问题的最低限度的了解。包括尝试的解决方案、为什么它们不起作用以及预期结果
路径中的 C# 空格
我已经看过几十个关于这个主题的问题和答案，但我仍然无法解决我的问题。我在我的代码中使用了一个外部 ffmpeg 转换器，我将文件路径作为参数传递，如下所示: OutputPackage oo = c
c - 空格、特殊字符和转义序列
谁能详细解释一下它们是什么以及它们之间的区别。提前致谢。最佳答案转义序列是代表其他内容的字符序列。例如(“\n” = 新行，“\?” = 问号等)。有关更详细的列表，请检查:https://en.
javascript - 从数组中删除换行符/空格
我无法从我的 javascript 文本中删除换行符。这是我正在处理的数据示例: 0: "Christian Pulisic" 1: "↵" 2: "From Wikipedia, the free
java - 从字符串Java的开头和结尾删除新行/空格
我有一个问题 - 我似乎无法从字符串的开头/结尾删除新行/空格。我在正则表达式的开头和结尾使用 \s ，甚至在获取字符串后使用 .trim() ，但无济于事。 public void extractI
用于超链接的变量中的 PHP 空格
我是 php 的新手，我正在尝试将一系列变量添加到 html 超链接中。但是，任何返回空格的变量都会弄乱超链接。 Grants Test

首页

博学

6Ren·AI

商城

python - 从组合(文字 ('@' )+ 'spec' )更改为关键字 ('@spec' )删除空格