c++ - boost spirit 还原解析-6ren

c++ - boost spirit 还原解析

转载作者：行者123 更新时间：2023-11-30 02:46:00

25

4

我想解析一个包含以下结构的文件:

some
garbage *&%
section1 {
    section_content
}
section2 {
    section_content
}

解析 section_name1 { ... } section_name2 { ... } 的规则已经定义:

section_name_rule = lexeme[+char_("A-Za-z0-9_")];
section = section_name_rule > lit("{") > /*some complicated things*/... > lit("}");
sections %= +section;

所以我需要跳过所有垃圾，直到满足 sections 规则。有什么办法可以做到这一点？我试过 seek[sections]，但它似乎不起作用。

编辑:我本地化了 seek 不起作用的原因:如果我使用 follows operator(>>>)，那么它就起作用了。如果使用期望解析器 (>)，则它会抛出异常。这是一个示例代码:

#define BOOST_SPIRIT_DEBUG
#include <boost/fusion/adapted/struct.hpp>
#include <boost/spirit/include/qi.hpp>
#include <boost/spirit/repository/include/qi_seek.hpp>
#include <boost/spirit/include/phoenix.hpp>

namespace qi = boost::spirit::qi;
using boost::phoenix::push_back;

struct section_t {
    std::string name, contents;
    friend std::ostream& operator<<(std::ostream& os, section_t const& s) { return os << "section_t[" << s.name << "] {" << s.contents << "}"; }
};

BOOST_FUSION_ADAPT_STRUCT(section_t, (std::string, name)(std::string, contents))

    typedef std::vector<section_t> sections_t;

    template <typename It, typename Skipper = qi::space_type>
    struct grammar : qi::grammar<It, sections_t(), Skipper>
{
    grammar() : grammar::base_type(start) {
        using namespace qi;
        using boost::spirit::repository::qi::seek;
        section_name_rule = lexeme[+char_("A-Za-z0-9_")];
        //Replacing '>>'s with '>'s throws an exception, while this works as expected!!
        section = section_name_rule
            >>
            lit("{") >> lexeme[*~char_('}')] >> lit("}");
        start = seek [ hold[section[push_back(qi::_val, qi::_1)]] ]
            >> *(section[push_back(qi::_val, qi::_1)]);
    }
    private:
    qi::rule<It, sections_t(),  Skipper> start;
    qi::rule<It, section_t(),   Skipper> section;
    qi::rule<It, std::string(), Skipper> section_name_rule;
};

int main() {
    typedef std::string::const_iterator iter;
    std::string storage("sdfsdf\n sd:fgdfg section1 {dummy } section2 {dummy  } section3 {dummy  }");
    iter f(storage.begin()), l(storage.end());
    sections_t sections;
    if (qi::phrase_parse(f, l, grammar<iter>(), qi::space, sections))
    {
        for(auto& s : sections)
            std::cout << "Parsed: " << s << "\n";
    }
    if (f != l)
        std::cout << "Remaining unparsed: '" << std::string(f,l) << "'\n";
}

所以在真实的例子中，我的整个语法都是用期望运算符构造的。我是否必须更改所有内容才能使“搜索”工作，或者是否有任何其他方法(比方说，搜索一个简单的“{”，然后将一个 section_name_rule 还原回来)？？

最佳答案

这是一个演示，以哈姆雷特为灵感: Live On Coliru

start = *seek [ no_skip[eol] >> hold [section] ];

注意事项:

降低期望值
通过要求在节名之前开始行来优化

示例输入:

some
garbage *&%
section1 {
   Claudius: ...But now, my cousin Hamlet, and my son —
   Hamlet: A little more than kin, and less than kind.
}
WE CAN DO MOAR GARBAGE
section2 {
   Claudius: How is it that the clouds still hang on you?
   Hamlet: Not so my lord; I am too much i' the sun 
}

输出:

Parsed: section_t[section1] {Claudius: ...But now, my cousin Hamlet, and my son —
   Hamlet: A little more than kin, and less than kind.
}
Parsed: section_t[section2] {Claudius: How is it that the clouds still hang on you?
   Hamlet: Not so my lord; I am too much i' the sun 
}

引用 list

// #define BOOST_SPIRIT_DEBUG
#include <boost/fusion/adapted/struct.hpp>
#include <boost/spirit/include/qi.hpp>
#include <boost/spirit/repository/include/qi_seek.hpp>

namespace qi = boost::spirit::qi;

struct section_t { 
    std::string name, contents;
    friend std::ostream& operator<<(std::ostream& os, section_t const& s) { return os << "section_t[" << s.name << "] {" << s.contents << "}"; }
};

BOOST_FUSION_ADAPT_STRUCT(section_t, (std::string, name)(std::string, contents))

typedef std::vector<section_t> sections_t;

template <typename It, typename Skipper = qi::space_type>
struct grammar : qi::grammar<It, sections_t(), Skipper>
{
    grammar() : grammar::base_type(start) {
        using namespace qi;
        using boost::spirit::repository::qi::seek;

        section_name_rule = lexeme[+char_("A-Za-z0-9_")];
        section           = section_name_rule >> '{' >> lexeme[*~char_('}')] >> '}';
        start             = *seek [ no_skip[eol] >> hold [section] ];

        BOOST_SPIRIT_DEBUG_NODES((start)(section)(section_name_rule))
    }
  private:
    qi::rule<It, sections_t(),  Skipper> start;
    qi::rule<It, section_t(),   Skipper> section;
    qi::rule<It, std::string(), Skipper> section_name_rule;
};

int main() {
    using It = boost::spirit::istream_iterator;
    It f(std::cin >> std::noskipws), l;

    sections_t sections;
    if (qi::phrase_parse(f, l, grammar<It>(), qi::space, sections))
    {
        for(auto& s : sections)
            std::cout << "Parsed: " << s << "\n";
    }
    if (f != l)
        std::cout << "Remaining unparsed: '" << std::string(f,l) << "'\n";
}

关于c++ - boost spirit 还原解析，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/23953804/

25

4

0

文章推荐： java - Android - 使自定义 View 的透明区域不可点击

文章推荐： c++ - 出于测试目的覆盖非虚函数

文章推荐： c++ - VS2010不理解自己文件的字符串编码

boost-spirit - Boost Spirit X3 量产准备好了吗？
我正在将一个手写解析器迁移到 Boost.Spirit (2.5.4)。第一印象是积极的，但由于我使用的是 C++17，X3 似乎是一个非常有吸引力的选择。幸运的是，有很多关于 X3 的可用资源:
boost-spirit - boost::spirit::qi 前瞻以匹配字符串中的最后一次出现
是否可以使用 boost::spirit::qi 来解析以下内容？ A_B --> (A, B) A_B_C --> (A_B, C) A_B_C_D --> (A_B_
boost-spirit - 使用 Spirit.Qi 消除语法糖
我正在尝试解析一种类似 lisp 的语言，它具有一些通用功能的语法糖。例如，plus 函数可以写成 (+ 1 2) 或 1 + 2。我认为在尝试解释语言之前消除句法糖会显着促进解释过程，因为那样的话，
boost-spirit - 使用 Spirit.Qi 消除语法糖
我正在尝试解析一种类似 lisp 的语言，它具有一些通用功能的语法糖。例如，plus 函数可以写成 (+ 1 2) 或 1 + 2。我认为在尝试解释语言之前消除句法糖会显着促进解释过程，因为那样的话，
c++ - 如何使用存储在 boost spirit 闭包中的变量作为 boost spirit 循环解析器的输入？
我想使用解析后的值作为循环解析器的输入。语法定义了一个 header ，它指定了以下字符串的(可变)大小。例如，假设以下字符串是某个解析器的输入。 12\r\nTest Payload 解析器应提取
c++ - 有没有办法将 spirit::lex 字符串标记的内容匹配为 spirit::qi 语法中的文字
我正在编写 DSL 并使用 Boost Spirit 词法分析器来标记我的输入。在我的语法中，我想要一个类似于此的规则(其中 tok 是词法分析器): header_block = tok.n
boost-spirit - 从 boost Spirit 语法中获取结果(phoenix push_back 导致编译错误)
我有以下精神语法。我正在尝试在 struct myresult 中创建 AST 节点的向量使用标准 push_back(at_c(qi::_val), qi::_1)但出现编译错误(见下文)。 typ
c++ - boost::spirit 绑定(bind)函数提供参数作为 spirit:qi::_val
需要为 std::pair 对象提供类型为 boost::variant 的对象的值。您将如何使用其他资源来实现这个想法？下面还有其他方法吗？ struct aggr_pair_visitor
c++ - 如何结合 boost::spirit::lex 和 boost::spirit::qi？
我有一个词法分析器，基于该词法分析器，我现在想创建一个使用该词法分析器生成的标记的语法。我尝试改编我发现的一些示例，现在我有一些可以编译和工作的东西至少有一点，但我的一个应该失败的测试却没有。现在我想
c++ - 使用 spirit::qi 时如何忽略 spirit::Lex 的 token 属性？
当我使用此 qi 语法从 Lex 接受标记时: pair %= token(ID_MARKER) >> ':' >> atom >> ',' >> atom
c++ - boost::spirit::qi::double_ 和 boost::spirit::qi::int_
如何解析可能包含 double 或 int 的字符串，具体取决于是否设置了点。例如。 6.0是double类型，6是int类型。规则是 rule,skipper> r = qi::double_|qi
c++ - boost spirit 语法错误 - "no type named ‘size’ 中的 ‘struct boost::spirit::unused_type’“
请帮助我诊断以下错误。我有一个简单的语法: struct json_start_elem_grammar_object : qi::grammar { json_start_elem_gramma
c++ - 使用 Boost.Spirit.Lex 和 Boost.Spirit.Qi 解析 "true"和 "false"
作为使用 Boost.Spirit 的更大语法的第一阶段，我尝试解析“true”和“false”以生成相应的 bool 值，true 和 false. 我正在使用 Spirit.Lex 对输入进行标记
Boost Spirit 将表达式标记化为向量
我正在尝试解析一个也可以包含标识符的表达式并将每个元素推送到 std::vector 中，我想出了以下语法: #include #include #include #include name
boost-spirit - 如果使用惰性求值实现三元类型
我正在为 if 函数实现生产规则: qi::rule f_if; f_if = qi::ascii::string("if") >> qi::char_('(')
Boost::spirit 序列没有被解析
我编写了这段代码示例并期望它打印OPERATION( OPERATOR(aaa) ID(bbb) ) 但我只得到OPERATION ( OPERATOR(aaa) )反而。 result2 和 it1
c++ - Spirit QI解析器结束EOM
我的数据定义为: std::string data("START34*23*43**"); 我的语法: "START" >> boost::spirit::hex % '*' 题: 如何解析有两颗星的
Boost::spirit 序列没有被解析
我编写了这段代码示例并期望它打印OPERATION( OPERATOR(aaa) ID(bbb) ) 但我只得到OPERATION ( OPERATOR(aaa) )反而。 result2 和 it1
c++ - spirit 上如何解析字符串并将其用作返回值
我需要解析一个键值对，其中键本身是示例中的固定字符串lke'cmd'。不幸的是qi::lit没有综合属性，并且qi::char_没有解析固定的字符串。以下代码无法编译。执行后，我需要那个result
c++ - Spirit X3组合属性
我正在尝试编写精神规则，但我无法弄清楚这个新规则的属性是什么。以下代码按我预期的方式工作。 #include #include #include #include #include nam

首页

博学

6Ren·AI

商城

c++ - boost spirit 还原解析

引用 list