作者热门文章
- c - 在位数组中找到第一个零
- linux - Unix 显示有关匹配两种模式之一的文件的信息
- 正则表达式替换多个文件
- linux - 隐藏来自 xtrace 的命令
在构建支持 Unicode 国际组件 (ICU) 的 boost::regex 1.52 版库后,匹配不区分大小写的正则表达式似乎无法按预期处理大写和小写德语变音字符。
static const std::string pattern("^.*" "\303\226" ".*$");
static const std::string test1("SCH" "\303\226" "NE");
static const std::string test2("sch" "\303\266" "ne");
static const boost::regex exp(pattern, boost::regex::icase);
const char *result = (boost::regex_match(test1, exp)) ? "Match" : "NoMatch";
std::cout << "Testing \"" << test1 << "\" against pattern \"" << pattern
<< "\" : " << result << std::endl;
result = (boost::regex_match(test2, exp)) ? "Match" : "NoMatch";
std::cout << "Testing \"" << test2 << "\" against pattern \"" << pattern
<< "\" : " << result << std::endl;
产量:
Testing "SCHÖNE" against pattern "^.*Ö.*$" : Match
Testing "schöne" against pattern "^.*Ö.*$" : NoMatch
最佳答案
Working with Unicode and ICU string types .
#include <iostream>
#include <boost/regex.hpp>
#include <boost/regex/icu.hpp>
int main()
{
static const std::string pattern("^.*" "\303\226" ".*$");
static const std::string test1("SCH" "\303\226" "NE");
static const std::string test2("sch" "\303\266" "ne");
static const boost::u32regex exp=boost::make_u32regex(pattern, boost::regex::icase);
const char *result = (boost::u32regex_match(test1, exp)) ? "Match" : "NoMatch";
std::cout << "Testing \"" << test1 << "\" against pattern \"" << pattern
<< "\" : " << result << std::endl;
result = (boost::u32regex_match(test2, exp)) ? "Match" : "NoMatch";
std::cout << "Testing \"" << test2 << "\" against pattern \"" << pattern
<< "\" : " << result << std::endl;
}
关于c++ - boost::regex 与 UTF-8 不区分大小写匹配(例如,大写与小写变音),我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/15906136/
我想解析以下 xml 结构: 为了解析这个 element name="?????? 结构,我按以下方式使用 XPath: XPath xPath
我在 OSX 机器上运行 Python 2.7。我正在尝试在 smb 共享上执行 os.walk。 for root, dirnames, filenames in os.walk("./test")
我是一名优秀的程序员,十分优秀!