- html - 出于某种原因,IE8 对我的 Sass 文件中继承的 html5 CSS 不友好?
- JMeter 在响应断言中使用 span 标签的问题
- html - 在 :hover and :active? 上具有不同效果的 CSS 动画
- html - 相对于居中的 html 内容固定的 CSS 重复背景?
给定维基百科文章的文本(参见 http://test.wikipedia.org/w/index.php?title=Paris&action=raw ),我想提取所有需要引用的部分。在维基百科标记中,需要引用由 {{Citation needed}}
表示,部分标题由主要部分的两个等号表示(例如 ==Section 1==
) 或三个等号(例如 ===Section 1A===
)用于小节。
我的正则表达式:
/(={2,3}[^==]+={2,3}[^==]+\{{2}Citation needed[^}}]*\}{2}[^==]*)(={2,3}[^==]+={2,3})?/g
我的正则表达式适用于许多部分,但不适用于其他部分,我不明白为什么。
确实匹配:
===Capital of France=== [[File:Elysée Palace, Paris 2005.jpg|thumb|right|The [[Élysée Palace]], residence of the [[President of France|French President]].]] Paris is the seat of France's national government. For the executive, the two chief officers each have their own official residences, which also serve as their offices. The [[President of France]] resides at the [[Élysée Palace]] in the [[8th arrondissement of Paris|8th arrondissement]], while the [[Prime Minister of France|Prime Minister]]'s seat is at the [[Hôtel Matignon]] in the [[7th arrondissement of Paris|7th arrondissement]]. Government ministries are located in various parts of the city; many are located in the 7th arrondissement, near the Matignon.
The two houses of the French Parliament are also located on the [[Rive Gauche|Left Bank]]. The upper house, the [[Senate of France|Senate]], meets in the [[Palais du Luxembourg]] in the [[6th arrondissement of Paris|6th arrondissement]], while the more important lower house, the [[Assemblée Nationale]], meets in the [[Palais Bourbon]] in the [[7th arrondissement of Paris|7th]]. The [[List of Presidents of the French Senate|President of the Senate]], the second-highest public official in France after the President of the Republic, resides in the "Petit Luxembourg", a smaller palace annex to the [[Palais du Luxembourg]].{{Citation needed|date=June 2010}}
不匹配:
===Intercommunality=== Few of the above changes have taken into account Paris' existence as an [[agglomeration]]. Unlike in most of France's major urban areas such as [[Lille]] and [[Lyon]], there is no [[commune in France#Intercommunality|intercommunal]] entity in the Paris urban area, no intercommunal council treating the problems of the region's dense urban core as a whole; Paris' alienation of its suburbs is indeed a problem today, and considered by many {{Who|date=April 2011}} to be the main causes of civil unrest such as the suburban riots in 2005. A direct result of these unfortunate events is propositions for a more efficient metropolitan structure to cover the city of Paris and some of the suburbs, ranging from a socialist idea of a loose "metropolitan conference" (''conférence métropolitaine'') to the right-wing idea of a more integrated ''Grand Paris'' ("Greater Paris").
One of the main reasons for such incoherence has been the fear felt by the French State in front of such a huge agglomeration and the desire to tap its wealth. {{Citation needed|date=April 2011}} Since the Middle Ages and particularly since the 1649 troubles (La Fronde), Paris has been considered as a source of danger. The authoritarian king Louis the XIVth built Versailles as a new political center, away from the dangerous city crowds. The conflict between the State and the City reached a climax with the Revolution of 1871 (La Commune) : the French Assembly in Bordeaux decided Paris would no longer be the capital city, while the Paris Commune discussed declaring Paris independent of France. Since then, one of the foundations of the centralized French State has been to widely distribute Paris wealth while depriving the agglomeration and keeping it divided into 8 departments and 1 200 communes. (For an analysis of the long hostility against Paris, see [http://www-ohp.univ-paris1.fr/] {{Verify source|date=April 2011}} ). Of the 22 metropolitan French regions, 19 are regularly subsidized — mostly by Paris resources — while Paris suburbs lack necessary equipment.
我的正则表达式有什么问题?
/(={2,3}((?!==).)+={2,3}((?!==)[^])+\{{2}Citation needed((?!}})[^])*\}{2}((?!==)[^])*)/g
说明:匹配任何部分或小节标题(==Section== 或 ===Subsection===),后跟不是两个等号的任何内容,然后是类似 {{Citation needed}} 的字符串,后跟通过除两个等号以外的所有其他内容。
最佳答案
问题伴随着这样的序列:{{Who|date=
。用于匹配标题和 {{Citation Needed}} 之间的文本的代码在遇到 =
关于用于匹配维基百科部分文本的 Javascript 正则表达式,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/14743757/
我正在用 yacc/bison 编写一个简单的计算器。 表达式的语法看起来有点像这样: expr : NUM | expr '+' expr { $$ = $1 + $3; } | expr '-'
我开始学习 lambda 表达式,并在以下情况下遇到了以下语句: interface MyNumber { double getValue(); } MyNumber number; nu
这两个 Linq 查询有什么区别: var result = ResultLists().Where( c=> c.code == "abc").FirstOrDefault(); // vs. va
如果我们查看 draft C++ standard 5.1.2 Lambda 表达式 段 2 说(强调我的 future ): The evaluation of a lambda-expressio
我使用的是 Mule 4.2.2 运行时、studio 7.5.1 和 Oracle JDK 1.8.0_251。 我在 java 代码中使用 Lambda 表达式,该表达式由 java Invoke
我是 XPath 的新手。我有网页的html源 http://london.craigslist.co.uk/com/1233708939.html 现在我想从上面的页面中提取以下数据 完整日期 电子
已关闭。这个问题是 off-topic 。目前不接受答案。 想要改进这个问题吗? Update the question所以它是on-topic用于堆栈溢出。 已关闭10 年前。 Improve th
我将如何编写一个 Cron 表达式以在每天上午 8 点和下午 3:30 触发?我了解如何创建每天触发一次的表达式,而不是在多个设定时间触发。提前致谢 最佳答案 你应该只使用两行。 0 8 * * *
这个问题已经有答案了: What do 3 dots next to a parameter type mean in Java? (9 个回答) varargs and the '...' argu
我是 python 新手,在阅读 BeautifulSoup 教程时,我不明白这个表达式“[x for x in titles if x.findChildren()][:-1]”我不明白?你能解释一
(?:) 这是一个有效的 ruby 正则表达式,谁能告诉我它是什么意思? 谢谢 最佳答案 正如其他人所说,它被用作正则表达式的非捕获语法,但是,它也是正则表达式之外的有效 ruby 语法。 在
这个问题在这里已经有了答案: Why does ++[[]][+[]]+[+[]] return the string "10"? (10 个答案) 关闭 8 年前。 谁能帮我处理这个 JavaSc
这个问题在这里已经有了答案: What is the "-->" operator in C++? (29 个答案) Java: Prefix/postfix of increment/decrem
这个问题在这里已经有了答案: List comprehension vs. lambda + filter (16 个答案) 关闭 10 个月前。 我不确定我是否需要 lambda 或其他东西。但是,
C 中的 assert() 函数工作原理对我来说就像一片黑暗的森林。根据这里的答案https://stackoverflow.com/a/1571360 ,您可以使用以下构造将自定义消息输出到您的断言
在this页,John Barnes 写道: If the conditional expression is the argument of a type conversion then effec
我必须创建一个调度程序,它必须每周从第一天上午 9 点到第二天晚上 11 点 59 分运行 2 天(星期四和星期五)。为此,我需要提供一个 cron 表达式。 0-0 0-0 9-23 ? * THU
我正在尝试编写一个 Linq 表达式来检查派生类中的属性,但该列表由来自基类的成员组成。下面的示例代码。以“var list”开头的 Process 方法的第二行无法编译,但我不确定应该使用什么语法来
此 sed 表达式将输入字符串转换为两行输出字符串。两条输出行中的每一行都由输入的子串组成。第一行需要转换成大写: s:random_stuff\(choice1\|choice2\){\([^}]*
我正在使用 Quartz.Net 在我的应用程序中安排我的工作。我只是想知道是否可以为以下场景构建 CRON 表达式: Every second between 2:15AM and 5:20AM 最
我是一名优秀的程序员,十分优秀!