java - 正则表达式: Match pattern `foo` but not incase if it occurs after pattern `bar`-6ren

java - 正则表达式: Match pattern `foo` but not incase if it occurs after pattern `bar`

转载作者：太空宇宙更新时间：2023-11-04 10:35:32

25

4

匹配模式foo但如果它出现在模式 bar 之后则不然。基本上给定一个字符串，我“尝试”匹配开始标签 <任何字符串 >如果匹配位于结束标记 </ 之后，则不应发生匹配任何字符串 > 。

注意:我正在“尝试”类似的方法来解决，这可能不是解决方案的实际路径。如果您能帮助解决当前问题，我将非常高兴。

所以它应该匹配:
<h1>在<h1>
<h1>在<h1> abc </h1>
<abc>在<abc>something</cde><efg>
<abc>在something<abc>something

不应匹配以下内容:
</h1>
</abc> one two three <abc> five six <abc>
one two three </abc> five six <abc>

最佳答案

最简单的解决方案是将部分工作外包给 java regex API。使用正则表达式，我们只能匹配 <[^>]*> ，即任何 html 标签。然后我们可以使用Matcher.region()将匹配限制为任何 </ 之前的字符串.

这是代码:

    // example data
    String[] inputLines = {
            "<h1>",
            "<h1> abc </h1>",
            "<abc>something</cde><efg>",
            "something<abc>something",
            "",
            "</h1>",
            "</abc> one two three <abc> five six <abc>",
            "one two three </abc> five six <abc>"
    };

    // the pattern for any html tag
    Pattern pattern = Pattern.compile("<[^>]*>");

    for (String line : inputLines) {
        Matcher matcher = pattern.matcher(line);
        // the index that we must not search after
        int undesiredStart = line.indexOf("</");

        //  undesiredStart == -1 ? line.length() : undesiredStart handles the undesired not found case. In that case the region end must be the length of the string
        matcher.region(0, undesiredStart == -1 ? line.length() : undesiredStart);

        // this is the idiom to iterate through the matches
        while (matcher.find()) {
            System.out.println(matcher.group());
        }
    }

关于java - 正则表达式: Match pattern `foo` but not incase if it occurs after pattern `bar` ，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/49537019/

25

4

0

文章推荐： python - 交替使用列表和元组

文章推荐： css - 如何改变图片背景的大小

search - lucene中BooleanClause.Occur.Must和BooleanClause.Occur.SHOULD的区别
谁能用一个例子来解释 BooleanQuery 中 lucene 中的 BooleanClause.Occur.Must 和 BooleanClause.Occur.SHOULD 之间的区别？最佳答
c++ - 插入多重集 : before the first occurence of that value instead of after the last occurence
正如标题所说，multiset 在所有相同值的范围末尾插入一个值。 (例如:在多重集 1,2,2,3 中插入 2 使其成为 1,2,2,/*new*/2,3)。如何在所有相同值范围的开头插入新值？
c++ - 插入 priority_queue : before the first occurence of that value instead of after the last occurence
所以这是与此(Inserting in a multiset: before the first occurence of that value instead of after the last o
c# - 聚合异常 "One or more errors occurred.An error occurred while sending the request."
我试图从我的 WCF .Net Framework 4.5 向 API rest 发布一个文件。这是我的代码: public string CreateConclusion(string[] inst
mysql - SQL : Count the number of occurrences occuring on output column and calculate some percentage based on the occurences
我的 SQL 查询获取固件的错误修复验证列表，例如def-456 是一张票，要求我对产品进行固件测试。 def-456 有几个记录结果的子任务。结果记录为:id:abc-123、abc-124、abc
linux - 文件操作: removing every occurence of a string execpt the first occurence after a certain different pattern
我想删除文件中多次出现的行，但想保留某些行。我该怎么做？这是我的文件的一部分，我想更改它: §M: 1, K: 2 name, time, cycle, instr, L1-mi
ssms - SQL 2016 实时查询统计错误 : "An error occurred while executing batch. Error message is: One or more errors occurred."
我正在 SSMS 中测试 SQL 2016 Live Query Stats，每次尝试时都会收到错误消息“执行批处理时出错。错误消息是:发生一个或多个错误。”并且不返回任何结果集。一位同事试过了，对他
tomcat - JBoss 缓存服务 : exception occurred in cache put error occurred after changing cache mode to REPL_SYNC
我们在 JBoss 4.2 上设置了一个水平集群。在我们将缓存模式从 REPL_ASYNC 更改为 REPL_SYNC 以解决问题之前， session 复制工作正常。我们开始看到一些 session
asp.net-mvc - "An exception occurred while processing your request. Additionally, another exception occurred while executing the custom error page..."
我正在尝试将 MVC 网站发布为 Azure 网络角色。当我在本地运行它时，一切正常。但是当我将其发布到 Azure 并浏览某些 MVC 操作时，我收到此错误: Server Error in '
linux - 未使用功能的链接器错误 : When do they occur?
假设一个静态库 libfoo 依赖于另一个静态库 libbar 的某些功能。这些和我的应用程序都是用 D 编写的。如果我的应用程序只直接使用 libfoo，并且只调用 libfoo 中的函数而不引用
eclipse - 在安装颠覆性连接器发现期间 - 'problems occurred'
我正在尝试在 Eclipse Helios 上安装 SVN 客户端，我已经从 Collaboration 节点安装了所有 SVN 模块(更新中)，现在重启后我可以选择一个连接器出现“颠覆性连接器
CakePHP错误: An internal error has occurred
我在 cakephp 中有一些代码会产生错误。这是 PHP Controller : $this->loadModel( 'Vote' ); //Newly added by amit start
Java : Occurances of a character in a String
我需要有关 Java 代码的帮助。这就是问题所在: 输入示例:AaaaaAa 输出:A 出现 7。问题是我需要它来忽略案例。请帮助我，我的代码工作正常，只是它不忽略大小写。 import jav
java - Java 中的死锁 : When they occur?
我正在为 J2ME 开发一个应用程序，有时它完全卡住并且 AMS 需要相当长的时间来关闭它.在我看来，这像是一个死锁问题。你能告诉我什么会导致死锁吗？例如，如果对象调用其自身的另一个同步方法，调用对
安卓dexguard : Multiple problems have occured?
尝试将 DEXguard 安装到 Eclipse 中的简单应用程序时出现以下错误: Errors occurred during the build. Errors running builder '
SAS 数据 : How to remove observations that only occur once
在 SAS 中，假设我有一个名为“person_groups”的数据集。它有两个变量，名为“person”和“group”。该数据集只是将每个人分配到一个组。我如何从这个数据集中删除所有在他们的组中
正则表达式 : replace the n-th occurence
有人知道如何在表达式中找到第 n 次出现的字符串以及如何用正则表达式替换它吗？例如我有以下字符串 txt sub("(^(.*?-){4}.*?)-(.*?-.*?)-", "\\1|\\3||"
emacs - emacs 中有多个 "Occur"结果缓冲区？
是否有一个包允许我为同一个缓冲区设置多个 Occur 结果缓冲区(例如 grep-a-lot: http://www.emacswiki.org/emacs/grep-a-lot.el )。我在分析
Powershell 错误处理 : do something if NO error occured
我一直在寻找这个，但似乎无法找到它。我有一个带有 try {} catch {} 语句的脚本。如果没有发生错误，我想添加一个操作。例如 try { something } catch { "Err
iphone - iPhone : Unknown Error Occurred
我正在从 iPhone 应用程序将照片上传到 Facebook。我已经让它工作了，只是有时它会返回“发生未知错误”。我不确定问题是什么。这种情况发生的概率约为 75%。其他人也遇到过这种情况吗？最

首页

博学

6Ren·AI

商城

java - 正则表达式: Match pattern `foo` but not incase if it occurs after pattern `bar`