gpt4 book ai didi

java - 查找两个特定单词之间的字符串

转载 作者:行者123 更新时间:2023-12-02 06:10:57 25 4
gpt4 key购买 nike

我有一个文本,我需要提取两个特定单词之间的数据,例如 Activity :子 Activity :之间的数据。这是我的文字:

Activity: S1. Outline Design
Sub-Activity: S3.3 Walk through Release Backlog
Question Tag: tag
Questioner (role or team): Solution Architect
Which response should the user read first?: Response 8
Responder 1 (role or team): Developer
Response 1:
Responder 2 (role or team): Scrum Master
Response 2: response2
Responder 3 (role or team): Please select:
Response 3:
Responder 4 (role or team): Please select:
Response 4:
Responder 5 (role or team): Please select:
Response 5:

我想出了这段代码,但问题是这种模式曾经用于文字,但一旦我更改了文本,它就不再起作用了,任何人都有任何想法:

private static String extractActivity(String text) {
Pattern pattern = Pattern.compile("(?:\\W|\\w)*Activity:(?:\\W)*(.*)(?:\\W)*Sub-Activity:(?:\\W|\\w)*",
Pattern.DOTALL);
Matcher matcher = pattern.matcher(text);
matcher.matches();
String activities = matcher.group(1);
return activities;
}

它向我显示以下错误:

Feb 19, 2014 5:06:58 PM org.apache.catalina.core.StandardWrapperValve invoke
SEVERE: Servlet.service() for servlet [default-dispatcher] in context with path [/webmi] threw exception [Request processing failed; nested exception is java.lang.IllegalStateException: No match found] with root cause
java.lang.IllegalStateException: No match found
at java.util.regex.Matcher.group(Matcher.java:485)
at com.lloydsbanking.webmi.service.RSSReaderService.extractSubActivity(RSSReaderService.java:107)
at com.lloydsbanking.webmi.service.RSSReaderService.read(RSSReaderService.java:61)
at com.lloydsbanking.webmi.web.RssController.getFeed(RssController.java:34)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:606)
at org.springframework.web.method.support.InvocableHandlerMethod.invoke(InvocableHandlerMethod.java:219)
at org.springframework.web.method.support.InvocableHandlerMethod.invokeForRequest(InvocableHandlerMethod.java:132)
at org.springframework.web.servlet.mvc.method.annotation.ServletInvocableHandlerMethod.invokeAndHandle(ServletInvocableHandlerMethod.java:104)
at org.springframework.web.servlet.mvc.method.annotation.RequestMappingHandlerAdapter.invokeHandleMethod(RequestMappingHandlerAdapter.java:745)
at org.springframework.web.servlet.mvc.method.annotation.RequestMappingHandlerAdapter.handleInternal(RequestMappingHandlerAdapter.java:686)
at org.springframework.web.servlet.mvc.method.AbstractHandlerMethodAdapter.handle(AbstractHandlerMethodAdapter.java:80)
at org.springframework.web.servlet.DispatcherServlet.doDispatch(DispatcherServlet.java:925)
at org.springframework.web.servlet.DispatcherServlet.doService(DispatcherServlet.java:856)
at org.springframework.web.servlet.FrameworkServlet.processRequest(FrameworkServlet.java:920)
at org.springframework.web.servlet.FrameworkServlet.doPost(FrameworkServlet.java:827)
at javax.servlet.http.HttpServlet.service(HttpServlet.java:647)
at org.springframework.web.servlet.FrameworkServlet.service(FrameworkServlet.java:801)
at javax.servlet.http.HttpServlet.service(HttpServlet.java:728)
at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:305)
at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:210)
at org.tuckey.web.filters.urlrewrite.RuleChain.handleRewrite(RuleChain.java:176)
at org.tuckey.web.filters.urlrewrite.RuleChain.doRules(RuleChain.java:145)
at org.tuckey.web.filters.urlrewrite.UrlRewriter.processRequest(UrlRewriter.java:92)
at org.tuckey.web.filters.urlrewrite.UrlRewriteFilter.doFilter(UrlRewriteFilter.java:394)
at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:243)
at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:210)
at com.lloydsbanking.webmi.web.VersionNumberFilter.doFilter(VersionNumberFilter.java:50)
at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:243)
at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:210)
at com.opensymphony.sitemesh.webapp.SiteMeshFilter.obtainContent(SiteMeshFilter.java:129)
at com.opensymphony.sitemesh.webapp.SiteMeshFilter.doFilter(SiteMeshFilter.java:77)

最佳答案

这个表情有点过分了。另外,也可以对部分字符串执行find(),而不是对整个字符串执行ma​​tches()\\w 匹配单词字符,\\W 匹配非单词字符。因此 \\W|\\w 可以是 ..

Pattern pattern = Pattern.compile("\\bActivity\\:(.*)\\bSub-Activity\\:",
Pattern.DOTALL);
Matcher matcher = pattern.matcher(text);
if (matcher.find()) {
String activities = matcher.group(1);
return activities;
}
throw new IllegalStateException("No activity in: " + text);

我使用 \\b 作为单词边界,匹配/消耗 0 个字符,并检测单词边界。这也适用于文本开头的 Activity。虽然我不确定,但我已经转义了冒号 (:),但 x:{2,3} 会匹配 xxxxx .

<小时/>

正如 @Pshemo 评论的那样,您的代码原则上是正确的。数据可能未按预期到达。

改变

matcher.matches();

if (!matcher.matches)) {
throw new IllegalStateException("No activity in: " + text);
}

关于java - 查找两个特定单词之间的字符串,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/21887204/

25 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com