java - NetBeans: StanfordCoreNLP error: Unrecoverable error while loading a tagger model


I am trying to use StanfordCoreNLP to distinguish between singular and plural nouns in a sentence. As a starting point, I am using the code from http://nlp.stanford.edu/software/corenlp.shtml. In NetBeans 8.0 I created a new Java project, downloaded stanford-corenlp-full-2014-06-16, and added its jar files (including the models jar) to the project:

(screenshot: the stanford-corenlp jar files, including the models jar, added under the project's Libraries)

The code of class SingularORPlural:

import java.util.LinkedList;
import java.util.List;
import java.util.Properties;

import edu.stanford.nlp.ling.CoreAnnotations.LemmaAnnotation;
import edu.stanford.nlp.ling.CoreAnnotations.SentencesAnnotation;
import edu.stanford.nlp.ling.CoreAnnotations.TokensAnnotation;
import edu.stanford.nlp.ling.CoreLabel;
import edu.stanford.nlp.pipeline.Annotation;
import edu.stanford.nlp.pipeline.StanfordCoreNLP;
import edu.stanford.nlp.util.CoreMap;

/**
 *
 * @author ha
 */
public class SingularORPlural {

    protected StanfordCoreNLP pipeline;

    public SingularORPlural() {
        // Create StanfordCoreNLP object properties, with POS tagging
        // (required for lemmatization), and lemmatization
        Properties props = new Properties();
        props.put("annotators", "tokenize, ssplit, pos, lemma");

        /*
         * This is a pipeline that takes in a string and returns various analyzed linguistic forms.
         * The String is tokenized via a tokenizer (such as PTBTokenizerAnnotator),
         * and then other sequence model style annotation can be used to add things like lemmas,
         * POS tags, and named entities. These are returned as a list of CoreLabels.
         * Other analysis components build and store parse trees, dependency graphs, etc.
         *
         * This class is designed to apply multiple Annotators to an Annotation.
         * The idea is that you first build up the pipeline by adding Annotators,
         * and then you take the objects you wish to annotate and pass them in and
         * get in return a fully annotated object.
         *
         * StanfordCoreNLP loads a lot of models, so you probably
         * only want to do this once per execution.
         */
        this.pipeline = new StanfordCoreNLP(props);
    }

    public List<String> lemmatize(String documentText) {
        List<String> lemmas = new LinkedList<String>();
        // Create an empty Annotation just with the given text
        Annotation document = new Annotation(documentText);
        // Run all Annotators on this text
        this.pipeline.annotate(document);
        // Iterate over all of the sentences found
        List<CoreMap> sentences = document.get(SentencesAnnotation.class);
        for (CoreMap sentence : sentences) {
            // Iterate over all tokens in a sentence
            for (CoreLabel token : sentence.get(TokensAnnotation.class)) {
                // Retrieve and add the lemma for each word into the
                // list of lemmas
                lemmas.add(token.get(LemmaAnnotation.class));
            }
        }
        return lemmas;
    }
}

Then in the main method:

System.out.println("Starting Stanford Lemmatizer");
String text = "How could you be seeing into my eyes like open doors? \n";
SingularORPlural slem = new SingularORPlural();
System.out.println( slem.lemmatize(text) );

I get this error:

    run:
Starting Stanford Lemmatizer
Adding annotator tokenize
Adding annotator ssplit

Adding annotator pos
Reading POS tagger model from edu/stanford/nlp/models/pos-tagger/english-left3words/english-left3words-distsim.tagger ... Exception in thread "main" java.lang.RuntimeException: edu.stanford.nlp.io.RuntimeIOException: Unrecoverable error while loading a tagger model
at edu.stanford.nlp.pipeline.StanfordCoreNLP$4.create(StanfordCoreNLP.java:558)
at edu.stanford.nlp.pipeline.AnnotatorPool.get(AnnotatorPool.java:85)
at edu.stanford.nlp.pipeline.StanfordCoreNLP.construct(StanfordCoreNLP.java:267)
at edu.stanford.nlp.pipeline.StanfordCoreNLP.<init>(StanfordCoreNLP.java:129)
at edu.stanford.nlp.pipeline.StanfordCoreNLP.<init>(StanfordCoreNLP.java:125)
at stanfordposcode.SingularORPlural.<init>(SingularORPlural.java:51)
at stanfordposcode.StanfordPOSCode.main(StanfordPOSCode.java:74)
Caused by: edu.stanford.nlp.io.RuntimeIOException: Unrecoverable error while loading a tagger model
at edu.stanford.nlp.tagger.maxent.MaxentTagger.readModelAndInit(MaxentTagger.java:857)
at edu.stanford.nlp.tagger.maxent.MaxentTagger.readModelAndInit(MaxentTagger.java:755)
at edu.stanford.nlp.tagger.maxent.MaxentTagger.<init>(MaxentTagger.java:289)
at edu.stanford.nlp.tagger.maxent.MaxentTagger.<init>(MaxentTagger.java:253)
at edu.stanford.nlp.pipeline.POSTaggerAnnotator.loadModel(POSTaggerAnnotator.java:97)
at edu.stanford.nlp.pipeline.POSTaggerAnnotator.<init>(POSTaggerAnnotator.java:77)
at edu.stanford.nlp.pipeline.StanfordCoreNLP$4.create(StanfordCoreNLP.java:556)
... 6 more
Caused by: java.io.InvalidClassException: edu.stanford.nlp.tagger.maxent.ExtractorDistsim; local class incompatible: stream classdesc serialVersionUID = 2, local class serialVersionUID = 1
at java.io.ObjectStreamClass.initNonProxy(ObjectStreamClass.java:621)
at java.io.ObjectInputStream.readNonProxyDesc(ObjectInputStream.java:1623)
at java.io.ObjectInputStream.readClassDesc(ObjectInputStream.java:1518)
at java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1774)
at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1351)
at java.io.ObjectInputStream.readArray(ObjectInputStream.java:1707)
at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1345)
at java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:1993)
at java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1918)
at java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1801)
at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1351)
at java.io.ObjectInputStream.readObject(ObjectInputStream.java:371)
at edu.stanford.nlp.tagger.maxent.MaxentTagger.readExtractors(MaxentTagger.java:582)
at edu.stanford.nlp.tagger.maxent.MaxentTagger.readModelAndInit(MaxentTagger.java:808)
... 12 more
Java Result: 1
BUILD SUCCESSFUL (total time: 3 seconds)

How can I fix this error?

Best Answer

I just had the same error, so here is what I found.

The failure happens because the old tagger file you are using ("english-left3words-distsim.tagger") is incompatible with the newer version of the StanfordCoreNLP source/binary/byte code. Everything should be consistent and compatible, i.e. come from the same distribution/build.

The simple answer is: make sure you are using the correct tagger file.
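As a quick sanity check before reworking the build, you can try loading the tagger directly from the models jar that ships with the same release, using the classpath location shown in the error log instead of a stale local copy. This is only a sketch, assuming the matching stanford-corenlp models jar is on the classpath; the class name TaggerLoadCheck is made up for illustration:

import java.io.IOException;

import edu.stanford.nlp.tagger.maxent.MaxentTagger;

public class TaggerLoadCheck {
    public static void main(String[] args) throws IOException, ClassNotFoundException {
        // Load the tagger from the models jar on the classpath, using the same
        // path that appears in the error message. If code and models come from
        // the same release, this constructor succeeds without the
        // InvalidClassException about serialVersionUID.
        MaxentTagger tagger = new MaxentTagger(
                "edu/stanford/nlp/models/pos-tagger/english-left3words/english-left3words-distsim.tagger");
        System.out.println(tagger.tagString("The cats chase the cat."));
    }
}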

These simple steps will help:

  1. Download: http://nlp.stanford.edu/software/stanford-corenlp-full-2014-06-16.zip
  2. Add this to your pom.xml (if you are using Maven):
<dependencies>
    <dependency>
        <groupId>edu.stanford.nlp</groupId>
        <artifactId>stanford-corenlp</artifactId>
        <version>3.4</version>
    </dependency>
    <dependency>
        <groupId>edu.stanford.nlp</groupId>
        <artifactId>stanford-corenlp</artifactId>
        <version>3.4</version>
        <classifier>models</classifier>
    </dependency>
</dependencies>

Then make sure it works correctly:

import java.io.IOException;

import edu.stanford.nlp.tagger.maxent.MaxentTagger;

public class TagText {
    public static void main(String[] args) throws IOException,
            ClassNotFoundException {

        // Initialize the tagger
        final MaxentTagger tagger = new MaxentTagger("taggers/english-left3words-distsim.tagger");

        // The sample strings
        final String sample1 = "This is a sample text.";
        final String sample2 = "The sailor dogs the hatch.";

        // The tagged strings
        final String tagged1 = tagger.tagString(sample1);
        final String tagged2 = tagger.tagString(sample2);

        // Output the result
        System.out.println(tagged1);
        System.out.println(tagged2);
    }
}
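To connect this back to the original goal of distinguishing singular from plural nouns, the same pipeline from the question can inspect Penn Treebank POS tags rather than lemmas: NN/NNP mark singular nouns and NNS/NNPS mark plural nouns. The sketch below is only an illustration (the class name NounNumberDemo and the sample sentence are made up); the annotation classes themselves are standard CoreNLP:

import java.util.Properties;

import edu.stanford.nlp.ling.CoreAnnotations.PartOfSpeechAnnotation;
import edu.stanford.nlp.ling.CoreAnnotations.SentencesAnnotation;
import edu.stanford.nlp.ling.CoreAnnotations.TextAnnotation;
import edu.stanford.nlp.ling.CoreAnnotations.TokensAnnotation;
import edu.stanford.nlp.ling.CoreLabel;
import edu.stanford.nlp.pipeline.Annotation;
import edu.stanford.nlp.pipeline.StanfordCoreNLP;
import edu.stanford.nlp.util.CoreMap;

public class NounNumberDemo {
    public static void main(String[] args) {
        // Only tokenize, ssplit and pos are needed for number detection
        Properties props = new Properties();
        props.put("annotators", "tokenize, ssplit, pos");
        StanfordCoreNLP pipeline = new StanfordCoreNLP(props);

        Annotation document = new Annotation("The sailor dogs the hatches.");
        pipeline.annotate(document);

        for (CoreMap sentence : document.get(SentencesAnnotation.class)) {
            for (CoreLabel token : sentence.get(TokensAnnotation.class)) {
                String word = token.get(TextAnnotation.class);
                String tag = token.get(PartOfSpeechAnnotation.class);
                // Penn Treebank tags: NN/NNP = singular noun, NNS/NNPS = plural noun
                if (tag.equals("NN") || tag.equals("NNP")) {
                    System.out.println(word + " -> singular noun (" + tag + ")");
                } else if (tag.equals("NNS") || tag.equals("NNPS")) {
                    System.out.println(word + " -> plural noun (" + tag + ")");
                }
            }
        }
    }
}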

Regarding java - NetBeans: StanfordCoreNLP error: Unrecoverable error while loading a tagger model, we found a similar question on Stack Overflow: https://stackoverflow.com/questions/24265004/
