java - 从图像到字符串的手写识别-6ren

java - 从图像到字符串的手写识别

转载作者：行者123 更新时间：2023-11-30 02:52:03

我正在使用 Encog 并运行了 ocr 示例。效果很好。但是，我想传递一个图像文件(png，jpg，...)作为参数。该图像包含要识别的文本。然后，系统应该返回一个具有“相同”文本的字符串。

有人已经做过类似的事情了吗？我应该如何开始？

谢谢!

最佳答案

第 1 步:在 GUI 中创建文件输入并从用户处获取文件

JFileChooser fc;
JButton b, b1;
JTextField tf;
FileInputStream in;
Socket s;
DataOutputStream dout;
DataInputStream din;
int i;

public void actionPerformed(ActionEvent e) {
try {
    if (e.getSource() == b) {
        int x = fc.showOpenDialog(null);
        if (x == JFileChooser.APPROVE_OPTION) {
            fileToBeSent = fc.getSelectedFile();
            tf.setText(f1.getAbsolutePath());
            b1.setEnabled(true);
        } else {
            fileToBeSent = null;
            tf.setText(null;);
            b1.setEnabled(false);
        }
    }
    if (e.getSource() == b1) {
        send();
    }
} catch (Exception ex) {
}
}

 public void copy() throws IOException {
    File f1 = fc.getSelectedFile();
    tf.setText(f1.getAbsolutePath());
    in = new FileInputStream(f1.getAbsolutePath());
    while ((i = in.read()) != -1) {
        System.out.print(i);
    }
}

public void send() throws IOException {
    dout.write(i);
    dout.flush();

}

第 2 步:向下采样

  private void processNetwork() throws IOException {
    System.out.println("Downsampling images...");

    for (final ImagePair pair : this.imageList) {
        final MLData ideal = new BasicMLData(this.outputCount);
        final int idx = pair.getIdentity();
        for (int i = 0; i < this.outputCount; i++) {
            if (i == idx) {
                ideal.setData(i, 1);
            } else {
                ideal.setData(i, -1);
            }
        }

        final Image img = ImageIO.read(fc.getFile());
        final ImageMLData data = new ImageMLData(img);
        this.training.add(data, ideal);
    }

    final String strHidden1 = getArg("hidden1");
    final String strHidden2 = getArg("hidden2");

    this.training.downsample(this.downsampleHeight, this.downsampleWidth);

    final int hidden1 = Integer.parseInt(strHidden1);
    final int hidden2 = Integer.parseInt(strHidden2);

    this.network = EncogUtility.simpleFeedForward(this.training
            .getInputSize(), hidden1, hidden2,
            this.training.getIdealSize(), true);
    System.out.println("Created network: " + this.network.toString());
}

第 3 步:开始使用训练集进行训练

 private void processTrain() throws IOException {
    final String strMode = getArg("mode");
    final String strMinutes = getArg("minutes");
    final String strStrategyError = getArg("strategyerror");
    final String strStrategyCycles = getArg("strategycycles");

    System.out.println("Training Beginning... Output patterns="
            + this.outputCount);

    final double strategyError = Double.parseDouble(strStrategyError);
    final int strategyCycles = Integer.parseInt(strStrategyCycles);

    final ResilientPropagation train = new ResilientPropagation(this.network, this.training);
    train.addStrategy(new ResetStrategy(strategyError, strategyCycles));

    if (strMode.equalsIgnoreCase("gui")) {
        TrainingDialog.trainDialog(train, this.network, this.training);
    } else {
        final int minutes = Integer.parseInt(strMinutes);
        EncogUtility.trainConsole(train, this.network, this.training,
                minutes);
    }
    System.out.println("Training Stopped...");
}

第4步:将采样文件传递给神经网络

 public void processWhatIs() throws IOException {
    final String filename = getArg("image");
    final File file = new File(filename);
    final Image img = ImageIO.read(file);
    final ImageMLData input = new ImageMLData(img);
    input.downsample(this.downsample, false, this.downsampleHeight,
            this.downsampleWidth, 1, -1);
    final int winner = this.network.winner(input);
    System.out.println("What is: " + filename + ", it seems to be: "
            + this.neuron2identity.get(winner));
 }

第5步:检查结果

关于java - 从图像到字符串的手写识别，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/38340081/

文章推荐： java.io.IOException : mark/reset not supported

文章推荐： java - 以编程方式将站点添加到 Java 的异常(exception)列表

文章推荐： java - 我应该将静态工厂方法设为 final方法吗？

(手写)PCA原理及其Python实现图文详解
目录 1、背景 2、样本均值和样本方差矩阵 3、PCA 3.1 最大投影方差 3.2 最小重构距离 4、Py
android获取屏幕的长与宽实现代码(手写)
android中获取屏幕的长于宽，参考了网上有很多代码，但结果与实际不符，如我的手机是i9000，屏幕大小是480*800px，得到的结果却为320*533 结果很不靠谱，于是自己写了几行代码，亲测
c++ - 手写 Action
我写了一个 vector 类来学习 move 语义。我使用 move 构造函数来 move T(注释行)。我的问题是为什么不像在 C 中一样复制临时对象的所有字节并将临时对象的所有字节设置为零？
C++解析代码(手写)
我需要解析一种类似于 Java 的最小化版本的语言。由于效率是最重要的因素，所以我选择手写解析器而不是像 GOLD、bison 和 yacc 这样的 LRAR 解析器生成器。但是我找不到优秀的手写解
javascript - 手写 asm.js
我正在尝试向 perlin 单纯形噪声函数添加 asm.js 注释: "use strict"; // Ported from Stefan Gustavson's java implementati
手写 call、apply 及 bind 函数
之前在bind和apply以及call函数使用中详解总结过bind和apply以及call函数的使用，下面手写一下三个函数。一、首先call函数 Function.prototype.MyCall
javascript - 手写 asm.js - 如何跟踪堆中的 javascript 对象？
我正在 asm.js 中编写优先级队列和八叉树Javascript 的子集，以便从它们中挤出最后可能的性能。但是，您如何在 asm.js 函数的 heap 中存储对 Javascript 对象的引用

行者123

个人简介

我是一名优秀的程序员,十分优秀！

作者热门文章

滴滴打车优惠券免费领取

全站热门文章

首页

博学

6Ren·AI

商城

java - 从图像到字符串的手写识别