python - Face Recognition in OpenCV Python FAR/FRR

Reposted by: 行者123. Last updated: 2023-12-01 06:58:17

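How can I run performance tests in OpenCV Python to check:

the time needed to obtain a recognition result

the false accept / false reject rates on the database test cases.

I am using the example eigenface method in OpenCV (from Philipp, https://github.com/bytefish/facerecognition_guide) and I am only interested in the results. It would be great if someone could point me in the right direction or show an example. Perhaps there are some functions I could make use of?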

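Best answer

Validating OpenCV algorithms

Introduction

First of all, sorry that it took so long to reply; there was simply no spare time left. Validating algorithms is actually a very interesting topic, and it is really not that hard. In this post I'll show how to validate your algorithms (I'll take the FaceRecognizer, because you asked for it). As always in my posts, I will show it with a full source code example, because I think it's much easier to explain things with code.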

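So whenever people tell me "my algorithm performs badly", I ask them:

What is "bad", actually?

Did you rate this by looking at a single sample?

What was your image data?

How did you split between training and test data?

What is your metric?

[...]

My hope is that this post clears up some confusion and shows how easy it is to validate algorithms. What I have learned from experimenting with computer vision and machine learning algorithms is:

Without proper validation it's all about chasing ghosts. You really, really need figures to talk about.

All code in this post is released under the BSD license, so feel free to use it in your projects.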

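Validating algorithms

One of the most important tasks of any computer vision project is acquiring image data. You need image data of the same kind you expect in production, so you won't get any bad surprises when going live. A very practical example: if you want to recognize faces in the wild, it isn't useful to validate your algorithm on images taken under very controlled conditions. Get as much data as possible, because data is king. So much for the data.

Once you have some data and have written your algorithm, you need to evaluate it. There are several validation strategies, but I think you should start with a simple cross-validation and go on from there. For information on cross-validation, see: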

Wikipedia on Cross-Validation

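Instead of implementing everything ourselves, we'll make use of scikit-learn, a great open source project:

https://github.com/scikit-learn/

It has very good documentation and tutorials for validating algorithms:

http://scikit-learn.org/stable/tutorial/statistical_inference/index.html

So the plan is the following:

Write a function to read some image data.

Wrap the cv2.FaceRecognizer in a scikit-learn estimator.

Estimate the performance of the cv2.FaceRecognizer with a given validation scheme and metric.

Profit!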

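Getting the image data right

First I'd like to write some words on the image data to be read, because questions on this almost always pop up. For the sake of simplicity I assume in the example that the images (the faces of the persons you want to recognize) are given in folders, one folder per person. So imagine I have a folder (a dataset) called images, with the subfolders person1, person2 and so on: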

philipp@mango:~/facerec/data/images$ tree -L 2 | head -n 20
.
|-- person1
|   |-- 1.jpg
|   |-- 2.jpg
|   |-- 3.jpg
|   |-- 4.jpg
|-- person2
|   |-- 1.jpg
|   |-- 2.jpg
|   |-- 3.jpg
|   |-- 4.jpg

[...]

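One publicly available dataset that already comes in such a folder structure is the AT&T Facedatabase, available at:

http://www.cl.cam.ac.uk/research/dtg/attarchive/facedatabase.html

Once unpacked it is going to look like this (on my filesystem it is unpacked to /home/philipp/facerec/data/at/, your path will be different!):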

philipp@mango:~/facerec/data/at$ tree .
.
|-- README
|-- s1
|   |-- 1.pgm
|   |-- 2.pgm
[...]
|   `-- 10.pgm
|-- s2
|   |-- 1.pgm
|   |-- 2.pgm
[...]
|   `-- 10.pgm
|-- s3
|   |-- 1.pgm
|   |-- 2.pgm
[...]
|   `-- 10.pgm

...

40 directories, 401 files
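
Before training, it can help to check that a folder really follows the one-folder-per-person layout. The following is a minimal sketch using only the standard library. The helper name count_images_per_subject is made up for illustration and is not part of the original post, and it assumes every direct subfolder of the dataset folder is one person:

```python
import os


def count_images_per_subject(path):
    """Return {subject_folder_name: number_of_files} for a dataset folder.

    Hypothetical helper: assumes each direct subfolder of `path` is one person.
    """
    counts = {}
    for name in sorted(os.listdir(path)):
        subject_path = os.path.join(path, name)
        # Skip plain files such as the README in the AT&T archive.
        if os.path.isdir(subject_path):
            counts[name] = len(os.listdir(subject_path))
    return counts
```

Going by the listing above, calling it on the unpacked AT&T folder should report 40 subjects with 10 files each; a subject with a different count hints at a missing or stray file.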

Putting it together

So first of all we'll define a function read_images for reading in the image data and labels:

import os
import sys
import cv2
import numpy as np

def read_images(path, sz=None):
    """Reads the images in a given folder, resizes images on the fly if size is given.

    Args:
        path: Path to a folder with subfolders representing the subjects (persons).
        sz: A tuple (width, height); if given, every image is resized to this size.

    Returns:
        A list [X,y]

            X: The images, which is a Python list of numpy arrays.
            y: The corresponding labels (the unique number of the subject, person) in a Python list.
    """
    c = 0
    X,y = [], []
    for dirname, dirnames, filenames in os.walk(path):
        for subdirname in dirnames:
            subject_path = os.path.join(dirname, subdirname)
            for filename in os.listdir(subject_path):
                try:
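                    im = cv2.imread(os.path.join(subject_path, filename), cv2.IMREAD_GRAYSCALE)
                    # cv2.imread returns None instead of raising for unreadable files
                    if im is None:
                        print("Skipping unreadable file: {0}".format(filename))
                        continue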
                    # resize to given size (if given)
                    if (sz is not None):
                        im = cv2.resize(im, sz)
                    X.append(np.asarray(im, dtype=np.uint8))
                    y.append(c)
                except IOError as e:
                    print("I/O error({0}): {1}".format(e.errno, e.strerror))
                except:
                    print("Unexpected error: {0}".format(sys.exc_info()[0]))
                    raise
            c = c+1
    return [X,y]

Reading in the image data then becomes as easy as calling:

[X,y] = read_images("/path/to/some/folder")

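Because some algorithms (for example Eigenfaces and Fisherfaces) require all images to be of equal size, I added a second parameter sz. By passing the tuple sz, all images get resized. So the following call will resize all images in /path/to/some/folder to 100x100 pixels: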

[X,y] = read_images("/path/to/some/folder", (100,100))

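All classifiers in scikit-learn are derived from BaseEstimator and are supposed to have a fit and a predict method. The fit method gets a list of samples X and the corresponding labels y, so it's trivial to map it to the train method of the cv2.FaceRecognizer. The predict method gets only a list of samples, and this time we need to return the prediction for each sample: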

from sklearn.base import BaseEstimator

class FaceRecognizerModel(BaseEstimator):

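    def __init__(self):
        # OpenCV 2.4 API; from OpenCV 3 on the recognizers live in the contrib module cv2.face
        self.model = cv2.createEigenFaceRecognizer()

    def fit(self, X, y):
        self.model.train(X,y)
        # scikit-learn convention: fit returns the estimator itself
        return self

    def predict(self, T):
        # len() works for Python lists as well as numpy arrays
        return [self.model.predict(T[i]) for i in range(0, len(T))]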
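
The wrapper above targets the OpenCV 2.4 Python API. As a rough sketch of how the same idea might look against newer releases, the variant below creates the recognizer inside fit (so scikit-learn can clone the estimator) and takes it from a factory function. Several things here are assumptions and not part of the original post: that the opencv-contrib-python package is installed and provides cv2.face.EigenFaceRecognizer_create, that predict returns a (label, confidence) pair, and the factory parameter itself, which is my own addition so the recognizer can be swapped out:

```python
import numpy as np
from sklearn.base import BaseEstimator


def default_factory():
    # Assumption: opencv-contrib-python is installed, which provides cv2.face.
    import cv2
    return cv2.face.EigenFaceRecognizer_create()


class FaceRecognizerEstimator(BaseEstimator):
    """Sketch of a scikit-learn style wrapper around an OpenCV face recognizer."""

    def __init__(self, factory=default_factory):
        # Hypothetical parameter: any callable returning an object with
        # train(images, labels) and predict(image) -> (label, confidence).
        self.factory = factory

    def fit(self, X, y):
        # Create a fresh recognizer for every fit, so folds do not share state.
        self.model_ = self.factory()
        self.model_.train(list(X), np.asarray(y, dtype=np.int32))
        return self

    def predict(self, X):
        # Keep only the label; the second value is the distance ("confidence").
        return np.array([self.model_.predict(x)[0] for x in X])
```

Passing a different factory, for example one that returns a Fisherfaces recognizer, changes the algorithm without touching the validation code.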

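You can then choose between a range of validation methods and metrics to test the cv2.FaceRecognizer with. You can find the available cross-validation algorithms in sklearn.cross_validation (the module was moved to sklearn.model_selection in scikit-learn 0.18 and later):

Leave-One-Out cross-validation

K-Folds cross-validation

Stratified K-Folds cross-validation

Leave-One-Label-Out cross-validation

Random sampling with replacement cross-validation

[...]

For estimating the recognition rate of the cv2.FaceRecognizer I suggest using a stratified cross-validation. You may ask why anyone needs the other cross-validation methods. Imagine you want to perform emotion recognition with your algorithm. What happens if your training set contains images of the very person you test the algorithm with? You will probably find the closest match to the person, but not the emotion. In these cases you should perform a subject-independent cross-validation.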
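
To make "subject-independent" concrete, here is a minimal sketch of a leave-one-subject-out split written with plain numpy. The function name and the toy subject list are made up for illustration. In scikit-learn the same idea is available as the Leave-One-Label-Out iterator listed above (called LeaveOneGroupOut in sklearn.model_selection in newer versions):

```python
import numpy as np


def leave_one_subject_out(subjects):
    """Yield (train_idx, test_idx) pairs; each test fold holds all samples of one subject."""
    subjects = np.asarray(subjects)
    for s in np.unique(subjects):
        test_idx = np.flatnonzero(subjects == s)
        train_idx = np.flatnonzero(subjects != s)
        yield train_idx, test_idx


# Hypothetical example: 6 images of 3 persons; the emotion labels would be kept elsewhere.
subjects = [0, 0, 1, 1, 2, 2]
for train_idx, test_idx in leave_one_subject_out(subjects):
    print("train:", train_idx, "test:", test_idx)
```

No person appears in both the training and the test part of a fold, so a good score can no longer come from simply recognizing the person.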

Creating a stratified k-fold cross-validation iterator is very simple with scikit-learn:

from sklearn import cross_validation as cval
# Then we create a 10-fold cross validation iterator:
cv = cval.StratifiedKFold(y, 10)
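
The call above uses the old sklearn.cross_validation interface. As a hedged sketch of the equivalent with the newer sklearn.model_selection module (assuming scikit-learn 0.18 or later is installed), the iterator is configured with n_splits and the labels are passed to split. The toy labels below are invented to show what "stratified" means: every test fold keeps the class proportions of the whole set.

```python
import numpy as np
from sklearn.model_selection import StratifiedKFold

# Hypothetical toy labels: 4 subjects with 10 images each.
y = np.repeat(np.arange(4), 10)
# Placeholder samples; only the labels matter for how the folds are built.
X = np.zeros((len(y), 1))

cv = StratifiedKFold(n_splits=10, shuffle=True, random_state=0)
# Count how many images of each subject end up in every test fold.
fold_counts = [np.bincount(y[test_idx], minlength=4) for _, test_idx in cv.split(X, y)]
for counts in fold_counts:
    print(counts)
```

With 10 images per subject and 10 folds, each test fold receives one image of every subject, which is why this scheme suits recognition-rate estimates on datasets like the AT&T Facedatabase.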

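There is a large range of metrics we can choose from. For now I only want to know the precision of the model, so we import the callable function sklearn.metrics.precision_score: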

from sklearn.metrics import precision_score

Now we only need to create our estimator and pass the estimator, X, y, precision_score and cv to sklearn.cross_validation.cross_val_score, which calculates the cross-validation scores for us:

# Now we'll create a classifier, note we wrap it up in the 
# FaceRecognizerModel we have defined in this file. This is 
# done, so we can use it in the awesome scikit-learn library:
estimator = FaceRecognizerModel()
# And getting the precision_scores is then as easy as writing:
precision_scores = cval.cross_val_score(estimator, X, y, score_func=precision_score, cv=cv)
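
With more than two persons, precision is a per-class quantity that has to be averaged somehow. The sketch below spells out one common choice, the macro average, in plain numpy, so it is clear what a single score per fold stands for. The function and the toy labels are my own illustration, not the code scikit-learn runs internally. In current scikit-learn versions the score_func argument has been replaced by scoring, where a string such as scoring="precision_macro" selects this kind of average.

```python
import numpy as np


def macro_precision(y_true, y_pred):
    """Average of per-class precision TP / (TP + FP) over the classes in y_true."""
    y_true = np.asarray(y_true)
    y_pred = np.asarray(y_pred)
    per_class = []
    for label in np.unique(y_true):
        predicted_as_label = (y_pred == label)
        n_predicted = predicted_as_label.sum()
        if n_predicted == 0:
            # Convention chosen for this sketch: a class that is never predicted counts as 0.
            per_class.append(0.0)
            continue
        true_positives = np.logical_and(predicted_as_label, y_true == label).sum()
        per_class.append(true_positives / float(n_predicted))
    return float(np.mean(per_class))


# Made-up labels for three persons.
y_true = [0, 0, 1, 1, 2, 2]
y_pred = [0, 1, 1, 1, 2, 0]
print(macro_precision(y_true, y_pred))
```

Here person 0 has precision 1/2, person 1 has 2/3 and person 2 has 1, so the macro average is 13/18.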

There is a large number of metrics available, feel free to choose another one:

https://github.com/scikit-learn/scikit-learn/blob/master/sklearn/metrics/metrics.py

So let's put all this into a script!

validation.py

# Author: Philipp Wagner <bytefish@gmx.de>
# Released to public domain under terms of the BSD Simplified license.
#
# Redistribution and use in source and binary forms, with or without
# modification, are permitted provided that the following conditions are met:
#   * Redistributions of source code must retain the above copyright
#     notice, this list of conditions and the following disclaimer.
#   * Redistributions in binary form must reproduce the above copyright
#     notice, this list of conditions and the following disclaimer in the
#     documentation and/or other materials provided with the distribution.
#   * Neither the name of the organization nor the names of its contributors
#     may be used to endorse or promote products derived from this software
#     without specific prior written permission.
#
#   See <http://www.opensource.org/licenses/bsd-license>

import os
import sys
import cv2
import numpy as np

from sklearn import cross_validation as cval
from sklearn.base import BaseEstimator
from sklearn.metrics import precision_score

def read_images(path, sz=None):
    """Reads the images in a given folder, resizes images on the fly if size is given.

    Args:
        path: Path to a folder with subfolders representing the subjects (persons).
        sz: A tuple (width, height); if given, every image is resized to this size.

    Returns:
        A list [X,y]

            X: The images, which is a Python list of numpy arrays.
            y: The corresponding labels (the unique number of the subject, person) in a Python list.
    """
    c = 0
    X,y = [], []
    for dirname, dirnames, filenames in os.walk(path):
        for subdirname in dirnames:
            subject_path = os.path.join(dirname, subdirname)
            for filename in os.listdir(subject_path):
                try:
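                    im = cv2.imread(os.path.join(subject_path, filename), cv2.IMREAD_GRAYSCALE)
                    # cv2.imread returns None instead of raising for unreadable files
                    if im is None:
                        print("Skipping unreadable file: {0}".format(filename))
                        continue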
                    # resize to given size (if given)
                    if (sz is not None):
                        im = cv2.resize(im, sz)
                    X.append(np.asarray(im, dtype=np.uint8))
                    y.append(c)
                except IOError as e:
                    print("I/O error({0}): {1}".format(e.errno, e.strerror))
                except:
                    print("Unexpected error: {0}".format(sys.exc_info()[0]))
                    raise
            c = c+1
    return [X,y]

class FaceRecognizerModel(BaseEstimator):

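    def __init__(self):
        # OpenCV 2.4 API; from OpenCV 3 on the recognizers live in the contrib module cv2.face
        self.model = cv2.createFisherFaceRecognizer()

    def fit(self, X, y):
        self.model.train(X,y)
        # scikit-learn convention: fit returns the estimator itself
        return self

    def predict(self, T):
        # len() works for Python lists as well as numpy arrays
        return [self.model.predict(T[i]) for i in range(0, len(T))]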

if __name__ == "__main__":
    # You'll need at least some images to perform the validation on:
    if len(sys.argv) < 2:
        print("USAGE: validation.py </path/to/images>")
        sys.exit()
    # Read the images and corresponding labels into X and y.
    [X,y] = read_images(sys.argv[1])
    # Convert labels to 32bit integers. This is a workaround for 64bit machines,
    # because the labels will be truncated otherwise. This is fixed in recent OpenCV
    # revisions already, I just leave it here for people on older revisions.
    #
    # Thanks to Leo Dirac for reporting:
    y = np.asarray(y, dtype=np.int32)
    # Then we create a 10-fold cross validation iterator:
    cv = cval.StratifiedKFold(y, 10)
    # Now we'll create a classifier, note we wrap it up in the 
    # FaceRecognizerModel we have defined in this file. This is 
    # done, so we can use it in the awesome scikit-learn library:
    estimator = FaceRecognizerModel()
    # And getting the precision_scores is then as easy as writing:
    precision_scores = cval.cross_val_score(estimator, X, y, score_func=precision_score, cv=cv)
    # Let's print them:
    print("Precision Scores:")
    print(precision_scores)

Running the script

The script above prints the precision scores for the Fisherfaces method. You simply need to call the script with the image folder:

philipp@mango:~/src/python$ python validation.py /home/philipp/facerec/data/at

Precision Scores:
[ 1.          0.85        0.925       0.9625      1.          0.9625
  0.8875      0.93333333  0.9625      0.925     ]
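
The question also asks how long it takes to get a recognition result. A rough way to measure that is to time the predict calls themselves. The sketch below is a generic helper with a stand-in predictor so that it runs without OpenCV. The helper name is made up, and passing the predict method of a trained recognizer in place of the lambda is an assumption about how you would use it:

```python
import time


def time_predictions(predict, samples):
    """Call predict(sample) for each sample and return (results, seconds_per_sample)."""
    results = []
    start = time.perf_counter()
    for sample in samples:
        results.append(predict(sample))
    elapsed = time.perf_counter() - start
    return results, elapsed / max(len(samples), 1)


# Stand-in predictor so the sketch runs on its own; with a trained
# recognizer you would pass its predict method instead (assumption).
results, seconds = time_predictions(lambda x: x % 2, list(range(1000)))
print("{0:.3e} s per prediction".format(seconds))
```

Averaging over many samples smooths out timer noise; time the training step separately, since it usually dominates for Eigenfaces and Fisherfaces.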

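Conclusion

The conclusion is that using open source projects makes your life really easy! There is a lot to enhance in the example script. You probably want to add some logging, for example to see which fold you are in. But it's a start for evaluating any metric you want; just read through the scikit-learn tutorials to see how to do it and adapt it to the script above.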
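
As one example of such a metric, the false accept rate (FAR) and false reject rate (FRR) from the question can be computed once you have distance scores and a decision threshold. This is a minimal sketch under stated assumptions: you have already collected distances for comparisons of the same person ("genuine") and of different persons ("impostor"), for instance from the confidence value that FaceRecognizer.predict can return alongside the label, and smaller distances mean more similar faces. The numbers are invented for illustration.

```python
import numpy as np


def far_frr(genuine, impostor, threshold):
    """Return (FAR, FRR) for distance scores, where smaller means more similar.

    genuine:  distances of comparisons between images of the same person.
    impostor: distances of comparisons between images of different persons.
    A comparison is accepted when its distance is <= threshold.
    """
    genuine = np.asarray(genuine, dtype=float)
    impostor = np.asarray(impostor, dtype=float)
    far = float(np.mean(impostor <= threshold))  # impostors wrongly accepted
    frr = float(np.mean(genuine > threshold))    # genuine users wrongly rejected
    return far, frr


# Made-up distances, purely for illustration.
genuine = [10.0, 20.0, 30.0, 80.0]
impostor = [25.0, 60.0, 90.0, 120.0]
for t in (15.0, 50.0, 100.0):
    print(t, far_frr(genuine, impostor, t))
```

Sweeping the threshold shows the trade-off: a strict threshold lowers FAR and raises FRR, a loose one does the opposite. To keep the estimate honest, compute the distances on held-out folds of a cross-validation like the one above.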

I encourage everyone to play around with OpenCV Python and scikit-learn, because interfacing these two great projects is really, really easy, as you can see.

This article on python - Face Recognition in OpenCV Python FAR/FRR is based on a similar question we found on Stack Overflow: https://stackoverflow.com/questions/12197383/
