python - facenet 识别不正确-6ren

python - facenet 识别不正确

转载作者：太空宇宙更新时间：2023-11-03 21:50:17

我一直在研究人脸识别考勤管理系统。我从头开始构建管道，但最后，脚本在一组 10 个类中识别出错误的面孔。我使用 Tensorflow 和 Python 实现了以下管道。

捕获图像，调整大小，使用 dlib 的形状预测器对齐它们，并将它们存储在命名文件夹中，以便日后在执行识别时进行比较。
将图像腌制到 data.pickle 文件中以供以后反序列化。
使用 OpenCV 实现 MTCNN 算法检测网络摄像头捕获的帧中的人脸
将这些帧传递到 facenet 网络以创建 128 维嵌入，并相应地与 pickle 数据库中存在的嵌入进行比较。

下面是运行第 3 步和第 4 步的主文件:

from keras import backend as K
import time
from multiprocessing.dummy import Pool
K.set_image_data_format('channels_first')
import cv2
import os
import glob
import numpy as np
from numpy import genfromtxt
import tensorflow as tf
from keras.models import load_model
from fr_utils import *
from inception_blocks_v2 import *
from mtcnn.mtcnn import MTCNN
import dlib
from imutils import face_utils
import imutils
import pickle
from sklearn.neighbors import KNeighborsClassifier
from sklearn.model_selection import train_test_split


FRmodel = load_model('face-rec_Google.h5')
# detector = dlib.get_frontal_face_detector()
detector = MTCNN()
# FRmodel = faceRecoModel(input_shape=(3, 96, 96))
#
# # detector = dlib.get_frontal_face_detector()
# # predictor = dlib.shape_predictor("shape_predictor_68_face_landmarks.dat")
# def triplet_loss(y_true, y_pred, alpha = 0.3):
#     """
#     Implementation of the triplet loss as defined by formula (3)
#
#     Arguments:
#     y_pred -- python list containing three objects:
#             anchor -- the encodings for the anchor images, of shape (None, 128)
#             positive -- the encodings for the positive images, of shape (None, 128)
#             negative -- the encodings for the negative images, of shape (None, 128)
#
#     Returns:
#     loss -- real number, value of the loss
#     """
#
#     anchor, positive, negative = y_pred[0], y_pred[1], y_pred[2]
#
#     pos_dist = tf.reduce_sum(tf.square(tf.subtract(anchor, positive)), axis=-1)
#     neg_dist = tf.reduce_sum(tf.square(tf.subtract(anchor, negative)), axis=-1)
#     basic_loss = tf.add(tf.subtract(pos_dist, neg_dist), alpha)
#     loss = tf.reduce_sum(tf.maximum(basic_loss, 0.0))
#
#     return loss
#
# FRmodel.compile(optimizer = 'adam', loss = triplet_loss, metrics = ['accuracy'])
# load_weights_from_FaceNet(FRmodel)
def ret_model():
    return FRmodel

def prepare_database():
    pickle_in = open("data.pickle","rb")
    database =  pickle.load(pickle_in)
    return database

def unpickle_something(pickle_file):
    pickle_in = open(pickle_file,"rb")
    unpickled_file =  pickle.load(pickle_in)
    return unpickled_file


def webcam_face_recognizer(database):

    cv2.namedWindow("preview")
    vc = cv2.VideoCapture(0)

    while vc.isOpened():
        ret, frame = vc.read()
        img_rgb = cv2.cvtColor(frame,cv2.COLOR_BGR2RGB)
        img = frame
        # We do not want to detect a new identity while the program is in the process of identifying another person
        img = process_frame(img,img)

        cv2.imshow("Preview", img)
        cv2.waitKey(1)

    vc.release()

def process_frame(img, frame):
    """
    Determine whether the current frame contains the faces of people from our database
    """
    # rects = detector(img)
    rects = detector.detect_faces(img)
    # Loop through all the faces detected and determine whether or not they are in the database
    identities = []
    for (i,rect) in enumerate(rects):
        (x,y,w,h) = rect['box'][0],rect['box'][1],rect['box'][2],rect['box'][3]
        img = cv2.rectangle(frame,(x, y),(x+w, y+h),(255,0,0),2)

        identity = find_identity(frame, x-50, y-50, x+w+50, y+h+50)
        cv2.putText(img, identity,(10,500), cv2.FONT_HERSHEY_SIMPLEX , 4,(255,255,255),2,cv2.LINE_AA)

        if identity is not None:
            identities.append(identity)

    if identities != []:
        cv2.imwrite('example.png',img)

    return img

def find_identity(frame, x,y,w,h):
    """
    Determine whether the face contained within the bounding box exists in our database

    x1,y1_____________
    |                 |
    |                 |
    |_________________x2,y2

    """
    height, width, channels = frame.shape
    # The padding is necessary since the OpenCV face detector creates the bounding box around the face and not the head
    part_image = frame[y:y+h, x:x+w]

    return who_is_it(part_image, database, FRmodel)

def who_is_it(image, database, model):

    encoding = img_to_encoding(image, model)


    min_dist = 100
    # Loop over the database dictionary's names and encodings.
    for (name, db_enc) in database.items():

        # Compute L2 distance between the target "encoding" and the current "emb" from the database.
        dist = np.linalg.norm(db_enc.flatten() - encoding.flatten())

        print('distance for %s is %s' %(name, dist))

        # If this distance is less than the min_dist, then set min_dist to dist, and identity to name
        if dist < min_dist:
            min_dist = dist
            identity = name

        if min_dist >0.1:
            print('Unknown person')
        else:
            print(identity)
    return identity


if __name__ == "__main__":
    database = prepare_database()
    webcam_face_recognizer(database)

我在这里做错了什么？这里的FRmodel是facenet训练好的模型

最佳答案

几点:

我没有看到输入网络的输入人脸图像的大小调整、对齐和美白。
您不能将 50 的固定边距添加到可变大小的面部。必须进行缩放，使人脸区域几乎填满每个输入图像中的相同区域。
我不确定您使用的型号，但如果您使用的是 FaceNet ，你接受的匹配阈值 0.1 似乎很低。它不会接受任何匹配项，除非它是完全相同的图像(距离为 0.0)，或者与图库图像的差异非常小。

关于python - facenet 识别不正确，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/52963149/

文章推荐： c# - ASP.NET 错误处理问题

文章推荐： php - 输入不显示完整值

人脸检测之Retina FaceNet
retinaface 人脸检测算法甜点最近一直了解人脸检测的算法，所以也尝试学多人脸检测框架。所以这里将拿出来和大家分享一下 Retinaface 与普通的目标检测算法类似，在
python - FaceNet 嵌入的余弦距离问题
我已经在论坛上询问过这个问题，但这似乎足够小众，有自己的问题我从 here 在线获取了带有余弦距离的片段。但输出似乎不正确... 这是我的代码(注意:我从 np.matmul 更改为 np.dot
python - facenet 识别不正确
我一直在研究人脸识别考勤管理系统。我从头开始构建管道，但最后，脚本在一组 10 个类中识别出错误的面孔。我使用 Tensorflow 和 Python 实现了以下管道。捕获图像，调整大小，使用 dl
machine-learning - FaceNet 傻瓜版
FaceNet 算法(在 this 文章中进行了描述)使用卷积神经网络来表示 128 维欧几里得空间中的图像。阅读文章时我不明白: 损失函数对卷积网络有何影响(在普通网络中，为了最小化损失，权重略有
python - 使用 FaceNet 创建三元组损失模型时遇到问题
我正在尝试使用 FacenetModel 实现三元组损失模型。我使用了 coursera 作业中提供的 Facenet 实现。每当我编译模型时，我都会收到此错误: ValueError: No da
python - 将 FaceNet 模型转换为 ONNX 格式时出错
系统信息操作系统平台和发行版:Linux Ubuntu 19.10 Tensorflow 版本:1.15 Python 版本:3.7 问题我从这个 page 下载了一个 FaceNet 的 ten
c++ - 我如何使用 Tensorflow c++ 来实现此代码 (FaceNet)？
FaceNet .大卫·桑德伯格 FaceNet使用python实现代码: #load graph with tf.gfile.GFile(frozen_graph_filename, "rb") a
neural-network - 使用 keras 进行 facenet 三元组损失
我正在尝试使用 Tensorflow 后端在 Keras 中实现 facenet，但我在三元组丢失方面遇到了一些问题。我用 3*n 个图像调用 fit 函数，然后我定义我的自定义损失函数如下: de
python - 将 Facenet 模型 .pb 文件转换为 TFLITE 格式时出错
我正在尝试基于我从 David Sandbergs Github 获得的 Inception ResNet 转换预训练的卡住 .pb使用以下命令在 Ubuntu 上使用 Tensorflow Lite

太空宇宙

个人简介

我是一名优秀的程序员,十分优秀！

作者热门文章

滴滴打车优惠券免费领取

全站热门文章

首页

博学

6Ren·AI

商城

python - facenet 识别不正确