python - 以函数为核的卷积-6ren

python - 以函数为核的卷积

转载作者：太空宇宙更新时间：2023-11-04 02:16:57

26

4

我需要试验卷积，其中内核不是常数，而是取决于输入的补丁(我将其称为过滤器以区分)并由函数 f 计算。

所以我需要的是:

conv2d :: 
    R^(batch x height x width x in_channel) x 
    f : 
        R ^ (filter_height x filter_width x in_channels) 
        -> R ^ (filter_height x filter_width x in_channels x out_channels)
    -> out

我不想要的是只为卷积中的每个补丁评估 f。 f 只为输入的每个补丁生成过滤器，然后将过滤器应用于补丁，这对我的应用程序至关重要。

这是否可以通过 tf 以某种方式破解，如果不能，是否有合理的方法来扩展 tf 以提供我所需的功能？我从未写过 CUDA，但我并不回避它。

最佳答案

编辑:

正如 OP 所建议的，考虑到操作的复杂性，使用 tf.map_fn 可能会变得更容易处理:

import tensorflow as tf
import numpy as np

mode_same = True  # True to make output same size as input
BATCH_SIZE = 10
HEIGHT = 100
WIDTH = 200
IN_CHANNELS = 3
FILTER_HEIGHT = 10
FILTER_WIDTH = 7
OUT_CHANNELS = 5

def make_img_filters(img):
    # Dummy filters function
    img_shape = tf.shape(img)
    img_height = img_shape[0]
    img_width = img_shape[1]
    img_ch = img_shape[2]
    filters_shape = (img_height, img_width, FILTER_HEIGHT, FILTER_WIDTH, img_ch, OUT_CHANNELS)
    return tf.zeros(filters_shape, dtype=img.dtype)

def filter_img(img, mode_same=True):
    img_filters = make_img_filters(img)
    # Shapes
    img_shape = tf.shape(img)
    img_height = img_shape[0]
    img_width = img_shape[1]
    filters_shape = tf.shape(img_filters)
    filter_height = filters_shape[2]
    filter_width = filters_shape[3]
    # Image margins to pad or crop
    margin_bottom = filter_height // 2
    margin_top = tf.maximum(filter_height - margin_bottom - 1, 0)
    margin_right = filter_width // 2
    margin_left = tf.maximum(filter_width - margin_right - 1, 0)
    # Pad or crop depending on "same" or "valid" mode
    img_pad = img
    img_filters_crop = img_filters
    if mode_same:
        img_pad = tf.pad(img, [[margin_top, margin_bottom], [margin_left, margin_right], [0, 0]])
        img_height += margin_top + margin_bottom
        img_width += margin_left + margin_right
    else:
        img_filters_crop = img_filters[margin_top:img_height - margin_bottom, margin_left:img_width - margin_right]
    # Make tensor of image patches
    # This could be replaced with tf.while_loop and tf.TensorArray
    img_extend = tf.stack([img_pad[i:(img_height - (FILTER_HEIGHT - i - 1))] for i in range(FILTER_HEIGHT)], axis=2)
    img_extend = tf.stack([img_extend[:, i:(img_width - (FILTER_WIDTH - i - 1))] for i in range(FILTER_WIDTH)], axis=3)
    # Compute "convolution" result
    img_result = tf.einsum('hwpqc,hwpqcd->hwd', img_extend, img_filters_crop)
    # Or with multiplication and reduction
    img_result = tf.reduce_sum(img_extend[..., tf.newaxis] * img_filters_crop, axis=(2, 3))
    return img_result

# Input
imgs = tf.placeholder(tf.float32, [None, HEIGHT, WIDTH, IN_CHANNELS])
filters = tf.placeholder(tf.float32, [None, HEIGHT, WIDTH, FILTER_HEIGHT, FILTER_WIDTH, IN_CHANNELS, OUT_CHANNELS])
# Compute "convolution" with mapping
result = tf.map_fn(lambda img: filter_img(img, mode_same), imgs)

# Test
with tf.Session() as sess:
    imgs_random = np.random.random((BATCH_SIZE, HEIGHT, WIDTH, IN_CHANNELS))
    filters_random = np.random.random((BATCH_SIZE, HEIGHT, WIDTH, FILTER_HEIGHT, FILTER_WIDTH, IN_CHANNELS, OUT_CHANNELS))
    value = sess.run(result, feed_dict={imgs: imgs_random, filters: filters_random})
    print(value.shape)
    # (10, 91, 194, 5) with mode_same=False, (10, 100, 200, 5) with mode_same=True

如果我对你的理解正确，这应该可以完成你想要的操作。它可能不是最有效的方法，但我不确定使用标准 TensorFlow 操作是否可以更快地完成它。

import tensorflow as tf
import numpy as np

mode_same = False  # True to make output same size as input
BATCH_SIZE = 10
HEIGHT = 100
WIDTH = 200
IN_CHANNELS = 3
FILTER_HEIGHT = 10
FILTER_WIDTH = 7
OUT_CHANNELS = 5
# Input
imgs = tf.placeholder(tf.float32, [None, HEIGHT, WIDTH, IN_CHANNELS])
filters = tf.placeholder(tf.float32, [None, HEIGHT, WIDTH, FILTER_HEIGHT, FILTER_WIDTH, IN_CHANNELS, OUT_CHANNELS])
# Shapes
imgs_shape = tf.shape(imgs)
img_height = imgs_shape[1]
img_width = imgs_shape[2]
filters_shape = tf.shape(filters)
filter_height = filters_shape[3]
filter_width = filters_shape[4]
# Image margins to pad or crop
margin_bottom = filter_height // 2
margin_top = tf.maximum(filter_height - margin_bottom - 1, 0)
margin_right = filter_width // 2
margin_left = tf.maximum(filter_width - margin_right - 1, 0)
# Pad or crop depending on "same" or "valid" mode
imgs_pad = imgs
filters_crop = filters
if mode_same:
    imgs_pad = tf.pad(imgs, [[0, 0], [margin_top, margin_bottom], [margin_left, margin_right], [0, 0]])
    img_height += margin_top + margin_bottom
    img_width += margin_left + margin_right
else:
    filters_crop = filters[:, margin_top:img_height - margin_bottom, margin_left:img_width - margin_right]
# Make tensor of image patches
# This could be replaced with tf.while_loop and tf.TensorArray
imgs_extend = tf.stack([imgs_pad[:, i:(img_height - (FILTER_HEIGHT - i - 1))] for i in range(FILTER_HEIGHT)], axis=3)
imgs_extend = tf.stack([imgs_extend[:, :, i:(img_width - (FILTER_WIDTH - i - 1))] for i in range(FILTER_WIDTH)], axis=4)
# Compute "convolution" result
result = tf.einsum('ahwpqc,ahwpqcd->ahwd', imgs_extend, filters_crop)
# Test
with tf.Session() as sess:
    imgs_random = np.random.random((BATCH_SIZE, HEIGHT, WIDTH, IN_CHANNELS))
    filters_random = np.random.random((BATCH_SIZE, HEIGHT, WIDTH, FILTER_HEIGHT, FILTER_WIDTH, IN_CHANNELS, OUT_CHANNELS))
    value = sess.run(result, feed_dict={imgs: imgs_random, filters: filters_random})
    print(value.shape)
    # (10, 91, 194, 5) with mode_same=False, (10, 100, 200, 5) with mode_same=True

关于python - 以函数为核的卷积，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/52445630/

26

4

0

文章推荐： c - 如果点在三角形内部(当点在三角形的边界上时有帮助)

文章推荐： node.js - PM2 是否自动检测模块系统的应用程序？

文章推荐： c - Xcode 错误 : "EXC_BAD_ACCESS"

文章推荐： javascript - JS 获取宽度不按预期工作

Scala:具有多个来源和异构类型的交叉(笛卡尔)积
我正在尝试构建不同(但每个同质)类型的可遍历项的多个交叉产品。所需的返回类型是元组的可遍历对象，其类型与输入可遍历对象中的类型相匹配。例如: List(1, 2, 3) cross Seq("a",
java - 求两个矩阵的 boolean 积
import java.util.Scanner; public class BooleanProduct { public static void main(String[] args) {
c++ - 数字的最大 K 积
任务 - 数字的最大 K 积时间限制:1 内存限制:64 M 给定一个整数序列 N(1 ≤ N ≤ 10 月，| A i | ≤ 2.10 9)和数量 K(1 ≤ K ≤ N)。找出乘积最大的 K
c++ - 快速单精度矩阵乘以 vector 积
考虑一个大小为 48x16 的 float 矩阵 A 和一个大小为 1x48 的 float vector b。请建议一种在常见桌面处理器 (i5/i7) 上尽可能快地计算 b×A 的方法。背景。
python - 获取元组或列表的(乘法)积？
假设我有一个 class Rectangle(object): def __init__(self, len
c++ - 矩阵 A 的 boolean 积
设 A 为 3x3 阶矩阵。判断矩阵A的 boolean 积可以组成多少个不同的矩阵。这是我想出的: #include int main() { int matri
python - 如何在 NumPy 计算中避免 Kronecker 积
背景生成随机权重列表后: sizes = [784,30,10] weights = [np.random.randn(y, x) for x, y in zip(sizes[:-1],sizes[
python - 具有单位矩阵和正则矩阵的高效 Kronecker 积 - NumPy/Python
我正在开发一个 python 项目并使用 numpy。我经常需要通过单位矩阵计算矩阵的克罗内克积。这些是我代码中的一个相当大的瓶颈，所以我想优化它们。我必须服用两种产品。第一个是: np.kron(n
c++ - Boost uBLAS 矩阵/vector 积
有人可以提供一个例子说明如何使用 uBLAS 产品来乘法吗？或者，如果有更好的 C++ 矩阵库，您可以推荐我也欢迎。这正在变成一个令人头疼的问题。这是我的代码: vector myVec(scala
javascript - 使用 Javascript 对话框获取和、积、差和商 + 显示结果
我正在尝试开发一个Javascript程序，它会提示用户输入两个整数，然后显示这两个整数的和、乘积、差和商。现在它只显示总和。我实际上不知道乘法、减法和除法命令是否正在执行。这是 jsfiddle 的
java - 如何使用 la4j 计算 3D vector 积？
如何使用 la4j 计算 vector (叉)积？ vector 乘积为接受两个 vector 并返回 vector 。但是他们有scalar product , product of all e
c++ - 使用 dgemm/dgemv 的矩阵 vector 积
在 C++ 中使用 Lapack 让我有点头疼。我发现为 fortran 定义的函数有点古怪，所以我尝试在 C++ 上创建一些函数，以便我更容易阅读正在发生的事情。无论如何，我没有让矩阵 vecto
objective-c - Apple Metal 逐元素矩阵乘法(Hadamard 积)
是否可以使用 Apple 的 Metal Performance Shaders 执行 Hadamard 产品？我看到可以使用 this 执行普通矩阵乘法，但我特别在寻找逐元素乘法，或者一种构造乘法的
c - 使用 open mp 的慢速稀疏矩阵 vector 积 (CSR)
我正在尝试使用 open mp 加速稀疏矩阵 vector 乘积，代码如下: void zAx(double * z, double * data, long * colind, long * row
c++ - 在 OpenCv 中计算 cv::Mat 的外(张量)积
有没有一种方法可以使用 cv::Mat OpenCV 中的数据结构？我检查过 the documentation并且没有内置功能。但是我在尝试将标准矩阵乘法表达式 (*) 与 cv::Mat 类型的

首页

博学

6Ren·AI

商城

python - 以函数为核的卷积