python - 将图像切成重叠的补丁并将补丁合并到图像的快速方法-6ren

python - 将图像切成重叠的补丁并将补丁合并到图像的快速方法

转载作者：IT老高更新时间：2023-10-28 21:02:54

39

4

尝试将大小为 100x100 的灰度图像分割成大小为 39x39 的重叠 block ，步幅大小为 1。这意味着下一个从右侧/或下方开始一个像素的 block 仅与另一列/或行中的上一个补丁。

代码粗略概述:首先计算每个补丁的索引，以便能够从图像构造二维 block 数组，并能够从 block 构造图像:

patches = imgFlat[ind]

'patches' 是一个二维数组，每列包含一个向量形式的补丁。

处理这些补丁，每个补丁单独并随后再次合并到图像中，并使用预先计算的索引。

img = np.sum(patchesWithColFlat[ind],axis=2)

由于补丁重叠，最后需要将 img 与预先计算的权重相乘:

imgOut = weights*imgOut

我的代码真的很慢，速度是一个关键问题，因为这应该在 ca. 10^8 个补丁。

函数 get_indices_for_un_patchify 和 weights_unpatchify 可以预先计算一次，因此速度只是 patchify 和 unpatchify 的问题。

感谢任何提示。

卡洛斯

import numpy as np
import scipy
import collections
import random as rand


def get_indices_for_un_patchify(sImg,sP,step):
    ''' creates indices for fast patchifying and unpatchifying

    INPUTS:
      sx    image size
      sp    patch size
      step  offset between two patches (default == [1,1])

      OUTPUTS:
       patchInd             collection with indices
       patchInd.img2patch   patchifying indices
                            patch = img(patchInd.img2patch);
       patchInd.patch2img   unpatchifying indices

    NOTE: * for unpatchifying necessary to add a 0 column to the patch matrix
          * matrices are constructed row by row, as normally there are less rows than columns in the 
            patchMtx
     '''
    lImg = np.prod(sImg)
    indImg = np.reshape(range(lImg), sImg)

    # no. of patches which fit into the image
    sB = (sImg - sP + step) / step

    lb              = np.prod(sB)
    lp              = np.prod(sP)
    indImg2Patch    = np.zeros([lp, lb])
    indPatch        = np.reshape(range(lp*lb), [lp, lb])

    indPatch2Img = np.ones([sImg[0],sImg[1],lp])*(lp*lb+1)

    # default value should be last column
    iRow   = 0;
    for jCol in range(sP[1]):
        for jRow in range(sP[0]):
            tmp1 = np.array(range(0, sImg[0]-sP[0]+1, step[0]))
            tmp2 = np.array(range(0, sImg[1]-sP[1]+1, step[1]))
            sel1                    = jRow  + tmp1
            sel2                    = jCol  + tmp2
            tmpIndImg2Patch = indImg[sel1,:]          
            # do not know how to combine following 2 lines in python
            tmpIndImg2Patch = tmpIndImg2Patch[:,sel2]
            indImg2Patch[iRow, :]   = tmpIndImg2Patch.flatten()

            # next line not nice, but do not know how to implement it better
            indPatch2Img[min(sel1):max(sel1)+1, min(sel2):max(sel2)+1, iRow] = np.reshape(indPatch[iRow, :, np.newaxis], sB)
            iRow                    += 1

    pInd = collections.namedtuple
    pInd.patch2img = indPatch2Img
    pInd.img2patch = indImg2Patch

    return pInd

def weights_unpatchify(sImg,pInd):
    weights = 1./unpatchify(patchify(np.ones(sImg), pInd), pInd)
    return weights

# @profile
def patchify(img,pInd):
    imgFlat = img.flat
   # imgFlat = img.flatten()
    ind = pInd.img2patch.tolist()
    patches = imgFlat[ind]

    return patches

# @profile
def unpatchify(patches,pInd):
    # add a row of zeros to the patches matrix    
    h,w = patches.shape
    patchesWithCol = np.zeros([h+1,w])
    patchesWithCol[:-1,:] = patches
    patchesWithColFlat = patchesWithCol.flat
   #  patchesWithColFlat = patchesWithCol.flatten()
    ind = pInd.patch2img.tolist()
    img = np.sum(patchesWithColFlat[ind],axis=2)
    return img

我在这里调用这些函数，例如随机图片

if __name__ =='__main__':
    img = np.random.randint(255,size=[100,100])
    sImg = img.shape
    sP = np.array([39,39])  # size of patch
    step = np.array([1,1])  # sliding window step size
    pInd = get_indices_for_un_patchify(sImg,sP,step)
    patches = patchify(img,pInd)
    imgOut = unpatchify(patches,pInd)
    weights = weights_unpatchify(sImg,pInd)
    imgOut = weights*imgOut

    print 'Difference of img and imgOut = %.7f' %sum(img.flatten() - imgOut.flatten())

最佳答案

“修补”数组的一种有效方法，即获取原始数组的窗口数组是使用自定义 strides 创建一个 View 。 , 跳转到下一个元素的字节数。将 numpy 数组视为(美化的)内存块可能会有所帮助，然后 strides 是将索引映射到内存地址的一种方式。

例如，在

a = np.arange(10).reshape(2, 5)

a.itemsize 等于 4(即每个元素 4 个字节或 32 位)并且 a.strides 是 (20, 4) (5 个元素，1 个元素)，因此 a[1,2] 指的是 1*20 + 2*4 字节(或 1 *5 + 2 个元素)在第一个元素之后:

0 1 2 3 4
5 6 7 x x

其实元素是一个接一个的放到内存中的，0 1 2 3 4 5 6 7 x x但是strides让我们把它索引为一个二维数组。

基于这个概念，我们可以重写 patchify 如下

def patchify(img, patch_shape):
    img = np.ascontiguousarray(img)  # won't make a copy if not needed
    X, Y = img.shape
    x, y = patch_shape
    shape = ((X-x+1), (Y-y+1), x, y) # number of patches, patch_shape
    # The right strides can be thought by:
    # 1) Thinking of `img` as a chunk of memory in C order
    # 2) Asking how many items through that chunk of memory are needed when indices
    #    i,j,k,l are incremented by one
    strides = img.itemsize*np.array([Y, 1, Y, 1])
    return np.lib.stride_tricks.as_strided(img, shape=shape, strides=strides)

这个函数返回一个img的 View ，所以没有分配内存，它只运行了几十微秒。输出的形状并不是你想要的，实际上它必须被复制才能得到那个形状。

在处理比基本数组大得多的数组 View 时必须小心，因为操作可能会触发需要分配大量内存的副本。在你的情况下，由于数组不是太大并且没有那么多补丁，应该没问题。

最后，我们可以稍微梳理一下补丁数组:

patches = patchify(img, (39,39))
contiguous_patches = np.ascontiguousarray(patches)
contiguous_patches.shape = (-1, 39**2)

这不会重现您的 patchify 函数的输出，因为您是按 Fortran 顺序开发补丁的。我建议你改用这个，因为

它会在以后导致更自然的索引(即，第一个补丁是补丁[0]，而不是您的解决方案的补丁[:, 0])。
在 numpy 中使用 C 排序也更容易，因为您需要更少的输入(避免使用 order='F' 之类的东西，数组默认按 C 顺序创建...)。

“提示”如果你坚持:strides = img.itemsize * np.array([1, Y, Y, 1])，使用 .reshape(...,在 contiguous_patches 上 order='F') 最后转置 .T

关于python - 将图像切成重叠的补丁并将补丁合并到图像的快速方法，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/16774148/

39

4

0

文章推荐： python - 如何使用python和mechanize登录网站

文章推荐： java - Apache Wicket 的缺点是什么？

文章推荐： java - 何时使用 pathParams 或 QueryParams

Python通过特定方程列出交叉识别(重叠？)
我对具有 2 个轴的数据有交叉识别问题，例如 A = array([['x0', 'y0', 'data0', 'data0'], ['x0', 'y0', 'data0', '
Haskell 重叠/不连贯的实例
我知道这是代码有点傻，但有人可以解释为什么 isList [42]返回 True而isList2 [42]打印 False ，以及如何防止这种情况？我想更好地理解一些更晦涩的 GHC 类型扩展，我认为
c - Memmove 重叠
我正在使用memmove()，但目标似乎正在覆盖源，或者也许我不明白覆盖是什么。我有一个 char 数组(目标)，然后是一个指向目标的指针，该指针位于 vector 内部。 char destinat
flash - Flash中的流音频播放多次，重叠
以下AS3代码有时会导致音频多次播放，就像疯狂的回声一样，几乎同时播放。通常使用该URL都可以，但是当我使用https://soundcloud.com url时，它总是会发疯。在极少数情况下，我认为
java - 线性布局不可见/重叠
我正在尝试在 android 2.2 中实现类似操作栏的东西。这是我的 main.xml
ios图表框架值(value)重叠
如何避免第一个值的重叠问题而且，我怎样才能看到最后一个被剪裁的值？最佳答案我认为您在修改轴上的样式和调整视口(viewport)之间有几种选择。我会尝试: 禁用左轴，启用右轴 chart.le
ios - UIScrollView 重叠
我正在构建一个简单的应用程序，您可以在其中使用纸娃娃之类的工具来描述您的外观。 Check out this image.计划是有 4 个水平 ScrollView :第一个用于发型，第二个用于面部毛
android - 重叠 ScrollView
我有一个问题...我在绝对布局中有两个 ScrollView 。换句话说，它们是全屏的并且相互重叠上面的scrollview是水平滚动的，下面的是垂直滚动的scrollview。当我水平滚动时，我
几个旋转屏幕后Android fragment 重叠
我看了一些类似的问题，但我不太明白在我的层次结构中我应该做什么？我有用于屏幕底部的标签菜单和对于其他将创建的 fragment 。我有 9 个标签菜单，每个都是 fragment 。一
Android fragment 重叠
在我的 Android 应用程序中，我有一个编辑文本和一个按钮，单击该按钮会向我的主要 Activity 添加一个 fragment ，其中包含在我的编辑文本中写入的消息。问题是，当我更改消息并单击按
ios - 分段控件的标题不适合，重叠
在我的分段控件中，有时标题比其段宽。我怎样才能让它截断？假设第 1 段的标题是 Text overlaps，第 2 段的名称是 ok。我希望它看起来如何: [Text ov...| ok
iphone - UITableViewCell 重叠
我想创建一个带有重叠单元格的 uitableview，如下图所示。问题是，即使我为单元格的内容 View 设置 clipsToBounds = NO，单元格假标题(例如，将与前一个单元格重叠的西类牙语
CSS 重叠 div
有了这个CSS .addProblemClass{ width:300px; height:300px; /*width:25%; height:40%;*/
javascript - 离开窗口选项卡时图像堆叠(重叠)
我有跨窗口移动的图像(2 行)，当我离开页面选项卡时，然后返回它，所有图像都相互堆叠。 JS代码(记入jfriend00) function startMoving(img) { va
javascript - SetTimeout 重叠？
这是我的一段代码。图像在 23 毫秒后正常可见，但永远不会像第二行所示那样返回隐藏状态。如果我将其从 17 毫秒更改为大于 23 毫秒的值，它就会起作用。反之亦然，如果我将第一行更改为 16 毫秒，它
javascript - javascript中for循环中的碰撞/重叠
我正在可汗学院为学校项目编写一款太空入侵者游戏，但我不知道如何在子弹和外星人之间进行碰撞，然后摆脱子弹所碰撞的外星人。这是非常基本的 JS，尽管我尝试过，但我不太明白如何将有关该主题的其他答案放入我的
iOS UITableViewCell 重叠
当我尝试重新加载 tableView 的数据时出现奇怪的重叠，导致单元格的高度发生变化(使用 UITableViewAutomaticDimension)，然后内容与上面的单元格重叠，无法弄清楚怎么做
html - 标题和部分相互合并/重叠
我是一个新手，如果这是一个愚蠢的问题，请原谅我。我想有一个部分与标题分开，但发生了两种情况: (1) 当我把在下面，它们相互重叠，如下所示: Section overlapping header
css - Div 重叠
我正在尝试创建两个那是重叠的。唯一的问题是第二个在第一个的前面它必须是相反的。我尝试设置第一个的 z-index至 1但它仍然不起作用。这是我的代码: #content{ backgrou
CSS - 重叠 - 有效
是否有重叠 2 个 div 的有效方法。我有以下内容，但无法让它们重叠。 #top-border{width:100%; height:60px; background:url(image.jpg)

首页

博学

6Ren·AI

商城

python - 将图像切成重叠的补丁并将补丁合并到图像的快速方法