Algorithm for drawing outline of image shape in BezierPath (Canny edge detector)(BezierPath(Canny边缘检测器)中的图像形状轮廓提取算法)-6ren

Algorithm for drawing outline of image shape in BezierPath (Canny edge detector)(BezierPath(Canny边缘检测器)中的图像形状轮廓提取算法)

转载作者：bug小助手更新时间：2023-10-25 15:17:49

I'm trying to draw the outline of an image using BezierPath based on the transparency of each pixel.

我正在尝试根据每个像素的透明度使用BezierPath绘制图像的轮廓。

However, I'm having an issue with the logic; my logic also draws the internal outlines.

然而，我对逻辑有一个问题；我的逻辑也画出了内部轮廓。

I only want to draw the external outline with BezierPath.
What I get (the first shape is the original image, the second is the bezierPath):

我只想用BezierPath绘制外部轮廓。我得到的(第一个形状是原始图像，第二个是bezierPath)：

My code:

我的代码是：

func processImage(_ image: UIImage) -> UIBezierPath? {
   guard let cgImage = image.cgImage else {
       print("Error: Couldn't get CGImage from UIImage")
       return nil
   }

   let width = cgImage.width
   let height = cgImage.height

   // Create a context to perform image processing
   let colorSpace = CGColorSpaceCreateDeviceGray()
   let context = CGContext(data: nil, width: width, height: height, bitsPerComponent: 8, bytesPerRow: width, space: colorSpace, bitmapInfo: CGImageAlphaInfo.none.rawValue)

   guard let context = context else {
       print("Error: Couldn't create CGContext")
       return nil
   }

   // Draw the image into the context
   context.draw(cgImage, in: CGRect(x: 0, y: 0, width: width, height: height))

   // Perform Canny edge detection
   guard let edgeImage = context.makeImage() else {
       print("Error: Couldn't create edge image")
       return nil
   }

   // Create a bezier path for the outline of the shape
   let bezierPath = UIBezierPath()
   
   // Iterate over the image pixels to find the edges
   for y in 0..<height {
       for x in 0..<width {
           let pixel = edgeImage.pixel(x: x, y: y)
           
           if pixel > 0 {
               let leftPixel = (x > 0) ? edgeImage.pixel(x: x - 1, y: y) : 0
               let rightPixel = (x < width - 1) ? edgeImage.pixel(x: x + 1, y: y) : 0
               let abovePixel = (y > 0) ? edgeImage.pixel(x: x, y: y - 1) : 0
               let belowPixel = (y < height - 1) ? edgeImage.pixel(x: x, y: y + 1) : 0
               
               if leftPixel == 0 || rightPixel == 0 || abovePixel == 0 || belowPixel == 0 {
                   bezierPath.move(to: CGPoint(x: CGFloat(x), y: CGFloat(y)))
                   bezierPath.addLine(to: CGPoint(x: CGFloat(x) + 1.0, y: CGFloat(y) + 1.0))
               }
           }
       }
   }

   return bezierPath
}

extension CGImage {
    func pixel(x: Int, y: Int) -> UInt8 {
        let data = self.dataProvider!.data
        let pointer = CFDataGetBytePtr(data)
        let bytesPerRow = self.bytesPerRow
        
        let pixelInfo = (bytesPerRow * y) + x
        return pointer![pixelInfo]
    }
}

更多回答

You want a flood fill algorithm, starting outside the image to find everything "not" in the shape. Then edge detect on that. en.wikipedia.org/wiki/Flood_fill There are many other approaches, but that one is pretty easy to implement and will work reasonably for many things as long as there are no gaps in your edge. You can also find the fist pixel of an edge as you are, and then try "walking around it" by testing nearby pixels in all directions.

您需要一个整体填充算法，从图像外部开始查找形状中的所有内容。然后对其进行边缘检测。En.wikipedia.org/wiki/Flood_Fill还有许多其他方法，但这种方法很容易实现，只要您的边缘没有缺口，它就可以很好地适用于许多事情。你也可以找到一条边的第一个像素，然后尝试通过在各个方向上测试附近的像素来“绕过它”。

Note that generating thousands of two-pixel lines can be very expensive to work with. You probably will want to simplify the final curve using something like Ramer–Douglas–Peucker (possibly before creating an actual BezierPath).

请注意，生成数千条两像素行的操作成本可能非常高。您可能希望使用类似Ramer-Douglas-Peucker的方法来简化最终曲线(可能在创建实际的BezierPath之前)。

That said, how would you want this to work if the above font were the letter P? I would assume there would be a large internal hole that you would want to treat as edges, even though you don't want to capture the other, smaller internal holes. Solving this well is likely complex, and require you to detect "features" rather than just adjacent pixels. One approach is to scan with a larger (3x3, 5x5, on that order) overlapping "block" that is considered filled in if any of its pixels are filled in. That will ignore small holes, while capturing larger holes. It's challenging.

也就是说，如果上面的字体是字母P，你希望它是如何工作的？我会假设会有一个大的内部孔，您会希望将其视为边，即使您不想捕获其他较小的内部孔。解决这口井可能很复杂，需要你检测“特征”，而不仅仅是相邻的像素。一种方法是用一个更大的(3x3，5x5，顺序为3x3，5x5)重叠的“块”进行扫描，如果它的任何像素被填充，就认为它被填充了。这将忽略小洞，同时捕获较大的洞。这很有挑战性。

Thank you for your response. I will try to implement the flood fill algorithm; it seems to be what I'm looking for. Indeed, I will probably need to simplify the paths, which I will do in step 2. And to address the example of the letter P, I'm not interested in the hole inside for my goal, only the external outline. I will keep you updated. Thank you very much!

谢谢您的回复。我将尝试实现泛洪填充算法；这似乎就是我要寻找的。事实上，我可能需要简化路径，我将在步骤2中这样做。为了解决字母P的例子，我对我的目标不是内部的洞感兴趣，只对外部轮廓感兴趣。我会随时通知你最新情况。非常感谢!

My goal is to get out the the outline as BezierPath, after more investigation I need to use another algorithm Moore-Neighbor tracing algorithm

我的目标是得到BezierPath的轮廓，经过更多的研究后，我需要使用另一种算法摩尔-邻居跟踪算法

优秀答案推荐

From the comments, you found the algorithm (Moore Neighborhood Tracing). Here's an implementation that works well for your problem. I'll comment on some improvements you might consider.

从评论中，您找到了算法(摩尔邻域跟踪)。这里有一个可以很好地解决您的问题的实现。我会评论一下你可能会考虑的一些改进。

First, you need to get the data into a buffer, with one byte per pixel. You seem to know how to do that, so I won't belabor the point. 0 should be "transparent" and non-0 should be "filled." In the literature, these are often black (1) lines on a white (0) background, so I'll use that naming.

首先，您需要将数据放入缓冲区，每个像素一个字节。你似乎知道如何做那件事，所以我就不多说了。0应该是“透明的”，非0应该是“填充的”。在文献中，这些通常是白(0)背景上的黑(1)线，所以我将使用这种命名。

The best introduction I've found (and regularly cited) is Abeer George Ghuneim's Contour Tracing site. Really useful site. I've seen some implementations of MNT that over-check some pixels. I try to follow the algorithm Abeer describes carefully to avoid that.

我找到的最好的介绍(并经常被引用)是阿比尔·乔治·古纳姆的等高线追踪网站。非常有用的网站。我见过MNT的一些实现，它们过度检查了一些像素。我试着遵循阿比尔描述的算法，以避免这种情况。

There is more testing I want to do on this code, but it handles your case.

我还想对这段代码进行更多测试，但它可以处理您的情况。

First, the algorithm operates on a Grid of Cells:

首先，该算法对单元格网格进行操作：

public struct Cell: Equatable {
    public var x: Int
    public var y: Int
}

public struct Grid: Equatable {
    public var width: Int
    public var height: Int
    public var values: [UInt8]

    public var columns: Range<Int> { 0..<width }
    public var rows: Range<Int> { 0..<height }

    // The pixels immediately outside the grid are white. 
    // Accessing beyond that is a runtime error.
    public subscript (p: Cell) -> Bool {
        if p.x == -1 || p.y == -1 || p.x == width || p.y == height { return false }
        else { return values[p.y * width + p.x] != 0 }
    }

    public init?(width: Int, height: Int, values: [UInt8]) {
        guard values.count == height * width else { return nil }
        self.height = height
        self.width = width
        self.values = values
    }
}

There is also the concept of "direction." This is in two forms: the direction from the center to one of the 8 neighbors, and the "backtrack" direction, which is the direction a cell is "entered" during the search

还有“方向”的概念。这有两种形式：从中心到8个邻域之一的方向和“回溯”方向，即在搜索过程中“进入”像元的方向

enum Direction: Equatable {
    case north, northEast, east, southEast, south, southWest, west, northWest
    
    mutating func rotateClockwise() {
        self = switch self {
        case .north: .northEast
        case .northEast: .east
        case .east: .southEast
        case .southEast: .south
        case .south: .southWest
        case .southWest: .west
        case .west: .northWest
        case .northWest: .north
        }
    }

    //
    // Given a direction from the center, this is the direction that box was entered from when
    // rotating clockwise.
    //
    // +---+---+---+
    // + ↓ + ← + ← +
    // +---+---+---+
    // + ↓ +   + ↑ +
    // +---+---+---+
    // + → + → + ↑ +
    // +---+---+---+
    func backtrackDirection() -> Direction {
        switch self {
        case .north: .west
        case .northEast: .west
        case .east: .north
        case .southEast: .north
        case .south: .east
        case .southWest: .east
        case .west: .south
        case .northWest: .south
        }
    }
}

And Cells can advance in a given direction:

细胞可以沿着给定的方向前进：

extension Cell {
    func inDirection(_ direction: Direction) -> Cell {
        switch direction {
        case .north:     Cell(x: x,     y: y - 1)
        case .northEast: Cell(x: x + 1, y: y - 1)
        case .east:      Cell(x: x + 1, y: y    )
        case .southEast: Cell(x: x + 1, y: y + 1)
        case .south:     Cell(x: x    , y: y + 1)
        case .southWest: Cell(x: x - 1, y: y + 1)
        case .west:      Cell(x: x - 1, y: y    )
        case .northWest: Cell(x: x - 1, y: y - 1)
        }
    }
}

And finally, the Moore Neighbor algorithm:

最后是摩尔邻居算法：

public struct BorderFinder {
    public init() {}

    // Returns the point and the direction of the previous point
    // Since this scans left-to-right, the previous point is always to the west
    // The grid includes x=-1, so it's ok if this is an edge.
    func startingPoint(for grid: Grid) -> (point: Cell, direction: Direction)? {
        for y in grid.rows {
            for x in grid.columns {
                let point = Cell(x: x, y: y)
                if grid[point] {
                    return (point, .west)
                }
            }
        }
        return nil
    }

    /// Finds the boundary of a blob within `grid`
    ///
    /// - Parameter grid: an Array of bytes representing a 2D grid of UInt8. Each cell is either zero (white) or non-zero (black).
    /// - Returns: An array of points defining the boundary. The boundary includes only black points.
    ///            If multiple "blobs" exist, it is not defined which will be returned.
    ///            If no blob is found, an empty array is returned
    public func findBorder(in grid: Grid) -> [Cell] {
        guard let start = startingPoint(for: grid) else { return [] }
        var (point, direction) = start
        var boundary: [Cell] = [point]

        var rotations = 0
        repeat {
            direction.rotateClockwise()
            let nextPoint = point.inDirection(direction)
            if grid[nextPoint] {
                boundary.append(nextPoint)
                point = nextPoint
                direction = direction.backtrackDirection()
                rotations = 0
            } else {
                rotations += 1
            }
        } while (point, direction) != start && rotations <= 7

        return boundary
    }
}

This returns a list of Cells. That can be converted to a CGPath as follows:

这将返回一个单元格列表。它可以转换为CGPath，如下所示：

let data = ... Bitmap data with background as 0, and foreground as non-0 ...
let grid = Grid(width: image.width, height: image.height, values: Array(data))!

let points = BorderFinder().findBorder(in: grid)

let path = CGMutablePath()
let start = points.first!

path.move(to: CGPoint(x: start.x, y: start.y))
for point in points.dropFirst() {
    let cgPoint = CGPoint(x: point.x, y: point.y)
    path.addLine(to: cgPoint)
}
path.closeSubpath()

That generates the following path:

这将生成以下路径：

There is a gist of the full sample code I used. (This sample code isn't meant to be a good example of how to prep the image for processing. I just threw it together to work on the algorithm.)

下面是我使用的完整示例代码的要点。(此示例代码并不是一个很好的示例，说明如何准备图像以进行处理。我只是把它们拼凑在一起来研究算法。)

Some thoughts for future work:

对今后工作的一些思考：

You can likely get good and faster results by first scaling the image to something smaller. Half-scale definitely works well, but consider even 1/10 scale.

You may get better results by first applying a small Gaussian blur to the image. This will eliminate small gaps in the edge, which can cause trouble for the algorithm, and reduce the complexity of the contour.

Managing 5000 path elements, each a 2-pixel line, is probably not great. Pre-scaling the image can help a lot. Another approach is applying Ramer–Douglas–Peucker to further simplify the contour.

更多回答

Works perfectly, thank you for your help. You just forgot the 'inDirection' method in your response. But I had understood it:func inDirection(_ direction: Direction) -> Cell { switch direction { case .north: return Cell(x: x, y: y - 1)....

效果很好，谢谢你的帮助。您只是忘记了您的响应中的“间接”方法。但我已经理解了：函数间接(_Direction：Direction)->单元格{切换方向{case.North：Return Cell(x：x，y：y-1)...

Oh yes! Sorry about that. Added.

哦，是的!真对不起。加进去了。

html - 在没有模糊效果的情况下在元素周围均匀地生成阴影/轮廓
我有一个不规则形状的元素(比方说图标)。我想要围绕它的某种轮廓，以符合特定颜色的形状。此轮廓的颜色必须均匀地围绕形状，即与形状各处的距离相同，并且没有颜色渐变。我发现使用的是 css 选项 fil
c++ - OpenCV 轮廓？
这部分代码我总是出错 &contours = ((contours.h_next) -> h_next); contours.h_next = ((contours.h_next) -> h_next
css - 更正形状的不规则边框/轮廓
我通过 css (:after) 创建了 3 个圆圈，使用一些背景颜色，边框看起来不规则。有什么解决办法吗？在这里您可以看到问题:https://flowersliving.com/cpt_01/a
css - 渐变边框上的边框(轮廓)
使用这个: background: -moz-linear-gradient(315deg, transparent 10px, black 10px); 如何在不使用 border 的情况下围绕它创
二进制二维矩阵的 python 轮廓
我想计算二元 NxM 矩阵中某个形状周围的凸包。凸包算法需要一个坐标列表，所以我采用 numpy.argwhere(im) 来获得所有形状点坐标。然而，这些点中的大多数对凸包没有贡献(它们位于形状的内
css - 删除焦点下拉菜单的虚线边框/轮廓
如何删除从下拉菜单中选择元素时显示的虚线边框/轮廓？您可以看到显示了虚线边框/轮廓，我想删除它(在 Firefox 中截取的屏幕截图)。尝试下面的解决方案并没有删除它: select:focus,
css - 如何绘制半圆(仅限边框、轮廓)
关闭。这个问题不符合Stack Overflow guidelines .它目前不接受答案。这个问题似乎是题外话，因为它缺乏足够的信息来诊断问题。更详细地描述您的问题或include a min
Qt4:缩放不变的 qgraphicsitem 轮廓
我正在使用 Qt4 GraphicsView 框架绘制一些多边形，并且允许用户放大和缩小绘图。我希望多边形随着用户在 View 中更改缩放级别(比例)而变得越来越小，但是有没有办法使轮廓厚度始终保持不
点列表的 3D 轮廓(凹包)
我在 C# 中有一个 Vector3 点列表，我需要计算这些点的凹轮廓。确实有很多引用资料，尤其是 -convex- 分辨率(我已经成功实现了，多亏了 graham 的算法)，但是，由于我现在需要有
java - 基于运输时间的热图/轮廓(反向等时轮廓)
注: r 中的解决方案, python , java ，或者如果需要，c++或 c#是需要的。我正在尝试根据运输时间绘制轮廓。更清楚地说，我想将具有相似旅行时间(比如说 10 分钟间隔)的点聚集到特
python - 在另一个图像上匹配轮廓或绘制 (png) 轮廓
假设我在图像上找到了轮廓。在图像 2 上找到此轮廓位置的最佳方法是什么？我看到两个选项:要么我用白线绘制轮廓并匹配图像 2 上的图像，要么我以某种方式(这甚至可能吗？)直接匹配图像 2 上的轮廓。
python - len(轮廓)是什么意思？
我一直在研究细菌的图像，希望从图像中获取细菌的数量，还需要根据特定的形状和大小对细菌进行分类。我正在使用opencv python。现在，我使用轮廓法。 contours,hierarchy
python - 如何在OpenCV中区分实心圆/轮廓和未实心圆/轮廓？
我无法区分以下两个轮廓。 cv2.contourArea两者的值相同。在Python中有什么功能可以区分它们吗？最佳答案要区分填充轮廓和未填充轮廓，可以在使用 cv2.findContours 查
java - 基于条件的 Spring 轮廓
是否可以根据 Activity 配置文件的某些表达式来注册bean前任。 @Profile(!prod) @Profile(name!="test") 我有一种情况，我需要根据许多不同的条件配
iphone - 重叠的 CAShapeLayer 轮廓
我有一个由多个 CAShapeLayer 组成的 3D 相似图形对象。必须抚摸所有形状(天花板和墙壁)。有些形状共享一条边 - 这似乎是问题的根源。然而，轮廓似乎是围绕另一个形状的现有轮廓绘制的。所
javascript - 表单中输入元素周围的 CSS 轮廓
有谁知道，是否可以在用户使用顺序导航(TAB 按钮)时在输入元素周围显示轮廓，并在用户用鼠标单击此输入元素时隐藏轮廓？有没有人实现过这种行为？我在 CSS 文件中的 :focus 选择器上使用这个属
css - 悬停时围绕框阴影的 Firefox 轮廓
这是我在 StackOverflow 上的第一个问题，所以我会尝试以正确的方式格式化它。基本上，我有一个带有边框和轮廓的 div。悬停时，div 也会有一个阴影，当然，它应该在轮廓之外。这适用于所有
c++ - 如何水平连接 OpenCV 轮廓？
我在 Opencv 2.9 (C++) 中使用 findContours。我得到的是一个 vector> contours，它描述了我的轮廓。假设我有一个矩形，其轮廓存储在 vector 中。接下来我
javascript - 仅围绕父元素的 CSS 轮廓
我有一个 div，它有附加的子 div，定位在父 div 之外。我希望父 div 有一个轮廓 onclick，但轮廓延伸到子 div 周围。有没有办法让轮廓完全围绕父 div。我不能使用边框，因
css - ionic 图标周围的阴影/轮廓
我正在尝试在彩色图标周围设置实线边框。应该足够直截了当，显然它适用于字形，但我无法让它适用于我试过... // like this fiddle: http://jsfiddle.net/9s

bug小助手

个人简介

我是一名优秀的程序员,十分优秀！

作者热门文章

滴滴打车优惠券免费领取

全站热门文章

首页

博学

6Ren·AI

商城

Algorithm for drawing outline of image shape in BezierPath (Canny edge detector)(BezierPath(Canny边缘检测器)中的图像形状轮廓提取算法)