ios - 使用 MTKView 显示解码的视频流会导致不希望的模糊输出-6ren

ios - 使用 MTKView 显示解码的视频流会导致不希望的模糊输出

转载作者：行者123 更新时间：2023-11-29 11:29:02

我已经成功地创建了一个应用程序来接收实时的 h264 编码视频流，然后使用 Video Toolbox 和 AVSampleBufferDisplayLayer 解码并显示视频。这按预期工作，但我希望能够将过滤器应用于渲染输出，因此我改为使用 Video Toolbox 解码并使用 MetalKit 显示/渲染解码视频。我遇到的唯一问题是，我使用 MetalKit 渲染的输出明显比使用 AVSampleBufferDisplayLayer 接收的输出更模糊，我还没有设法找出原因。

这是 AVSampleBufferDisplayLayer 的输出

这是 MetalKit 的输出

我尝试跳过 MetalKit 并直接渲染到 CAMetalLayer，但同样的问题仍然存在。我正在尝试将我的 CVImageBufferRef 转换为可以使用 UIView 显示的 UIImage。如果这也最终变得模糊，那么问题可能出在我的 VTDecompressionSession 而不是 Metal 方面。

解码部分与此处给出的非常相似 How to use VideoToolbox to decompress H.264 video stream

我会尝试只粘贴我的代码中有趣的片段。

这些是我为 VTDecompressionSession 提供的选项。

NSDictionary *destinationImageBufferAttributes = [NSDictionary dictionaryWithObjectsAndKeys:
                                                      [NSNumber numberWithInteger:kCVPixelFormatType_420YpCbCr8BiPlanarVideoRange],
                                                      (id)kCVPixelBufferPixelFormatTypeKey,
                                                      nil];

这是我继承自MTKView的 View

@interface StreamView : MTKView

@property id<MTLCommandQueue> commandQueue;
@property id<MTLBuffer> vertexBuffer;
@property id<MTLBuffer> colorConversionBuffer;
@property id<MTLRenderPipelineState> pipeline;
@property CVMetalTextureCacheRef textureCache;

@property CFMutableArrayRef imageBuffers;

-(id)initWithRect:(CGRect)rect withDelay:(int)delayInFrames;
-(void)addToRenderQueue:(CVPixelBufferRef)image renderAt:(int)frame;

@end

这就是我从 View Controller 初始化 View 的方式。我收到的视频大小相同，即 666x374。

streamView = [[StreamView alloc] initWithRect:CGRectMake(0, 0, 666, 374) withDelay:0];
[self.view addSubview:streamView];

这是StreamView的initWithRect方法的内容

id<MTLDevice> device = MTLCreateSystemDefaultDevice();
self = [super initWithFrame:rect device:device];

self.colorPixelFormat = MTLPixelFormatBGRA8Unorm;
self.commandQueue = [self.device newCommandQueue];
[self buildTextureCache];
[self buildPipeline];
[self buildVertexBuffers];

这是buildPipeline方法

- (void)buildPipeline
{
    NSBundle *bundle = [NSBundle bundleForClass:[self class]];
    id<MTLLibrary> library = [self.device newDefaultLibraryWithBundle:bundle error:NULL];

    id<MTLFunction> vertexFunc = [library newFunctionWithName:@"vertex_main"];
    id<MTLFunction> fragmentFunc = [library newFunctionWithName:@"fragment_main"];

    MTLRenderPipelineDescriptor *pipelineDescriptor = [MTLRenderPipelineDescriptor new];
    pipelineDescriptor.vertexFunction = vertexFunc;
    pipelineDescriptor.fragmentFunction = fragmentFunc;
    pipelineDescriptor.colorAttachments[0].pixelFormat = self.colorPixelFormat;

    self.pipeline = [self.device newRenderPipelineStateWithDescriptor:pipelineDescriptor error:NULL];
}

这是我实际绘制纹理的方式

CVImageBufferRef image = (CVImageBufferRef)CFArrayGetValueAtIndex(_imageBuffers, 0);

id<MTLTexture> textureY = [self getTexture:image pixelFormat:MTLPixelFormatR8Unorm planeIndex:0];
id<MTLTexture> textureCbCr = [self getTexture:image pixelFormat:MTLPixelFormatRG8Unorm planeIndex:1];
if(textureY == NULL || textureCbCr == NULL)
   return;

id<CAMetalDrawable> drawable = self.currentDrawable;

id<MTLCommandBuffer> commandBuffer = [_commandQueue commandBuffer];
MTLRenderPassDescriptor *renderPass = self.currentRenderPassDescriptor;
renderPass.colorAttachments[0].clearColor = MTLClearColorMake(0.5, 1, 0.5, 1);

id<MTLRenderCommandEncoder> commandEncoder = [commandBuffer renderCommandEncoderWithDescriptor:renderPass];
[commandEncoder setRenderPipelineState:self.pipeline];
[commandEncoder setVertexBuffer:self.vertexBuffer offset:0 atIndex:0];
[commandEncoder setFragmentTexture:textureY atIndex:0];
[commandEncoder setFragmentTexture:textureCbCr atIndex:1];
[commandEncoder setFragmentBuffer:_colorConversionBuffer offset:0 atIndex:0];
[commandEncoder drawPrimitives:MTLPrimitiveTypeTriangleStrip vertexStart:0 vertexCount:4 instanceCount:1];
[commandEncoder endEncoding];

[commandBuffer presentDrawable:drawable];
[commandBuffer commit];

这就是我将 CVPixelBufferRef 转换为 MTLTexture 的方法

- (id<MTLTexture>)getTexture:(CVPixelBufferRef)image pixelFormat:(MTLPixelFormat)pixelFormat planeIndex:(int)planeIndex {
    id<MTLTexture> texture;
    size_t width, height;

    if (planeIndex == -1)
    {
        width = CVPixelBufferGetWidth(image);
        height = CVPixelBufferGetHeight(image);
        planeIndex = 0;
    }
    else
    {
        width = CVPixelBufferGetWidthOfPlane(image, planeIndex);
        height = CVPixelBufferGetHeightOfPlane(image, planeIndex);
        NSLog(@"texture %d, %ld, %ld", planeIndex, width, height);
    }

    CVMetalTextureRef textureRef = NULL;
    CVReturn status = CVMetalTextureCacheCreateTextureFromImage(NULL, _textureCache, image, NULL, pixelFormat, width, height, planeIndex, &textureRef);
    if(status == kCVReturnSuccess)
    {
        texture = CVMetalTextureGetTexture(textureRef);
        CFRelease(textureRef);
    }
    else
    {
        NSLog(@"CVMetalTextureCacheCreateTextureFromImage failed with return stats %d", status);
        return NULL;
    }

    return texture;
}

这是我的片段着色器

fragment float4 fragment_main(Varyings in [[ stage_in ]],
                              texture2d<float, access::sample> textureY [[ texture(0) ]],
                              texture2d<float, access::sample> textureCbCr [[ texture(1) ]],
                              constant ColorConversion &colorConversion [[ buffer(0) ]])
{
    constexpr sampler s(address::clamp_to_edge, filter::linear);
    float3 ycbcr = float3(textureY.sample(s, in.texcoord).r, textureCbCr.sample(s, in.texcoord).rg);

    float3 rgb = colorConversion.matrix * (ycbcr + colorConversion.offset);

    return float4(rgb, 1.0);
}

因为我编码的 View 和视频都是 666x374，所以我尝试将片段着色器中的采样类型更改为 filter::nearest。我认为它会以 1:1 的比例匹配像素，但它仍然很模糊。我注意到的另一件奇怪的事情是，如果你在新选项卡中打开上传的图像，你会看到它们比 666x374 大得多......我怀疑我在编码方面犯了错误，即使我当时犯了错误AVSampleBufferDisplayLayer 仍然设法在不模糊的情况下显示视频，因此他们一定是在做我所缺少的正确事情。

最佳答案

看起来你已经解决了最严重的 View 比例问题，其他问题是正确的 YCbCr 渲染(听起来你将通过在解码时输出 BGRA 像素来避免)然后将原始电影缩放到匹配 View 的尺寸。当您请求 BGRA 像素数据时，数据被编码为 sRGB，因此您应该将纹理中的数据视为 sRGB。从 sRGB 纹理读取时，Metal 会自动为您进行非线性到线性的转换，但您必须告诉 Metal 它是 sRGB 像素数据(使用 MTLPixelFormatBGRA8Unorm_sRGB)。要实现缩放，您只需要使用线性重采样从 BGRA 数据渲染到 View 中。如果您想查看 MetalBT709Decoder 的源代码，请参阅我上面链接的 SO 问题，这是我自己的项目，它实现了 BT.709 的正确渲染。

关于ios - 使用 MTKView 显示解码的视频流会导致不希望的模糊输出，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/55538310/

文章推荐： mysql - 复制表行，但使用另一个值

文章推荐： php - MYSQL从异常中删除

java - 如何找到 Cassandra 导致 Spark 作业中止的根本原因(导致 ClassCastException - ShuffleMapTask 到 Task)？
我正在尝试使用 Spark 从 Cassandra 读取数据。 DataFrame rdf = sqlContext.read().option("keyspace", "readypulse
ctime() 导致 SIGABRT(？!)
这是代码: void i_log_ (int error, const char * file, int line, const char * fmt, ...) { /* Get erro
导致 Gtk 在断言时中止
我必须调试一个严重依赖 Gtk 的程序。问题是由于某些原因，在使用 GtkWindow 对象时开始出现许多运行时警告。问题是，即使 Gtk 提示严重错误，它也不会因这些错误而中止。我没有代码库的更改历
glsl - glGetProgramBinary 导致 GL_INVALID_OPERATION
我正在尝试从已有效编译和链接的程序中检索二进制文件。我已经通过 GL_PROGRAM_BINARY_LENGTH 收到了它的长度。该文档说有两个实例可能会发生 GL_INVALID_OPERATION
wcf - 导致 ServiceActivationException 的原因是什么？
我有一个托管在 Azure 环境中的服务。我正在使用控制台应用程序使用该服务。这样做时，我得到了异常: "The requested service, 'http://xxxx-d.yyyy.be/S
multithreading - sem_init() 导致 SEGV
我有以下代码，它被 SEGV 信号杀死。使用调试器表明它被 main() 中的第一个 sem_init() 杀死。如果我注释掉第一个 sem_init() ，第二个会导致同样的问题。我试图弄清楚是什么
xcode - NSJSONSerialization 导致 EXC_BAD_ACCESS
目前我正在编写一个应用程序(目标 iOS 6，启用 ARC)，它使用 JSON 进行数据传输，使用核心数据进行持久存储。 JSON 数据由 PHP 脚本通过 json_encode 从 MySQL 数
android - PopAsync 导致 ArgumentOutOfRangeException
我对 Xamarin.Forms 还是很陌生。我在出现的主页上有一个非常简单的功能 async public Task BaseAppearing() { if (UserID
android - notifyDataSetChanged() 导致 IndexOutOfBoundsException
这是我的代码的简化版本。 public class MainActivity extends ActionBarActivity { private ArrayList entry = new Arr
java - 导致 NoSuchMethodError 的显式转换？
我想弄明白为什么我的两个 Java 库很难很好地协同工作。这是场景: 库 1 有一个类 A，其构造函数如下: public A(Object obj) { /* boilerplate */ } 在以
iphone - didReceiveAuthenticationChallenge 导致 EXC_BAD_ACCESS
如果网站不需要身份验证，我的代码可以正常工作，如果需要，则在打印“已创建凭据”后会立即出现 EXC_BAD_ACCESS 错误。我不会发布任何内容，并且此代码是直接从文档中复制的 - 知道出了什么问题
iphone - NSArray 导致 EXC_BAD_ACCESS
我在使用 NSArray 填充 UITableView 时遇到问题。我确信我正在做一些愚蠢的事情，但我无法弄清楚。当我尝试进行简单的计数时，我得到了 EXC_BAD_ACCESS，我知道这是因为我试图
iphone - resignFirstResponder 导致 EXC_BAD_ACCESS
我在 UITableViewCell 上有一个 UITextField，在另一个单元格上有一个按钮。我单击 UITextField(出现键盘)。 UITextField 调用了以下方法: - (BO
iphone - MKReverseGeocoder 导致 EXC_BAD_ACCESS？
我有一个应用程序出现间歇性崩溃。崩溃日志显示了一个堆栈跟踪，这对我来说很难破译，因此希望其他人看到了这一点并能为我指出正确的方向。基本上，应用程序在启动时执行反向地理编码请求，以在标签中显示用户的位
iphone - UIImageWriteToSavedPhotosAlbum 导致 EXC_BAD_ACCESS
我开发了一个 CGImage，当程序使用以下命令将其显示在屏幕上时它工作正常: [output_view.layer performSelectorOnMainThread:@selector(set
android - EncryptedSharedPreferences 导致 UnrecoverableKeyException
我正在使用新的 EncryptedSharedPreferences以谷歌推荐的方式上课: private fun securePrefs(context: Context): SharedPrefe
javascript - ClientId 导致 NullReferenceException
我有一个中继器，里面有一些控件，其中一个是文本框。我正在尝试使用 jquery 获取文本框，我的代码如下所示: $("#").click(function (event) {}); 但我总是得到 nu
android - 导致 TTS 初始化失败的原因是什么？
在以下场景中观察到 TTS 初始化错误，太随机了。已安装 TTS 引擎，存在语音集，并且可以从辅助功能选项中播放示例 tts。 TTS 初始化在之前初始化和播放的同一设备上随机失败。在不同的设备(
java - 64位VM不启动指针压缩，导致-8内存对齐
maven pom.xml org.openjdk.jol jol-core 0.10 Java 类: public class MyObjectData { pr
math - 导致 MD5 冲突的最短字符串对是什么？
在不担心冲突的情况下，可以使用 MD5 作为哈希值，字符串长度最多为多少？这可能是通过为特定字符集中的每个可能的字符串生成 MD5 哈希来计算的，长度不断增加，直到哈希第二次出现(冲突)。没有冲突的

行者123

个人简介

我是一名优秀的程序员,十分优秀！

作者热门文章

滴滴打车优惠券免费领取

全站热门文章

首页

博学

6Ren·AI

商城

ios - 使用 MTKView 显示解码的视频流会导致不希望的模糊输出