c# - 使用 DirectShow .NET 从 SampleGrabber 计算 FFT-6ren

c# - 使用 DirectShow .NET 从 SampleGrabber 计算 FFT

转载作者：太空宇宙更新时间：2023-11-03 15:52:50

我正在使用 DirectShow .NET 开发一个项目。我正在尝试集成一个名为“WPF Sound Visualization Library”的库，它创建了一个频谱分析仪视觉对象。

为了让视觉效果正常工作，我需要在我的播放器中实现这两种方法:

GetFFTData(float[] fftDataBuffer) - 将当前 FFT 数据分配给缓冲区。
备注:缓冲区中的 FFT 数据应仅包含实数强度值。这意味着如果您的 FFT 算法返回复数(很多人都这样做)，您将运行类似于以下的算法:for(int i = 0; i < complexNumbers.Length/2; i++) fftResult[i] = Math.Sqrt (complexNumbers[i].Real * complexNumbers[i].Real + complexNumbers[i].Imaginary * complexNumbers[i].Imaginary);
GetFFTFrequencyIndex(int frequency) - 获取给定频率的 FFT 数据缓冲区中的索引。

编辑:我已经添加了 SampleGrabber 并将其回调与 GetFFTData 集成(仍未测试)。但是如何集成GetFFTFrequencyIndex方法呢？

    protected int SampleCB(double SampleTime, IMediaSample pSample)
    {
        IntPtr pointer = IntPtr.Zero;
        pSample.GetPointer(out pointer);
        sampleDataBytes = new byte[pSample.GetSize()];
        Marshal.Copy(pointer, sampleDataBytes, 0, sampleDataBytes.Length);
        var sampleTime = SampleTime;
        var actualDataLength = pSample.GetActualDataLength();

        /* Release unmanaged resources */
        Marshal.ReleaseComObject(pSample);
        pSample = null;

        return (int)HResults.S_OK;
    }

    #region ISpectrumPlayer
    byte[] sampleDataBytes = null;

    public bool GetFFTData(float[] fftDataBuffer)
    {
        if (sampleDataBytes != null)
        {
            var sampleData = Utils.GetInt16Array(sampleDataBytes);
            double[] pRealIn = new double[sampleData.Length];

            for (var i = 0; i <= sampleData.Length - 1; i++)
                pRealIn[i] = sampleData[i];

            var pImagIn = new double[sampleDataBytes.Length];
            var pRealOut = new double[sampleDataBytes.Length];
            var pImagOut = new double[sampleDataBytes.Length];

            FFTUtils.Compute((uint) pRealIn.Length, pRealIn, pImagIn, pRealOut, pImagOut, false);

            fftDataBuffer = new float[sampleDataBytes.Length];

            for (int i = 0; i < pRealOut.Length; i++)
                fftDataBuffer[i] = (float) Math.Sqrt(pRealOut[i] * pRealOut[i] + pImagOut[i] * pImagOut[i]);
        }

        return true;
    }

    public int GetFFTFrequencyIndex(int frequency)
    {
        throw new NotImplementedException();
    }
    #endregion

我对这个类有帮助的方法:

public class FFTUtils
{
    public const Double DDC_PI = 3.14159265358979323846;

    /// <summary>
    /// Verifies a number is a power of two
    /// </summary>
    /// <param name="x">Number to check</param>
    /// <returns>true if number is a power two (i.e.:1,2,4,8,16,...)</returns>
    public static Boolean IsPowerOfTwo(UInt32 x)
    {
        return ((x != 0) && (x & (x - 1)) == 0);
    }

    /// <summary>
    /// Get Next power of number.
    /// </summary>
    /// <param name="x">Number to check</param>
    /// <returns>A power of two number</returns>
    public static UInt32 NextPowerOfTwo(UInt32 x)
    {
        x = x - 1;
        x = x | (x >> 1);
        x = x | (x >> 2);
        x = x | (x >> 4);
        x = x | (x >> 8);
        x = x | (x >> 16);
        return x + 1;
    }

    /// <summary>
    /// Get Number of bits needed for a power of two
    /// </summary>
    /// <param name="PowerOfTwo">Power of two number</param>
    /// <returns>Number of bits</returns>
    public static UInt32 NumberOfBitsNeeded(UInt32 PowerOfTwo)
    {
        if (PowerOfTwo > 0)
        {
            for (UInt32 i = 0, mask = 1; ; i++, mask <<= 1)
            {
                if ((PowerOfTwo & mask) != 0)
                    return i;
            }
        }
        return 0; // error
    }

    /// <summary>
    /// Reverse bits
    /// </summary>
    /// <param name="index">Bits</param>
    /// <param name="NumBits">Number of bits to reverse</param>
    /// <returns>Reverse Bits</returns>
    public static UInt32 ReverseBits(UInt32 index, UInt32 NumBits)
    {
        UInt32 i, rev;

        for (i = rev = 0; i < NumBits; i++)
        {
            rev = (rev << 1) | (index & 1);
            index >>= 1;
        }

        return rev;
    }

    /// <summary>
    /// Return index to frequency based on number of samples
    /// </summary>
    /// <param name="Index">sample index</param>
    /// <param name="NumSamples">number of samples</param>
    /// <returns>Frequency index range</returns>
    public static Double IndexToFrequency(UInt32 Index, UInt32 NumSamples)
    {
        if (Index >= NumSamples)
            return 0.0;
        else if (Index <= NumSamples / 2)
            return (double)Index / (double)NumSamples;

        return -(double)(NumSamples - Index) / (double)NumSamples;
    }

    /// <summary>
    /// Compute FFT
    /// </summary>
    /// <param name="NumSamples">NumSamples Number of samples (must be power two)</param>
    /// <param name="pRealIn">Real samples</param>
    /// <param name="pImagIn">Imaginary (optional, may be null)</param>
    /// <param name="pRealOut">Real coefficient output</param>
    /// <param name="pImagOut">Imaginary coefficient output</param>
    /// <param name="bInverseTransform">bInverseTransform when true, compute Inverse FFT</param>
    public static void Compute(UInt32 NumSamples, Double[] pRealIn, Double[] pImagIn,
                        Double[] pRealOut, Double[] pImagOut, Boolean bInverseTransform)
    {
        UInt32 NumBits;    /* Number of bits needed to store indices */
        UInt32 i, j, k, n;
        UInt32 BlockSize, BlockEnd;

        double angle_numerator = 2.0 * DDC_PI;
        double tr, ti;     /* temp real, temp imaginary */

        if (pRealIn == null || pRealOut == null || pImagOut == null)
        {
            // error
            throw new ArgumentNullException("Null argument");
        }
        if (!IsPowerOfTwo(NumSamples))
        {
            // error
            throw new ArgumentException("Number of samples must be power of 2");
        }
        if (pRealIn.Length < NumSamples || (pImagIn != null && pImagIn.Length < NumSamples) ||
             pRealOut.Length < NumSamples || pImagOut.Length < NumSamples)
        {
            // error
            throw new ArgumentException("Invalid Array argument detected");
        }

        if (bInverseTransform)
            angle_numerator = -angle_numerator;

        NumBits = NumberOfBitsNeeded(NumSamples);

        /*
        **   Do simultaneous data copy and bit-reversal ordering into outputs...
        */
        for (i = 0; i < NumSamples; i++)
        {
            j = ReverseBits(i, NumBits);
            pRealOut[j] = pRealIn[i];
            pImagOut[j] = (double)((pImagIn == null) ? 0.0 : pImagIn[i]);
        }

        /*
        **   Do the FFT itself...
        */
        BlockEnd = 1;
        for (BlockSize = 2; BlockSize <= NumSamples; BlockSize <<= 1)
        {
            double delta_angle = angle_numerator / (double)BlockSize;
            double sm2 = Math.Sin(-2 * delta_angle);
            double sm1 = Math.Sin(-delta_angle);
            double cm2 = Math.Cos(-2 * delta_angle);
            double cm1 = Math.Cos(-delta_angle);
            double w = 2 * cm1;
            double ar0, ar1, ar2;
            double ai0, ai1, ai2;

            for (i = 0; i < NumSamples; i += BlockSize)
            {
                ar2 = cm2;
                ar1 = cm1;

                ai2 = sm2;
                ai1 = sm1;

                for (j = i, n = 0; n < BlockEnd; j++, n++)
                {
                    ar0 = w * ar1 - ar2;
                    ar2 = ar1;
                    ar1 = ar0;

                    ai0 = w * ai1 - ai2;
                    ai2 = ai1;
                    ai1 = ai0;

                    k = j + BlockEnd;
                    tr = ar0 * pRealOut[k] - ai0 * pImagOut[k];
                    ti = ar0 * pImagOut[k] + ai0 * pRealOut[k];

                    pRealOut[k] = (pRealOut[j] - tr);
                    pImagOut[k] = (pImagOut[j] - ti);

                    pRealOut[j] += (tr);
                    pImagOut[j] += (ti);
                }
            }

            BlockEnd = BlockSize;
        }

        /*
        **   Need to normalize if inverse transform...
        */
        if (bInverseTransform)
        {
            double denom = (double)(NumSamples);

            for (i = 0; i < NumSamples; i++)
            {
                pRealOut[i] /= denom;
                pImagOut[i] /= denom;
            }
        }
    }

    /// <summary>
    /// Calculate normal (power spectrum)
    /// </summary>
    /// <param name="NumSamples">Number of sample</param>
    /// <param name="pReal">Real coefficient buffer</param>
    /// <param name="pImag">Imaginary coefficient buffer</param>
    /// <param name="pAmpl">Working buffer to hold amplitude Xps(m) = | X(m)^2 | = Xreal(m)^2  + Ximag(m)^2</param>
    public static void Norm(UInt32 NumSamples, Double[] pReal, Double[] pImag, Double[] pAmpl)
    {
        if (pReal == null || pImag == null || pAmpl == null)
        {
            // error
            throw new ArgumentNullException("pReal,pImag,pAmpl");
        }
        if (pReal.Length < NumSamples || pImag.Length < NumSamples || pAmpl.Length < NumSamples)
        {
            // error
            throw new ArgumentException("Invalid Array argument detected");
        }

        // Calculate amplitude values in the buffer provided
        for (UInt32 i = 0; i < NumSamples; i++)
        {
            pAmpl[i] = pReal[i] * pReal[i] + pImag[i] * pImag[i];
        }
    }

    /// <summary>
    /// Find Peak frequency in Hz
    /// </summary>
    /// <param name="NumSamples">Number of samples</param>
    /// <param name="pAmpl">Current amplitude</param>
    /// <param name="samplingRate">Sampling rate in samples/second (Hz)</param>
    /// <param name="index">Frequency index</param>
    /// <returns>Peak frequency in Hz</returns>
    public static Double PeakFrequency(UInt32 NumSamples, Double[] pAmpl, Double samplingRate, ref UInt32 index)
    {
        UInt32 N = NumSamples >> 1;   // number of positive frequencies. (numSamples/2)

        if (pAmpl == null)
        {
            // error
            throw new ArgumentNullException("pAmpl");
        }
        if (pAmpl.Length < NumSamples)
        {
            // error
            throw new ArgumentException("Invalid Array argument detected");
        }

        double maxAmpl = -1.0;
        double peakFreq = -1.0;
        index = 0;

        for (UInt32 i = 0; i < N; i++)
        {
            if (pAmpl[i] > maxAmpl)
            {
                maxAmpl = (double)pAmpl[i];
                index = i;
                peakFreq = (double)(i);
            }
        }

        return samplingRate * peakFreq / (double)(NumSamples);
    }
}

非常感谢!

最佳答案

如果我没记错的话，该算法采用实数(例如 int[n])信号或复数信号(例如 int[n][2])和返回复数 FFT 结果。

所以，这看起来很简单:

您获取输入值(您可以在图表中绘制为时间值，例如左扬声器音频值)并将它们输入 pRealIn 参数。在 pImagIn 中放置零(与在 pRealIn 中一样多)。在 bInverseTransform 中你输入 false(当然)。

然后您将把结果取回pRealOut & pImagOut。结果缓冲区在逻辑上应与输入缓冲区大小相同。您必须采用这两个输出缓冲区并将它们像这样成对组合(对于 OUT 数组的每个元素):

fftDataBuffer[k] = Math.Sqrt(pRealOut[k] * pRealOut[k] + pImagOut[k] * pImagOut[k]); // Do this from 1 to n

FFT 结果是一组复数值(x = 实部，y = 虚部 - 您可以在笛卡尔系统中将其描述为向量)。你想要向量的大小/振幅，这就是你执行上述操作的原因。

那是为了 GetFFTData。

我看到您有一个名为 IndexToFrequency 的函数。所以这可能有效。您所要做的就是为缓冲区的每个索引调用此方法。即:

for(int i=0; i<n; i++) freq[i] = IndexToFrequency(i, n);

保存这些值，然后在您的 GetFFTFrequencyIndex(int frequency) 中找到输入参数 (frequency) 与 freq 中的元素最接近的匹配项[n] 并返回其索引。

我认为这就足够了。

重要提示:确保您的缓冲区大小为二次方(NextPowerOfTwo 似乎旨在帮助您做到这一点)。

如果您的数据有时碰巧较小，您可以在末尾用零填充值(也就是将零附加到输入缓冲区)。另外:为了获得更好的分辨率，您可以再次用零填充数据。这将增加您可能需要的结果的“平滑度”。

如果我可能会问(只是好奇 :))，您在哪里找到这段代码？

所以，就是这样! :)

关于c# - 使用 DirectShow .NET 从 SampleGrabber 计算 FFT，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/25094829/

文章推荐： c# - 在使用 Excel DNA C# 创建的 Excel 插件中创建身份验证

文章推荐： python - 从 Flask-Script 命令向 Flask View 发出请求

文章推荐： python - 导入错误 : No module named tensorflow in Spyder

文章推荐： python - python中打开文件I/O内存管理

directshow - 在 DirectShow 之外使用 DirectShow 过滤器？
我目前正在处理 Windows Media Foundation。但是，由于 Microsoft H.264 解码器的一些问题和一些缺少自定义格式的解码器，我想知道是否可以直接使用 CLSID 实例化
directshow - 有没有办法发现图中的哪个 DirectShow 过滤器发送了 DirectShow 事件？
IMediaEvent 和 IMediaEventEx 没有说明图中哪个过滤器发送了检索到的 DirectShow 事件。有什么办法可以找到这些信息。特别是对于图中的过滤器之一发送 EC_ERRORA
directshow - 在哪里可以找到完整的 DirectShow 教程？
我已经阅读了 DirectShow 上的 MSDN 文档，但它仍然令人困惑。我觉得我需要更多关于对象的上下文:图形、图钉、过滤器...等。谷歌搜索并没有给我太多的工作。我需要什么才能理解 Direct
directshow - 如何唯一标识 DirectShow 音频渲染器过滤器？
正如我刚刚发现的那样，友好的名称不能保证是唯一的。如果我可以从该标识符实例化过滤器而无需枚举它们，则可加分。最佳答案可以通过 WaveOutId 识别包装 WaveOut 设备的渲染器过滤器。那些
directshow - 有没有例子展示如何编写 DirectShow 转换过滤器？
我想捕获当前帧及其前一帧以进行分析并生成一个新帧来显示。是说我必须写一个变换DirectShow过滤器吗？但我是 DirectShow 的新手。我被MSDN的大量文档弄糊涂了。所以我想知道是否有任何简
directshow - DirectShow 中 GMFBridge 的使用
enter code here我必须动态停止和启动 Video Renderer Filter。在不创建新图表的情况下，使用“普通”直接展示架构是不可能的。但是使用 GMFBridge 似乎是可能的。
directshow - DirectShow 中的自动 YUV -> RGB 用于自定义解码器
在网上搜索了几个小时后，我非常想找到解决方案。我已经在 DirectShow 中启动并运行 OGG Theora 解码器，它输出 YV12 和 YUY2 颜色模型。现在，我想为这个输出制作一个 RG
directshow - 是否有任何现代 Windows PC 中至少没有一个 DirectShow 捕获设备？
就目前而言，这个问题不适合我们的问答形式。我们希望答案得到事实、引用或专业知识的支持，但这个问题可能会引起辩论、争论、投票或扩展讨论。如果您觉得这个问题可以改进并可能重新打开，visit the he
directshow - 如何处理原始 UDP 数据包，以便它们可以被 Directshow 源过滤器中的解码器过滤器解码
长篇故事: 有一个 H264/MPEG-4 源我可以用 RTSP 协议(protocol)连接这个源。我可以使用 RTP 协议(protocol)获取原始 UDP 数据包。然后将这些原始 UDP
c++ - 使用 DirectShow 从非 DirectShow 源捕获到 AVI
这可能是一个愚蠢的问题，但我很难概念化我需要在这里做什么......过去我使用 DirectShow 连接到相机并使用源过滤器捕获 AVI，AVI多路复用器、压缩过滤器、运行图表等……小菜一碟。在这种
directshow - 将 DirectShow 过滤器迁移到 Media Foundation MFT 的快速方法
我有一些自定义 DirectShow 过滤器(用于编码/解码/多路复用/多路分离)。我想在 Media Foundation 上将它们用作 MFT。我听过一些人说: “微软声称所有 DirectS
directshow - 如何在 DirectShow 中将 SampleGrabber 与 Matroska Muxer 一起使用？
我做了一个简单的图表来编写 MKV 文件。但我不想使用文件编写器。我想使用 SampleGabber 并使用简单的程序获取流。问题是我从 Matroska muxer 以外的任何地方获取数据!(我该怎
c# - DirectShow 库-如何使用 directshow 库捕获图像而不在 PictureBox 或 Panel 上显示网络摄像头实时图像
我正在使用使用 DirectShow 库的 WPF 应用程序，它可以很好地抓取图像或记录实时提要，但我希望最终用户不应该看到任何网络摄像头正在拍照，即我想要用户不应看到正在呈现来自网络摄像头的提要的图
directshow - 编写直接显示源过滤器
我应该写一个直接显示过滤器从实时源获取输入(视频、音频)。它应该将数据(视频，音频:已编码)提供给解码器过滤器 MyCustomDirectShowSourceFilter ---> Decode
DirectShow 过滤器图形编辑器不显示远程图形
我在从 DirectShow 筛选器图表编辑器连接到远程图表时遇到问题。当我运行创建直接显示图形的应用程序时，在我的 Windows XP 计算机上，图形显示在远程图形列表中，但在 Windows 7
video - DirectShow - 它是什么？
什么是微软的 DirectShow ，以及它与以下内容有何关系: 编解码器？容器？编解码器和容器之间的确切区别是什么？最佳答案用简单的英语: 编解码器是一种算法和/或代码，可将音频或视频从 R
encoding - DirectShow 中的实时视频编码
我开发了一个使用 DirectShow 从外部设备捕获视频的 Windows 应用程序。图像分辨率为 640x480，未经压缩保存的视频尺寸非常大(大约每秒 27MB)。我的目标是尽可能地减小这个大
webcam - DirectShow - 在查看时捕获网络摄像头？
我正在尝试用 C# 制作一个允许用户录制视频的网络摄像头应用程序。我一直在使用 DirectShow.Net 来预览网络摄像头并拍摄快照。但需要能够在预览网络摄像头的同时捕捉视频和音频。我尝试过的一件
.net - DirectShow Dotnet音频输入和渲染器
我已经找了很长时间，但是找不到解决方案。如何从USB(符合Directshow规范)输入设备捕获音频并直接传递到PC扬声器(“音频渲染器”)？我相信这不是一项艰巨的任务，但是我确实在网络上找不到任
winapi - DirectShow 音频捕获设备的注册表位置
我正在从我的应用程序执行 VLC 以从 DirectShow 音频捕获设备捕获和编码。 VLC 通过 STDOUT 将编码数据发送到我的应用程序。我需要一种枚举 DirectShow 音频捕获设备的方

太空宇宙

个人简介

我是一名优秀的程序员,十分优秀！

作者热门文章

滴滴打车优惠券免费领取

全站热门文章

首页

博学

6Ren·AI

商城

c# - 使用 DirectShow .NET 从 SampleGrabber 计算 FFT