
ios - Core Audio: Float32 to SInt16 conversion artefacts


I am converting from the following format:

const int four_bytes_per_float = 4;
const int eight_bits_per_byte = 8;
_stereoGraphStreamFormat.mFormatID = kAudioFormatLinearPCM;
_stereoGraphStreamFormat.mFormatFlags = kAudioFormatFlagsNativeFloatPacked | kAudioFormatFlagIsNonInterleaved;
_stereoGraphStreamFormat.mBytesPerPacket = four_bytes_per_float;
_stereoGraphStreamFormat.mFramesPerPacket = 1;
_stereoGraphStreamFormat.mBytesPerFrame = four_bytes_per_float;
_stereoGraphStreamFormat.mChannelsPerFrame = 2;
_stereoGraphStreamFormat.mBitsPerChannel = eight_bits_per_byte * four_bytes_per_float;
_stereoGraphStreamFormat.mSampleRate = 44100;

To the following format:

interleavedAudioDescription.mFormatID          = kAudioFormatLinearPCM;
interleavedAudioDescription.mFormatFlags = kAudioFormatFlagIsSignedInteger;
interleavedAudioDescription.mChannelsPerFrame = 2;
interleavedAudioDescription.mBytesPerPacket = sizeof(SInt16)*interleavedAudioDescription.mChannelsPerFrame;
interleavedAudioDescription.mFramesPerPacket = 1;
interleavedAudioDescription.mBytesPerFrame = sizeof(SInt16)*interleavedAudioDescription.mChannelsPerFrame;
interleavedAudioDescription.mBitsPerChannel = 8 * sizeof(SInt16);
interleavedAudioDescription.mSampleRate = 44100;
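
The creation of the converter itself does not appear in the post. For context only, here is a minimal sketch (my assumption, not code from the question) of how a converter between these two descriptions might be created, reusing the interleavedAudioConverter name from the code below:

AudioConverterRef interleavedAudioConverter = NULL;

// Source: non-interleaved stereo Float32; destination: interleaved stereo SInt16.
OSStatus status = AudioConverterNew(&_stereoGraphStreamFormat,
                                    &interleavedAudioDescription,
                                    &interleavedAudioConverter);
if (status != noErr)
{
    printf("AudioConverterNew failed: %d\n", (int)status);
}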

Using the following code:

int32_t availableBytes = 0;

void* tailL = TPCircularBufferTail(inputBufferL(), &availableBytes);
void* tailR = TPCircularBufferTail(inputBufferR(), &availableBytes);

// If we have no data in the buffer, we simply return
if (availableBytes <= 0)
{
    return;
}

// ========== Non-Interleaved to Interleaved (Plus Samplerate Conversion) =========

// Get the number of frames available
UInt32 frames = availableBytes / this->mInputFormat.mBytesPerFrame;

pcmOutputBuffer->mBuffers[0].mDataByteSize = frames * interleavedAudioDescription.mBytesPerFrame;

struct complexInputDataProc_t data = (struct complexInputDataProc_t) { .self = this, .sourceL = tailL, .sourceR = tailR, .byteLength = availableBytes };

// Do the conversion
OSStatus result = AudioConverterFillComplexBuffer(interleavedAudioConverter,
                                                  complexInputDataProc,
                                                  &data,
                                                  &frames,
                                                  pcmOutputBuffer,
                                                  NULL);

// Tell the buffers how much data we consumed during the conversion so that it can be removed
TPCircularBufferConsume(inputBufferL(), availableBytes);
TPCircularBufferConsume(inputBufferR(), availableBytes);

// ========== Buffering Of Interleaved Samples =========

// If we got converted frames back from the converter, we want to add it to a separate buffer
if (frames > 0)
{
    // Make sure we have enough space in the buffer to store the new data
    TPCircularBufferHead(&pcmCircularBuffer, &availableBytes);

    if (availableBytes > pcmOutputBuffer->mBuffers[0].mDataByteSize)
    {
        // Add the newly converted data to the buffer
        TPCircularBufferProduceBytes(&pcmCircularBuffer, pcmOutputBuffer->mBuffers[0].mData, frames * interleavedAudioDescription.mBytesPerFrame);
    }
    else
    {
        printf("No Space in Buffer\n");
    }
}

However, I am getting the following output:

[Image: Sine Wave After Conversion]

It should be a perfect sine wave, but as you can see, it is not.

I have been working on this for days and cannot seem to find where it is going wrong. Can anyone see something I might be missing?

Edit:

Buffer initialisation:

TPCircularBuffer        pcmCircularBuffer;
static SInt16 pcmOutputBuf[BUFFER_SIZE];

pcmOutputBuffer = (AudioBufferList*)malloc(sizeof(AudioBufferList));
pcmOutputBuffer->mNumberBuffers = 1;
pcmOutputBuffer->mBuffers[0].mNumberChannels = 2;
pcmOutputBuffer->mBuffers[0].mData = pcmOutputBuf;

TPCircularBufferInit(&pcmCircularBuffer, BUFFER_SIZE);

Complex input data proc:

static OSStatus complexInputDataProc(AudioConverterRef inAudioConverter,
                                     UInt32 *ioNumberDataPackets,
                                     AudioBufferList *ioData,
                                     AudioStreamPacketDescription **outDataPacketDescription,
                                     void *inUserData) {

    struct complexInputDataProc_t *arg = (struct complexInputDataProc_t*)inUserData;
    BroadcastingServices::MP3Encoder *self = (BroadcastingServices::MP3Encoder*)arg->self;

    if ( arg->byteLength <= 0 )
    {
        *ioNumberDataPackets = 0;
        return 100; //kNoMoreDataErr;
    }

    UInt32 framesAvailable = arg->byteLength / self->interleavedAudioDescription.mBytesPerFrame;

    if (*ioNumberDataPackets > framesAvailable)
    {
        *ioNumberDataPackets = framesAvailable;
    }

    ioData->mBuffers[0].mData = arg->sourceL;
    ioData->mBuffers[0].mDataByteSize = arg->byteLength;

    ioData->mBuffers[1].mData = arg->sourceR;
    ioData->mBuffers[1].mDataByteSize = arg->byteLength;

    arg->byteLength = 0;

    return noErr;
}

Best Answer

I see a few things that raise red flags.

1) As mentioned in the comments above, you are overwriting the left input's availableBytes with the right input's:

void* tailL = TPCircularBufferTail(inputBufferL(), &availableBytes);
void* tailR = TPCircularBufferTail(inputBufferR(), &availableBytes);

If the two input streams are changing asynchronously to this code, then you almost certainly have a race condition.
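
A minimal sketch of the fix, assuming the same TPCircularBuffer API used above: give each channel its own byte count and only work with the smaller of the two, so neither value silently overwrites the other.

int32_t availableBytesL = 0;
int32_t availableBytesR = 0;

void* tailL = TPCircularBufferTail(inputBufferL(), &availableBytesL);
void* tailR = TPCircularBufferTail(inputBufferR(), &availableBytesR);

// Only the data present in BOTH channels can be converted on this pass.
int32_t availableBytes = std::min(availableBytesL, availableBytesR); // std::min needs <algorithm>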

2) Truncation errors: availableBytes is not necessarily a multiple of the bytes per frame. If it is not, the following bit of code can cause you to consume more bytes than you actually converted (see the sketch after this excerpt).

void* tailL = TPCircularBufferTail(inputBufferL(), &availableBytes);
void* tailR = TPCircularBufferTail(inputBufferR(), &availableBytes);
...
UInt32 frames = availableBytes / this->mInputFormat.mBytesPerFrame;
...
TPCircularBufferConsume(inputBufferL(), availableBytes);
TPCircularBufferConsume(inputBufferR(), availableBytes);
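
The sketch mentioned above, keeping the names from the excerpt: round the byte count down to a whole number of frames and consume only that much.

UInt32 frames = availableBytes / this->mInputFormat.mBytesPerFrame; // integer division drops any partial frame
int32_t usableBytes = frames * this->mInputFormat.mBytesPerFrame;

// ... convert `frames` frames ...

// Never consume bytes that were not actually converted.
TPCircularBufferConsume(inputBufferL(), usableBytes);
TPCircularBufferConsume(inputBufferR(), usableBytes);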

3) If the output buffer is not ready to take all of the converted input, you simply throw the input away. That happens in this code.

if (availableBytes > pcmOutputBuffer->mBuffers[0].mDataByteSize)
{
...
}
else
{
printf("No Space in Buffer\n");
}

I would be very curious to know whether you are seeing that printout.
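
A condensed sketch of the alternative (the full version is in the suggested code below): ask the output buffer for its free space first and limit how many frames you convert, instead of discarding converted data.

int32_t spaceOut = 0;
TPCircularBufferHead(&pcmCircularBuffer, &spaceOut);

// Only convert as many frames as the output buffer can actually accept.
UInt32 framesOutMax = spaceOut / interleavedAudioDescription.mBytesPerFrame;
if (frames > framesOutMax)
{
    frames = framesOutMax;
}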

Here is how I would suggest doing it. It will be pseudo-code-ish, since I don't have everything I'd need to compile and test it.

int32_t availableBytesInL = 0;
int32_t availableBytesInR = 0;
int32_t availableBytesOut = 0;

// figure out how many bytes are available in each buffer.
void* tailL = TPCircularBufferTail(inputBufferL(), &availableBytesInL);
void* tailR = TPCircularBufferTail(inputBufferR(), &availableBytesInR);
TPCircularBufferHead(&pcmCircularBuffer, &availableBytesOut);

// figure out how many full frames are available
UInt32 framesInL = availableBytesInL / mInputFormat.mBytesPerFrame;
UInt32 framesInR = availableBytesInR / mInputFormat.mBytesPerFrame;
UInt32 framesOut = availableBytesOut / interleavedAudioDescription.mBytesPerFrame;

// figure out how many frames to process this time.
UInt32 frames = min(min(framesInL, framesInR), framesOut);

if (frames == 0)
    return;

int32_t bytesConsumed = frames * mInputFormat.mBytesPerFrame;

struct complexInputDataProc_t data = (struct complexInputDataProc_t) {
    .self = this, .sourceL = tailL, .sourceR = tailR, .byteLength = bytesConsumed };

// Do the conversion
OSStatus result = AudioConverterFillComplexBuffer(interleavedAudioConverter,
                                                  complexInputDataProc,
                                                  &data,
                                                  &frames,
                                                  pcmOutputBuffer,
                                                  NULL);

int32_t bytesProduced = frames * interleavedAudioDescription.mBytesPerFrame;

// Tell the buffers how much data we consumed during the conversion so that it can be removed
TPCircularBufferConsume(inputBufferL(), bytesConsumed);
TPCircularBufferConsume(inputBufferR(), bytesConsumed);
TPCircularBufferProduceBytes(&pcmCircularBuffer, pcmOutputBuffer->mBuffers[0].mData, bytesProduced);

Basically what I've done here is to figure out up front how many frames should be processed, making sure I only process as many frames as the output buffer can handle. If it were me, I'd also add some checks for output buffer underruns and input buffer overruns. Finally, I'm not entirely sure of the semantics of AudioConverterFillComplexBuffer with respect to the frame count that is passed in and out. It is conceivable that the number of output frames could be fewer or more than the number of input frames, although since you are not doing sample rate conversion that probably won't happen here. I've tried to account for that case by computing bytesProduced after the conversion.
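
For example, one hedged addition to the sketch above would be to check the converter's return value before trusting frames and bytesProduced, treating the custom value 100 returned by complexInputDataProc as "ran out of input" rather than a hard failure:

if (result != noErr && result != 100 /* kNoMoreDataErr from complexInputDataProc */)
{
    printf("AudioConverterFillComplexBuffer failed: %d\n", (int)result);
    return;
}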

Hope this helps. If not, you have a couple of other clues: the dropouts are periodic, and their size looks roughly the same each time. If you can work out how many samples each of those corresponds to, you can look for those numbers in your code.

Regarding ios - Core Audio: Float32 to SInt16 conversion artefacts, we found a similar question on Stack Overflow: https://stackoverflow.com/questions/29619205/
