ios - GNAudioSourceMic原始音频位置-6ren

ios - GNAudioSourceMic原始音频位置

转载作者：行者123 更新时间：2023-12-03 02:11:06

我目前正在开发一个使用Gracenote Mobile Client创建指纹以及识别我正在听的音乐的应用程序。我已经在我的项目中成功实现了它，但是由于业务需求，我不得不将Gracenote录制的音频用于其他处理。

关键是:由于GNAudioSourceMic封装了整个麦克风录制操作，例如startRecording / stopRecording，因此我无法访问麦克风原始音频。

这是我正在使用的代码:

- (void)viewDidLoad
{
    [super viewDidLoad];
    [self setNeedsStatusBarAppearanceUpdate];
    [self setupUI];

    @try {
        self.config = [GNConfig init:GRACENOTE_CLIENTID];
    }
    @catch (NSException * e) {
        NSLog(@"%s clientId can't be nil or the empty string",__PRETTY_FUNCTION__);
        [self.view setUserInteractionEnabled:FALSE];
        return;
    }

    // Debug is disabled in the GUI by default
#ifdef DEBUG
    [self.config setProperty:@"debugEnabled" value:@"1"];
#else
    [self.config setProperty:@"debugEnabled" value:@"0"];
#endif
    [self.config setProperty:@"lookupmodelocalonly" value:@"0"];

    // -------------------------------------------------------------------------------
    //Init AudioSource to Start Recording.
    // -------------------------------------------------------------------------------

    self.recognizeFromPCM = [GNRecognizeStream gNRecognizeStream:self.config];
    self.audioConfig = [GNAudioConfig gNAudioConfigWithSampleRate:44100 bytesPerSample:2 numChannels:1];

    self.objAudioSource = [GNAudioSourceMic gNAudioSourceMic:self.audioConfig];
    self.objAudioSource.delegate=self;

    NSError *err;

    RecognizeStreamOperation *op = [RecognizeStreamOperation recognizeStreamOperation:self.config];
    op.viewControllerDelegate = self;
    err = [self.recognizeFromPCM startRecognizeSession:op audioConfig:self.audioConfig];

    if (err) {
        NSLog(@"ERROR: %@",[err localizedDescription]);
    }

    [self.objAudioSource startRecording];

    [self performSelectorInBackground:@selector(setUpRecognizePCMSession) withObject:nil];

}

-(void) startRecordMicrophone{
    #ifdef DEBUG
        NSLog(@"%s startRecording",__PRETTY_FUNCTION__);
    #endif

    NSError *error;
    error = [self.recognizeFromPCM idNow];

    if (error) {
        NSLog(@"ERROR: %@",[error localizedDescription]);
    }

}

是否有人面临与上述相同的需求？

提前致谢

最佳答案

经过昨天的大量搜寻之后，我想出了一个解决方案，它不是我以前期望的，但是它可以按我的意愿很好地工作。我决定自己录制iOS麦克风，然后在Grancenote SDK上调用一个方法来识别我刚刚录制的内容。

这是对我有用的东西。

MicrophoneInput.h

#import <Foundation/Foundation.h>
#import <UIKit/UIKit.h>
#import <AVFoundation/AVFoundation.h>

@interface MicrophoneInput : UIViewController {
    AVAudioPlayer *audioPlayer;
    AVAudioRecorder *audioRecorder;
    int recordEncoding;
    enum
    {
        ENC_AAC = 1,
        ENC_ALAC = 2,
        ENC_IMA4 = 3,
        ENC_ILBC = 4,
        ENC_ULAW = 5,
        ENC_PCM = 6,
    } encodingTypes;
}

-(IBAction) startRecording;
-(IBAction) stopRecording;

@end

MicrophoneInput.m

#import "MicrophoneInput.h"


@implementation MicrophoneInput

- (void)viewDidLoad
{
    [super viewDidLoad];
    recordEncoding = ENC_PCM;
}

-(IBAction) startRecording
{
    NSLog(@"startRecording");
    [audioRecorder release];
    audioRecorder = nil;

    // Init audio with record capability
    AVAudioSession *audioSession = [AVAudioSession sharedInstance];
    [audioSession setCategory:AVAudioSessionCategoryRecord error:nil];

    NSMutableDictionary *recordSettings = [[NSMutableDictionary alloc] initWithCapacity:10];
    recordSettings[AVFormatIDKey] = @(kAudioFormatLinearPCM);
    recordSettings[AVSampleRateKey] = @8000.0f;
    recordSettings[AVNumberOfChannelsKey] = @1;
    recordSettings[AVLinearPCMBitDepthKey] = @16;
    recordSettings[AVLinearPCMIsBigEndianKey] = @NO;
    recordSettings[AVLinearPCMIsFloatKey] = @NO;   

    //set the export session's outputURL to <Documents>/output.caf
    NSArray *paths = NSSearchPathForDirectoriesInDomains(NSDocumentDirectory, NSUserDomainMask, YES);
    NSString *documentsDirectory = paths[0];
    NSURL* outURL = [NSURL fileURLWithPath:[documentsDirectory stringByAppendingPathComponent:@"output.caf"]];
    [[NSFileManager defaultManager] removeItemAtURL:outURL error:nil];
    NSLog(@"url loc is %@", outURL);

    NSError *error = nil;
    audioRecorder = [[ AVAudioRecorder alloc] initWithURL:outURL settings:recordSettings error:&error];

    if ([audioRecorder prepareToRecord] == YES){
        [audioRecorder record];
    }else {
        int errorCode = CFSwapInt32HostToBig ([error code]); 
        NSLog(@"Error: %@ [%4.4s])" , [error localizedDescription], (char*)&errorCode); 

    }
    NSLog(@"recording");
}

-(IBAction) stopRecording
{
    NSLog(@"stopRecording");
    [audioRecorder stop];
    NSLog(@"stopped");
}


- (void)dealloc
{
    [audioPlayer release];
    [audioRecorder release];
    [super dealloc];
}


@end

观察:如果您使用的是ARC，请不要忘记在Compiling BuildPhase上添加-fno-objc-arc编译器标志，如下所示。

YourViewController.h

//Libraries
#import <AVFoundation/AVFoundation.h>
#import <AudioToolbox/AudioToolbox.h>

//Echonest Codegen
#import "MicrophoneInput.h"

//GracenoteMusic
#import <GracenoteMusicID/GNRecognizeStream.h>
#import <GracenoteMusicID/GNAudioSourceMic.h>
#import <GracenoteMusicID/GNAudioConfig.h>
#import <GracenoteMusicID/GNCacheStatus.h>
#import <GracenoteMusicID/GNConfig.h>
#import <GracenoteMusicID/GNSampleBuffer.h>
#import <GracenoteMusicID/GNOperations.h>
#import <GracenoteMusicID/GNSearchResponse.h>

@interface YourViewController : UIViewController<GNSearchResultReady>


@end

YourViewController.m

#import "YourViewController.h"

@interface YourViewController ()
//Record
@property(strong,nonatomic) MicrophoneInput* recorder;
@property (strong,nonatomic) GNConfig *config;
@end

@implementation YourViewController


#pragma mark - UIViewController lifecycle

- (void)viewDidLoad
{
    [super viewDidLoad];
    self.recorder = [[MicrophoneInput alloc] init];

    @try {
        self.config = [GNConfig init:GRACENOTE_CLIENTID];
    }
    @catch (NSException * e) {
        NSLog(@"%s clientId can't be nil or the empty string",__PRETTY_FUNCTION__);
        [self.view setUserInteractionEnabled:FALSE];
        return;
    }

    // Debug is disabled in the GUI by default
#ifdef DEBUG
    [self.config setProperty:@"debugEnabled" value:@"1"];
#else
    [self.config setProperty:@"debugEnabled" value:@"0"];
#endif
    [self.config setProperty:@"lookupmodelocalonly" value:@"0"];
}    

-(void)viewDidAppear:(BOOL)animated{
    [self performSelectorInBackground:@selector(startRecordMicrophone) withObject:nil];
}

-(void) startRecordMicrophone{
    #ifdef DEBUG
        NSLog(@"%s startRecording",__PRETTY_FUNCTION__);
    #endif
    [self.recorder startRecording];
    [self performSelectorOnMainThread:@selector(makeMyProgressBarMoving) withObject:nil waitUntilDone:NO];
}

-(void) stopRecordMicrophone{
#ifdef DEBUG
    NSLog(@"%s stopRecording",__PRETTY_FUNCTION__);
#endif
    [self.recorder stopRecording];

    NSArray *paths = NSSearchPathForDirectoriesInDomains(NSDocumentDirectory, NSUserDomainMask, YES);
    NSString *documentsDirectory = paths[0];
    NSString *filePath =[documentsDirectory stringByAppendingPathComponent:@"output.caf"];

    NSData* sampleData = [[NSData alloc] initWithContentsOfFile:filePath];
        GNSampleBuffer *sampleBuffer = [GNSampleBuffer gNSampleBuffer:sampleData numChannels:1 sampleRate:8000];
    [GNOperations recognizeMIDStreamFromPcm:self config:self.config sampleBuffer:sampleBuffer];
}

#pragma mark - UI methods

-(void)makeMyProgressBarMoving {

    float actual = [self.progressBar progress];
    if (actual < 1) {
        [self.loadingAnimationView showNextLevel];
        self.progressBar.progress = actual + 0.0125;
        [NSTimer scheduledTimerWithTimeInterval:0.25f target:self selector:@selector(makeMyProgressBarMoving) userInfo:nil repeats:NO];
    }
    else{
        self.progressBar.hidden = YES;
        [self stopRecordMicrophone];
    }

}            

#pragma mark - GNSearchResultReady methods
- (void) GNResultReady:(GNSearchResult*)result{
    NSLog(@"%s",__PRETTY_FUNCTION__);
}

@end

有关MicrophoneInput解决方案的信息，请转到Brian Whitman和Echo Nest Library。

希望它可以帮助面临同样情况的人。

干杯

关于ios - GNAudioSourceMic原始音频位置，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/23898280/

文章推荐： elasticsearch - Elasticsearch更改内部时区

文章推荐： javascript - HTML麦克风可视化器

文章推荐： ios - 在 iOS 设备上流式传输一些音频格式文件

java - 原始 + ""与 Wrapper.toString(原始)
当需要将原始类型转换为字符串时，例如传递给需要字符串的方法时，基本上有两种选择。以int为例，给出: int i; 我们可以执行以下操作之一: someStringMethod(Integer.to
r - Bootstrapping : Error in statistic(data, 原始，...):未使用的参数(原始)
我有一个位置估计数据库，并且想要计算每月的内核利用率分布。我可以使用 R 中的 adehabitat 包来完成此操作，但我想使用引导数据库中的样本来估计这些值的 95% 置信区间。今天我一直在尝试引导
PowerShell 原始 FTP
我希望使用 FTP 编写大型机作业流。为此，我可以通过 FTP 连接到大型机并运行以下命令: QUOTE TYPE E QUOTE SITE FILETYPE=JES PUT myjob.jcl 那么
WPF:将画笔恢复为默认/原始
我是 WPF 的新手。目前，我正在为名为“LabeledTextbox”的表单元素制作一个用户控件，其中包含一个标签、一个文本框和一个用于错误消息的文本 block 。当使用代码添加错误消息时，我
SignalR(原始)不向客户端发送消息
我们正在使用 SignalR(原始版本，而不是 Core 版本)并注意到一些无法解释的行为。我们的情况如下: 我们有一个通过 GenericCommand() 方法接受命令的集线器(见下文)。这些命
Python请求 - 打印整个http请求(原始)？
使用 requests module 时，有没有办法打印原始 HTTP 请求？我不只想要标题，我想要请求行、标题和内容打印输出。是否可以看到最终由 HTTP 请求构造的内容？最佳答案 Since
你需要知道的三种VMware磁盘类型：原始、厚和精简
与直接访问现有本地磁盘或分区的物理磁盘相比，虚拟磁盘为文件存储提供更好的可移植性和效率。VMware有三种不同的磁盘类型：原始磁盘、厚磁盘和精简磁盘，它们各自分配不同的存储空间。 VMware
unity3d - 预制件(原始)和变体预制件有什么区别？
我有一个用一些颜色着色器等创建的门。前段时间我拖着门，它问我该怎么办时，我选择了变体。但现在我决定选择创建原始预制件和门颜色，或者着色器变成粉红色。这是资源中原始预制件和变体的屏幕截图。粉红色的
forms - Symfony2 form_label 原始
我想呈现原始翻译，所以我决定在 Twig 模板中使用“原始”选项。但它不起作用。例子: {{ form_label(form.sfGuardUserProfile.roules_acceptance)
sqlite - 文字(原始)值作为sqlite中的外键
是否可以在sqlite中制作类似的东西？ FOREIGN KEY(TypeCode, 'ARawValue', IdServeur) REFERENCES OTHERTABLE(TypeCode, T
geolocation - 原始 geoip 数据从何而来？
这个问题是一个更具体问题的一般版本 asked here .但是，这些答案无法使用。问题: geoIP数据的原始来源是什么？许多网站会告诉我我的 IP 在哪里，但它们似乎都在使用来自不到 5 家公
docker - Openshift/原始-基于Wildfly创建图像
对于Openshift:如何基于Wildfly创建docker镜像？这是使用的Dockerfile: FROM openshift/wildfly-101-centos7 # Install exa
Groovy 原始 double 算术
结果是 127 double middle = 255 / 2 虽然这产生了 127.5 Double middle = 255 / 2 同时这也会产生 127.5 double middle = (
delphi - 以编程方式逐个像素地交换小位图(原始)的颜色
在此处下载带有已编译可执行文件的源代码(大小:161 KB(165,230 字节)):http://www.eyeClaxton.com/download/delphi/ColorSwap.zip 原
string - 有没有办法在lua(原始)中定义自动转义字符串？
以下几行是我需要在 lua 中使用的任意正则表达式。 ['\";=] !^(?:(?:[a-z]{3,10}\s+(?:\w{3,7}?://[\w\-\./]*(?::\d+)?)?/[^?#]*(
geolocation - 原始 geoip 数据从何而来？
这个问题是一个更具体问题的一般版本 asked here .但是，这些答案无法使用。问题: geoIP数据的原始来源是什么？许多网站会告诉我我的 IP 在哪里，但它们似乎都在使用来自不到 5 家公
api - 原始.M数组字符串？以相同的结构响应http请求
我正在使用GoLang做服务器api，试图管理和回答所发出的请求。使用net/http和github.com/gorilla/mux。收到请求时，我使用以下结构创建响应: type Response
c++ - 原始 static_vector 实现中可能未定义的行为
tl; dr:我认为我的 static_vector 有未定义的行为，但我找不到它。这个问题是在 Microsoft Visual C++ 17 上。我有这个简单且未完成的 static_vecto
awk - 原始 awk 源代码的旧版本存档？
我试图找到原始 Awk (a/k/a One True Awk) 源代码的“历史”版本。我找到了 Kernighan's occasionally-updated site ，它似乎总是链接到最新版本
Python 原始 IPv6 套接字错误
我在 python 中使用原始 IPv6 套接字时遇到一些问题。我通过以下方式连接: if self._socket != None: # Close out old sock

行者123

个人简介

我是一名优秀的程序员,十分优秀！

作者热门文章

滴滴打车优惠券免费领取

全站热门文章

首页

博学

6Ren·AI

商城

ios - GNAudioSourceMic原始音频位置