gpt4 book ai didi

elasticsearch - 使用Elasticsearch File System Crawler将pdf文件索引到AWS Elasticsearch Service

转载 作者:行者123 更新时间:2023-12-02 23:59:25 24 4
gpt4 key购买 nike

我可以使用Elasticsearch File System Crawler将pdf文件索引到本地Elasticsearch。 fscrawler默认设置具有端口,主机和方案参数,如下所示。

{
"name" : "job_name2",
"fs" : {
"url" : "/tmp/es",
"update_rate" : "15m",
"excludes" : [ "~*" ],
"json_support" : false,
"filename_as_id" : false,
"add_filesize" : true,
"remove_deleted" : true,
"add_as_inner_object" : false,
"store_source" : false,
"index_content" : true,
"attributes_support" : false,
"raw_metadata" : true,
"xml_support" : false,
"index_folders" : true,
"lang_detect" : false,
"continue_on_error" : false,
"pdf_ocr" : true,
"ocr" : {
"language" : "eng"
}
},
"elasticsearch" : {
"nodes" : [ {
"host" : "127.0.0.1",
"port" : 9200,
"scheme" : "HTTP"
} ],
"bulk_size" : 100,
"flush_interval" : "5s"
},
"rest" : {
"scheme" : "HTTP",
"host" : "127.0.0.1",
"port" : 8080,
"endpoint" : "fscrawler"
}
}

但是,我很难用它来索引AWS Elasticsearch服务,因为要索引到AWS Elasticsearch服务,我必须提供 here所记录的AWS_ACCESS_KEY,AWS_SECRET_KEY,区域和服务。有关如何将pdf文件索引到AWS Elasticsearch服务的任何帮助是高度赞赏。

最佳答案

FSCrawler不支持尚未提供AWS_ACCESS_KEYAWS_SECRET_KEY
也许值得在FSCrawler中打开一个问题?

就是说,https://cloud.elastic.co受支持,但使用“登录名/密码”。 See documentation for more details

关于elasticsearch - 使用Elasticsearch File System Crawler将pdf文件索引到AWS Elasticsearch Service,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/50451003/

24 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com