sql - Redshift 光谱 : how to import only certain files-6ren

sql - Redshift 光谱 : how to import only certain files

转载作者：行者123 更新时间：2023-12-04 15:42:24

30

4

使用 Redshift 光谱时，您似乎只能导入提供位置直到文件夹的数据，并且导入文件夹内的所有文件。

有没有办法从包含多个文件的文件夹中导入仅导入一个文件。当提供带有 filename 的完整路径时，我认为它将文件视为 list 文件并给出错误: list 太大或不支持 JSON。

有没有其他办法？

最佳答案

您无意中回答了自己的问题:使用 list 文件

来自 CREATE EXTERNAL TABLE - Amazon Redshift :

LOCATION { 's3://bucket/folder/' | 's3://bucket/manifest_file' }

The path to the Amazon S3 bucket or folder that contains the data files or a manifest file that contains a list of Amazon S3 object paths. The buckets must be in the same AWS Region as the Amazon Redshift cluster.

If the path specifies a manifest file, the s3://bucket/manifest_file argument must explicitly reference a single file—for example,'s3://mybucket/manifest.txt'. It can't reference a key prefix.

The manifest is a text file in JSON format that lists the URL of each file that is to be loaded from Amazon S3 and the size of the file, in bytes. The URL includes the bucket name and full object path for the file. The files that are specified in the manifest can be in different buckets, but all the buckets must be in the same AWS Region as the Amazon Redshift cluster.

我不确定为什么它需要每个文件的长度。它可用于在多个节点之间分配工作负载。

关于sql - Redshift 光谱 : how to import only certain files，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/57337676/

30

4

0

文章推荐： rest - 微服务 "JOIN"不同数据库内的表和数据复制

文章推荐： reactjs - React Redux 与 use-global-hook？哪个更好？

sql - Redshift 光谱 : how to import only certain files
使用 Redshift 光谱时，您似乎只能导入提供位置直到文件夹的数据，并且导入文件夹内的所有文件。有没有办法从包含多个文件的文件夹中导入仅导入一个文件。当提供带有 filename 的完整路径时，
amazon-web-services - 雅典娜 vs Redshift 光谱
我正在评估 Athena 和 Redshift Spectrum。两者都有相同的目的，Spectrum 需要一个 Redshift 集群，而 Athena 是纯粹的无服务器集群。 Athena 使用
amazon-s3 - Redshift 光谱 : Automatically partition tables by date/folder
我们目前生成每日 CSV 导出，并将其上传到 S3 存储桶，结构如下: |--reportDate- |-- part0.csv.gz |-- part1.csv.gz 我们希望能够
amazon-web-services - Redshift 光谱 : Query Anonymous JSON array structure
我在 S3 中有一个 JSON 结构数组，它已被 Glue 成功抓取和编目。 [{"key":"value"}, {"key":"value"}] 我正在使用自定义分类器: $[*] 然而，当尝试从

首页

博学

6Ren·AI

商城

sql - Redshift 光谱 : how to import only certain files