gpt4 book ai didi

hadoop - 解析参数错误,amazon aws emr

转载 作者:可可西里 更新时间:2023-11-01 15:29:00 30 4
gpt4 key购买 nike

我正在尝试通过 Linux 控制台创建一个步骤:

aws emr add-steps --cluster-id j-XXXXXXXXXX --steps Type=CUSTOM_JAR,Name="S3DistCp step",Jar=/home/hadoop/lib/emr-s3distcp-1.0.jar,\ 
Args=["--s3Endpoint,s3-eu-west-1.amazonaws.com","--src,s3://folder-name/logs/j-XXXXXXXXXX/node/","--dest,hdfs:///output","--srcPattern,.*[a-zA-Z,]+"]

我跳了下面的错误

Error parsing parameter '--steps': Expected: ',', received: '+' for input

我该如何解决?

我正在寻找将多个文件上传到 S3 和 S3DistCp 的解决方案,Hive 为 Amazon EMR 收集这些文件。还有其他办法吗?

我还有一个问题:现在我正在创建一个 SSH 隧道来连接到 Hive,我如何连接 PHP?


目前我已经通过删除“src Pattern”解决了错误,但是给了我另一个错误,我在下面包含了图片

Image error

这是出现的错误

INFO Synchronously wait child process to complete : hadoop jar /var/lib/aws/emr/step-runner/hadoop- 
INFO waitProcessCompletion ended with exit code 1 : hadoop jar
/var/lib/aws/emr/step-runner/hadoop-
INFO total process run time: 2 seconds
2016-07-12T14:26:48.744Z INFO Step created jobs:
2016-07-12T14:26:48.744Z WARN Step failed with exitCode 1 and took 2 seconds

谢谢!!!

最佳答案

尝试JSON配置

[
{
"Name":"S3DistCp step",
"Args":["s3-dist-cp","--s3Endpoint=s3.amazonaws.com","--src=s3://mybucket/logs/j-3GYXXXXXX9IOJ/node/","--dest=hdfs:///output","--srcPattern=.*[a-zA-Z,]+"],
"ActionOnFailure":"CONTINUE",
"Type":"CUSTOM_JAR",
"Jar":"command-runner.jar"
}
]

aws emr add-steps --cluster-id j-3GYXXXXXX9IOK --steps file://./myStep.json

http://docs.aws.amazon.com/emr/latest/ReleaseGuide/UsingEMR_s3distcp.html#UsingEMR_s3distcp.step

关于hadoop - 解析参数错误,amazon aws emr,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/38325770/

30 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com