
hadoop - Does a user need to exist on all nodes to be recognized by the Hadoop cluster/HDFS?

Reposted. Author: 行者123. Updated: 2023-12-02 20:25:07

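In MapR Hadoop, for a user to access HDFS or run applications on YARN, the user must exist on every node of the cluster (with the same uid and gid), including client nodes that do not act as data nodes or control nodes (MapR has no real concept of a NameNode). Is the same true for Hortonworks HDP?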

Best answer

I found the following answer on the Hortonworks community site:

User should not have account on all the nodes of the cluster. He should only have account on edge node.

For a new user there are 2 types of directories we need to create before the user accesses the cluster.

1- User home directory [directory created on Linux Filesystem ie. /home/]

2- User HDFS directory [directory created on HDFS filesystem ie. /user/]

...you only need to create HDFS home directory[ie. /user/] on edge node [not sure the meaning here since HDFS does not seem to have anything to do with any particular edge node]. You can still run jobs with the new user on cluster, even if you haven't created his home directory in linux.
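As a rough illustration of the HDFS home directory step quoted above, the sketch below only builds the two `hdfs dfs` commands that are typically involved (`-mkdir -p` and `-chown`). The helper name `hdfs_home_commands` and the user name `alice` are hypothetical, and the group is assumed to match the user name unless one is passed in. On a real cluster these commands usually have to be run as an HDFS superuser, which is not shown here.

```python
import shlex


def hdfs_home_commands(user, group=None, base="/user"):
    """Return the HDFS shell commands that create a user's HDFS home
    directory and hand ownership to that user.

    Hypothetical helper: it only builds the argument lists, it does not
    run anything against a cluster.
    """
    group = group or user  # assumption: primary group named after the user
    home = f"{base}/{user}"
    return [
        ["hdfs", "dfs", "-mkdir", "-p", home],
        ["hdfs", "dfs", "-chown", f"{user}:{group}", home],
    ]


if __name__ == "__main__":
    # Print the commands so they can be reviewed before being run by hand.
    for cmd in hdfs_home_commands("alice"):
        print(shlex.join(cmd))
```

Because HDFS is a single distributed namespace, these commands can be issued from any host with a configured HDFS client; nothing about them is specific to one edge node.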


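Update:
Based on a comment from user @cricket_007, it appears that the user must also exist on the NameNode server. The closest thing to documentation I could find that states this explicitly says: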

Each file or directory operation passes the full path name to the NameNode, and the permissions checks are applied along the path for each operation. The client framework will implicitly associate the user identity with the connection to the NameNode, reducing the need for changes to the existing client API. [...] For instance, when the client first begins reading a file, it makes a first request to the NameNode to discover the location of the first blocks of the file.
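To make the "permission checks are applied along the path" part of the quote more concrete, here is a deliberately simplified model of a POSIX-style check done on the NameNode side. It is not Hadoop's implementation: the `Inode` class, the `allowed` and `can_read` functions, and the sample tree are all made up for illustration, and ACLs, superusers, and sticky bits are ignored. What it shows is that the outcome depends on the user name and the groups that the checking side resolves for that user, which is consistent with the comment that the NameNode host needs to know the user.

```python
from dataclasses import dataclass


@dataclass
class Inode:
    owner: str
    group: str
    mode: int  # POSIX-style permission bits, e.g. 0o750


def allowed(inode, user, groups, want):
    """Check a permission bitmask (4=read, 2=write, 1=execute) for a user."""
    if user == inode.owner:
        bits = (inode.mode >> 6) & 7
    elif inode.group in groups:
        bits = (inode.mode >> 3) & 7
    else:
        bits = inode.mode & 7
    return (bits & want) == want


def can_read(tree, path, user, groups):
    """Require execute (traverse) on every ancestor directory and read on
    the final path component."""
    parts = [p for p in path.split("/") if p]
    ancestors = ["/"] + ["/" + "/".join(parts[:i]) for i in range(1, len(parts))]
    for directory in ancestors:
        if not allowed(tree[directory], user, groups, 1):
            return False
    return allowed(tree[path], user, groups, 4)


# Hypothetical namespace: every name below is invented for the example.
tree = {
    "/": Inode("hdfs", "hdfs", 0o755),
    "/user": Inode("hdfs", "hdfs", 0o755),
    "/user/alice": Inode("alice", "analysts", 0o750),
    "/user/alice/data.txt": Inode("alice", "analysts", 0o640),
}

if __name__ == "__main__":
    target = "/user/alice/data.txt"
    print(can_read(tree, target, "alice", set()))
    print(can_read(tree, target, "bob", {"analysts"}))
    print(can_read(tree, target, "bob", set()))
```

In this model, `bob` can read the file only when the checking side sees him as a member of `analysts`. If the host doing the check cannot resolve his groups, he falls through to the "other" bits and is denied at `/user/alice`.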

For more on "hadoop - Does a user need to exist on all nodes to be recognized by the Hadoop cluster/HDFS?", see the similar question on Stack Overflow: https://stackoverflow.com/questions/57319080/
