kubernetes - How to set GPU resource requirements on a container using the fabric8 Kubernetes Java client API

Reposted by: 行者123. Last updated: 2023-12-02 11:53:44


I wrote an example with the fabric8 Kubernetes Java client API that sets a GPU resource request on a container. It fails at runtime with the following error:

spec.containers[0].resources.requests[gpu]: Invalid value: "gpu": must be a standard resource type or fully qualified, 
spec.containers[0].resources.requests[gpu]: Invalid value: "gpu": must be a standard resource for containers.

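The fabric8 jar version is 4.3.0 (the latest). So far it looks as though fabric8 does not support GPU resource requests, because the example works once I remove the line addToRequests("gpu", new Quantity("1")).

So how can I enable GPU resource requests in a Java/Scala application?

The full source code of the example is as follows: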

/**
 * Copyright (C) 2015 Red Hat, Inc.
 *
 * Licensed under the Apache License, Version 2.0 (the "License");
 * you may not use this file except in compliance with the License.
 * You may obtain a copy of the License at
 *
 *         http://www.apache.org/licenses/LICENSE-2.0
 *
 * Unless required by applicable law or agreed to in writing, software
 * distributed under the License is distributed on an "AS IS" BASIS,
 * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
 * See the License for the specific language governing permissions and
 * limitations under the License.
 */
package com.exam.docker.kubernetes.examples;


import io.fabric8.kubernetes.api.model.*;
import io.fabric8.kubernetes.client.*;
import io.fabric8.kubernetes.client.Config;
import io.fabric8.kubernetes.client.ConfigBuilder;
import org.slf4j.Logger;
import org.slf4j.LoggerFactory;

import java.util.UUID;
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.TimeUnit;


public class PodResExamples {

  private static final Logger logger = LoggerFactory.getLogger(PodResExamples.class);

  public static void main(String[] args) {
    String master = "http://127.0.0.1:8080/";
    if (args.length == 1) {
      master = args[0];
    }
    String ns = "thisisatest";
    String serviceName = "cuda-vector-add-"+ UUID.randomUUID();

    Config config = new ConfigBuilder().withMasterUrl(master).build();
    try (KubernetesClient client = new DefaultKubernetesClient(config)) {
      try {
        if(client.namespaces().withName(ns).get() == null) {
          log("Create namespace:", client.namespaces().create(new NamespaceBuilder().withNewMetadata().withName(ns).endMetadata().build()));
        }

        String imageStr = "k8s.gcr.io/cuda-vector-add:v0.1";
        String cmd = "";

        final ResourceRequirements resources = new ResourceRequirementsBuilder()
                .addToRequests("cpu", new Quantity("2"))
                .addToRequests("memory", new Quantity("10Gi"))
                .addToRequests("gpu", new Quantity("1"))
                .build();

        Container container = new ContainerBuilder().withName(serviceName)
                .withImage(imageStr).withImagePullPolicy("IfNotPresent")
                .withArgs(cmd)
                .withResources(resources)
                .build();

        Pod createdPod = client.pods().inNamespace(ns).createNew()
                .withNewMetadata()
                .withName(serviceName)
                .addToLabels("podres", "cuda-vector")
                .endMetadata()
                .withNewSpec()
                .addToContainers(container)
                .withRestartPolicy("Never")
                .endSpec().done();
        log("Created pod cuda-vector-add:", createdPod);

        final CountDownLatch watchLatch = new CountDownLatch(1);
        try (final Watch ignored = client.pods().inNamespace(ns).withLabel("podres").watch(new Watcher<Pod>() {
          @Override
          public void eventReceived(final Action action, Pod pod) {
            if (pod.getStatus().getPhase().equals("Succeeded")) {
              logger.info("Pod cuda-vector is completed!");
              logger.info(client.pods().inNamespace(ns).withName(pod.getMetadata().getName()).getLog());
              watchLatch.countDown();
            } else if (pod.getStatus().getPhase().equals("Pending")) {
              logger.info("Pod cuda-vector is Pending!");                  
            }
          }

          @Override
          public void onClose(final KubernetesClientException e) {
            logger.info("Cleaning up pod.");
          }
        })) {
          watchLatch.await(30, TimeUnit.SECONDS);
        } catch (final KubernetesClientException | InterruptedException e) {
          e.printStackTrace();
          logger.error("Could not watch pod", e);
        }

      } catch (KubernetesClientException e) {
        logger.error(e.getMessage(), e);
      } finally {
        log("Pod cuda-vector log: \n", client.pods().inNamespace(ns).withName(serviceName).getLog());
        client.namespaces().withName(ns).delete();
      }
    }
  }

  private static void log(String action, Object obj) {
    logger.info("{}: {}", action, obj);
  }

  private static void log(String action) {
    logger.info(action);
  }

}

Best answer

According to the Kubernetes docs, you can try using nvidia.com/gpu instead of gpu:

apiVersion: v1
kind: Pod
metadata:
  name: cuda-vector-add
spec:
  containers:
    - name: cuda-vector-add
      image: "k8s.gcr.io/cuda-vector-add:v0.1"
      resources:
        limits:
          nvidia.com/gpu: 1 # requesting 1 GPU
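
The error in the question says the name must be "a standard resource type or fully qualified". The sketch below is a simplified model of that rule in plain Java, not the actual API server validation: the class name, the method name, and the partial list of standard names are assumptions made for illustration.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.Set;

// Simplified model of the name rule quoted in the error message.
// This is NOT the real Kubernetes validation code.
final class ResourceNameCheck {

    // Assumption: a partial list of built-in container resource names.
    private static final Set<String> STANDARD =
            new HashSet<>(Arrays.asList("cpu", "memory", "ephemeral-storage"));

    static boolean isAccepted(String name) {
        if (name == null || name.isEmpty()) {
            return false;
        }
        // Built-in names are accepted without a prefix.
        if (STANDARD.contains(name) || name.startsWith("hugepages-")) {
            return true;
        }
        // Anything else must be fully qualified: "<vendor-domain>/<resource>".
        int slash = name.indexOf('/');
        return slash > 0 && slash < name.length() - 1;
    }

    public static void main(String[] args) {
        String[] names = {"cpu", "gpu", "nvidia.com/gpu", "amd.com/gpu"};
        for (String name : names) {
            System.out.println(name + " -> " + (isAccepted(name) ? "accepted" : "rejected"));
        }
    }
}
```

Under this model the bare name gpu is rejected because it is neither built in nor prefixed with a vendor domain, while nvidia.com/gpu passes.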

If your application uses AMD GPUs instead, try amd.com/gpu.

Important: you cannot set a GPU request unless you also set a limit equal to the request.
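
In the question's builder code, this presumably means replacing addToRequests("gpu", new Quantity("1")) with addToLimits("nvidia.com/gpu", new Quantity("1")) on the ResourceRequirementsBuilder; that change is untested here against fabric8 4.3.0. The request/limit rule itself can be sketched without any client library. The class and method names below are hypothetical, and quantities are compared as plain strings, which is a simplification of how Kubernetes compares quantities.

```java
import java.util.Collections;
import java.util.Map;

// Simplified model of the GPU request/limit rule stated in the answer.
// Not the real Kubernetes validation; quantities are compared as strings.
final class GpuRequestRule {

    static final String GPU = "nvidia.com/gpu";

    static boolean isValid(Map<String, String> requests, Map<String, String> limits) {
        String request = requests.get(GPU);
        String limit = limits.get(GPU);
        if (request == null) {
            // No GPU request: a limit on its own (or no GPU at all) is fine.
            return true;
        }
        // A GPU request is only valid together with an equal limit.
        return request.equals(limit);
    }

    public static void main(String[] args) {
        Map<String, String> none = Collections.emptyMap();
        Map<String, String> one = Collections.singletonMap(GPU, "1");
        Map<String, String> two = Collections.singletonMap(GPU, "2");

        System.out.println("limit only:       " + isValid(none, one));
        System.out.println("request only:     " + isValid(one, none));
        System.out.println("request == limit: " + isValid(one, one));
        System.out.println("request != limit: " + isValid(one, two));
    }
}
```

Setting only the limit is the simplest form that satisfies the rule, which is why the YAML in the answer lists nvidia.com/gpu under limits alone.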

Regarding kubernetes - How to set GPU resource requirements on a container using the fabric8 Kubernetes Java client API, we found a similar question on Stack Overflow: https://stackoverflow.com/questions/56747571/
