
kubernetes - Kubectl rollout restart for statefulset


According to the kubectl docs, kubectl rollout restart works for deployments, daemonsets and statefulsets. It works as expected for a deployment. But for a statefulset it restarts only one of the two pods.

✗ k rollout restart statefulset alertmanager-main                       (playground-fdp/monitoring)
statefulset.apps/alertmanager-main restarted

✗ k rollout status statefulset alertmanager-main (playground-fdp/monitoring)
Waiting for 1 pods to be ready...
Waiting for 1 pods to be ready...
statefulset rolling update complete 2 pods at revision alertmanager-main-59d7ccf598...

✗ kgp -l app=alertmanager (playground-fdp/monitoring)
NAME                  READY   STATUS    RESTARTS   AGE
alertmanager-main-0   2/2     Running   0          21h
alertmanager-main-1   2/2     Running   0          20s

As you can see, the pod alertmanager-main-1 was restarted and its age is 20s. But the other pod in the statefulset, alertmanager-main-0, was not restarted and its age is 21h. Any idea how to restart the statefulset after some configmap used by it has been updated?

[Update 1] Here is the statefulset configuration. As you can see, .spec.updateStrategy.rollingUpdate.partition is not set.
apiVersion: apps/v1
kind: StatefulSet
metadata:
  annotations:
    kubectl.kubernetes.io/last-applied-configuration: |
      {"apiVersion":"monitoring.coreos.com/v1","kind":"Alertmanager","metadata":{"annotations":{},"labels":{"alertmanager":"main"},"name":"main","namespace":"monitoring"},"spec":{"baseImage":"10.47.2.76:80/alm/alertmanager","nodeSelector":{"kubernetes.io/os":"linux"},"replicas":2,"securityContext":{"fsGroup":2000,"runAsNonRoot":true,"runAsUser":1000},"serviceAccountName":"alertmanager-main","version":"v0.19.0"}}
  creationTimestamp: "2019-12-02T07:17:49Z"
  generation: 4
  labels:
    alertmanager: main
  name: alertmanager-main
  namespace: monitoring
  ownerReferences:
  - apiVersion: monitoring.coreos.com/v1
    blockOwnerDeletion: true
    controller: true
    kind: Alertmanager
    name: main
    uid: 3e3bd062-6077-468e-ac51-909b0bce1c32
  resourceVersion: "521307"
  selfLink: /apis/apps/v1/namespaces/monitoring/statefulsets/alertmanager-main
  uid: ed4765bf-395f-4d91-8ec0-4ae23c812a42
spec:
  podManagementPolicy: Parallel
  replicas: 2
  revisionHistoryLimit: 10
  selector:
    matchLabels:
      alertmanager: main
      app: alertmanager
  serviceName: alertmanager-operated
  template:
    metadata:
      creationTimestamp: null
      labels:
        alertmanager: main
        app: alertmanager
    spec:
      containers:
      - args:
        - --config.file=/etc/alertmanager/config/alertmanager.yaml
        - --cluster.listen-address=[$(POD_IP)]:9094
        - --storage.path=/alertmanager
        - --data.retention=120h
        - --web.listen-address=:9093
        - --web.external-url=http://10.47.0.234
        - --web.route-prefix=/
        - --cluster.peer=alertmanager-main-0.alertmanager-operated.monitoring.svc:9094
        - --cluster.peer=alertmanager-main-1.alertmanager-operated.monitoring.svc:9094
        env:
        - name: POD_IP
          valueFrom:
            fieldRef:
              apiVersion: v1
              fieldPath: status.podIP
        image: 10.47.2.76:80/alm/alertmanager:v0.19.0
        imagePullPolicy: IfNotPresent
        livenessProbe:
          failureThreshold: 10
          httpGet:
            path: /-/healthy
            port: web
            scheme: HTTP
          periodSeconds: 10
          successThreshold: 1
          timeoutSeconds: 3
        name: alertmanager
        ports:
        - containerPort: 9093
          name: web
          protocol: TCP
        - containerPort: 9094
          name: mesh-tcp
          protocol: TCP
        - containerPort: 9094
          name: mesh-udp
          protocol: UDP
        readinessProbe:
          failureThreshold: 10
          httpGet:
            path: /-/ready
            port: web
            scheme: HTTP
          initialDelaySeconds: 3
          periodSeconds: 5
          successThreshold: 1
          timeoutSeconds: 3
        resources:
          requests:
            memory: 200Mi
        terminationMessagePath: /dev/termination-log
        terminationMessagePolicy: File
        volumeMounts:
        - mountPath: /etc/alertmanager/config
          name: config-volume
        - mountPath: /alertmanager
          name: alertmanager-main-db
      - args:
        - -webhook-url=http://localhost:9093/-/reload
        - -volume-dir=/etc/alertmanager/config
        image: 10.47.2.76:80/alm/configmap-reload:v0.0.1
        imagePullPolicy: IfNotPresent
        name: config-reloader
        resources:
          limits:
            cpu: 100m
            memory: 25Mi
        terminationMessagePath: /dev/termination-log
        terminationMessagePolicy: File
        volumeMounts:
        - mountPath: /etc/alertmanager/config
          name: config-volume
          readOnly: true
      dnsPolicy: ClusterFirst
      nodeSelector:
        kubernetes.io/os: linux
      restartPolicy: Always
      schedulerName: default-scheduler
      securityContext:
        fsGroup: 2000
        runAsNonRoot: true
        runAsUser: 1000
      serviceAccount: alertmanager-main
      serviceAccountName: alertmanager-main
      terminationGracePeriodSeconds: 120
      volumes:
      - name: config-volume
        secret:
          defaultMode: 420
          secretName: alertmanager-main
      - emptyDir: {}
        name: alertmanager-main-db
  updateStrategy:
    type: RollingUpdate
status:
  collisionCount: 0
  currentReplicas: 2
  currentRevision: alertmanager-main-59d7ccf598
  observedGeneration: 4
  readyReplicas: 2
  replicas: 2
  updateRevision: alertmanager-main-59d7ccf598
  updatedReplicas: 2

Best Answer

You haven't provided the whole scenario. It might depend on the Readiness Probe and on the Update Strategy.
A StatefulSet's rolling update restarts pods in reverse ordinal order, from n-1 down to 0. Details can be found here.
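A quick way to check which update strategy a StatefulSet uses is shown below (a sketch that assumes the resource name and namespace from your manifest):

kubectl get statefulset alertmanager-main -n monitoring -o jsonpath='{.spec.updateStrategy}'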

Reason 1
A StatefulSet has 4 update strategies:

  • On Delete
  • Rolling Updates
  • Partitions
  • Forced Rollback

  • In the Partitions section of the update strategy documentation, you can find the following information:

    If a partition is specified, all Pods with an ordinal that is greater than or equal to the partition will be updated when the StatefulSet’s .spec.template is updated. All Pods with an ordinal that is less than the partition will not be updated, and, even if they are deleted, they will be recreated at the previous version. If a StatefulSet’s .spec.updateStrategy.rollingUpdate.partition is greater than its .spec.replicas, updates to its .spec.template will not be propagated to its Pods. In most cases you will not need to use a partition, but they are useful if you want to stage an update, roll out a canary, or perform a phased roll out.



    So if somewhere in the StatefulSet you set updateStrategy.rollingUpdate.partition: 1, it will restart only the pods with an ordinal of 1 or higher.
    Example with partition: 3
    NAME    READY   STATUS    RESTARTS   AGE
    web-0   1/1     Running   0          30m
    web-1   1/1     Running   0          30m
    web-2   1/1     Running   0          31m
    web-3   1/1     Running   0          2m45s
    web-4   1/1     Running   0          3m
    web-5   1/1     Running   0          3m13s
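
    For reference, a minimal sketch of how that setting would look in the StatefulSet manifest (the value 3 matches the example above):

    updateStrategy:
      type: RollingUpdate
      rollingUpdate:
        partition: 3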

Reason 2
The configuration of the Readiness probe.

If the values of initialDelaySeconds and periodSeconds are high, it may take a while before the other pod is restarted. Details about these parameters can be found here.

In the example below, the pod waits 10 seconds before the readiness probe starts running, and the probe then checks every 2 seconds. Depending on these values, this may be the reason for this behavior.

readinessProbe:
  failureThreshold: 3
  httpGet:
    path: /
    port: 80
    scheme: HTTP
  initialDelaySeconds: 10
  periodSeconds: 2
  successThreshold: 1
  timeoutSeconds: 1
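
One way to observe these timings is to watch the pods while the rollout proceeds (using the label and namespace from your question):

kubectl get pods -l app=alertmanager -n monitoring -w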

Reason 3

I see that you have 2 containers in each pod.
NAME                  READY   STATUS    RESTARTS   AGE
alertmanager-main-0   2/2     Running   0          21h
alertmanager-main-1   2/2     Running   0          20s

As mentioned in the docs:

    Running - The Pod has been bound to a node, and all of the Containers have been created. At least one Container is still running, or is in the process of starting or restarting.



It would be good to check that everything is fine with the containers (readinessProbe/livenessProbe, restarts, etc.).
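
To inspect per-container state and restart counts, something like the following should help (pod and container names are taken from your output and manifest):

kubectl describe pod alertmanager-main-0 -n monitoring
kubectl logs alertmanager-main-0 -c config-reloader -n monitoring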

Regarding kubernetes - Kubectl rollout restart for statefulset, we found a similar question on Stack Overflow: https://stackoverflow.com/questions/59168406/
