Non-disruptive way of resetting the pod restart count in Kubernetes




Currently our monitoring is designed in such a way that it alerts if any pod restarts more than 50 times.


This is an example of the alert we get:


summary = More than 50 restarts in pod xxx on cluster xxx


In some situations, because of planned maintenance activities, specific application pods get restarted, the restart count goes above 50, and we subsequently get alerted.


This alert stays active until the count resets to 0.


So for non-prod environments we delete the pod with more than 50 restarts; the Deployment then creates a new one whose restart count starts at 0, and we are all happy.
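
For illustration, that workaround is just a plain pod deletion (the pod and namespace names below are placeholders):

    kubectl delete pod <pod-name> -n <namespace>

The owning Deployment immediately schedules a replacement pod, whose restart count starts at 0.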


But we don't have the leverage to perform the same destructive operation of deleting a pod in production. If we don't do it, the restart count stays above 50 and the alert keeps firing, and there is a good chance we miss a genuine alert in the meantime.


How can we overcome this? I assume this is a problem everyone in the Kubernetes world faces.


This is the Prometheus metric we use to track the restart count:


kube_pod_container_status_restarts_total > 50
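
For context, a minimal sketch of how that expression might be wired into a Prometheus alerting rule; the group name, alert name, for: duration and severity label below are illustrative assumptions, not our actual rule:

    groups:
      - name: pod-restarts
        rules:
          - alert: PodRestartCountHigh
            expr: kube_pod_container_status_restarts_total > 50
            for: 5m                 # assumed hold duration
            labels:
              severity: warning     # assumed
            annotations:
              summary: "More than 50 restarts in pod {{ $labels.pod }} on cluster xxx"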


I tried looking through the Kubernetes documentation for a way to reset the pod counter directly in the etcd database, but that does not seem like a recommended approach.


How can we overcome this? What is the best possible approach?


More answers

Why do you consider deleting a pod "destructive"? Normally its controlling Deployment or StatefulSet will recreate it immediately. I might suggest tooling like kubectl rollout restart deployment ... which will delete and recreate all of a Deployment's Pods in one fell swoop.
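
Spelled out with placeholder names, that would be something like:

    kubectl rollout restart deployment <deployment-name> -n <namespace>

Each pod created by the restart starts again with a restart count of 0.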

@DavidMaze As we all know, any change in production that impacts app availability goes through a lot of approvals and scrutiny. In this case, if we try to do a rollout restart, that is technically a new deployment in production, which practically makes it impossible.

Assuming you have more than one replica (you do, right?) neither deleting individual Pods nor kubectl rollout restart should significantly affect availability. The Service will continue to route requests to the Pods that are still running. The Deployment-restart sequence will only recreate some of the Pods at a time, waiting to destroy old ones until new ones pass their liveness and readiness probes.
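
The pace of that replacement is governed by the Deployment's rolling-update strategy; a sketch of the relevant spec fields, with illustrative values:

    spec:
      replicas: 3                # assumes multiple replicas, as noted above
      strategy:
        type: RollingUpdate
        rollingUpdate:
          maxUnavailable: 0      # never remove an old pod before its replacement is Ready
          maxSurge: 1            # create at most one extra pod at a time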

Keep in mind that in Kubernetes a pod restart is equivalent to a pod delete. Pods are supposed to be mortal and temporary.

Recommended answer

This can only be accomplished by restarting the pod.


Also, a feature request related to this has been rejected:


https://github.com/kubernetes/kubernetes/issues/50375
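
Once a pod has been recreated, its counter starts from 0 again; a quick way to confirm (the pod and namespace names are placeholders):

    kubectl get pod <pod-name> -n <namespace>    # the RESTARTS column should read 0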

