
windows - Service Fabric multi-node X509 cluster - Timed out waiting for Installer Service to complete

Reposted. Author: 行者123. Updated: 2023-12-03 04:25:01

To build an Azure Service Fabric (SF) test environment, I created three Azure VMs in DevTest Labs. The cluster is to be secured with X509 certificates.

I followed the information here and here.

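The machines are:

  • Windows 2016 Datacenter
  • On the same virtual network
  • Running with all firewalls disabled (each machine can be pinged from the others)
  • All using the same administrator account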

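I created the self-signed certificates with the certsetup.ps1 file supplied with the documentation. As suggested, a single certificate is used for both server and cluster.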

When I run TestConfiguration.ps1, I get the following output.

LocalAdminPrivilege        : True
IsJsonValid : True
IsCabValid :
RequiredPortsOpen : True
RemoteRegistryAvailable : True
FirewallAvailable : True
RpcCheckPassed : True
NoConflictingInstallations : True
FabricInstallable : True
DataDrivesAvailable : True
Passed : True

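The IsCabValid field is clearly empty, but the Passed field still indicates that installation can go ahead. I went on to run the next PowerShell command to start the installation.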

.\CreateServiceFabricCluster.ps1 -ClusterConfigFilePath .\ClusterConfig.X509.MultiMachine.json

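After the command above is run, the process starts and the console window fills with the following text, which suggests that communication between the nodes is working.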

Creating Service Fabric Cluster...
If it's taking too long, please check in Task Manager details and see if Fabric.exe for each node is running. If not, please look at: 1. traces in DeploymentTraces directory and 2. traces in FabricLogRoot configured in ClusterConfig.json.
Trace folder already exists. Traces will be written to existing trace folder: C:\StandaloneCluster\DeploymentTraces
Running Best Practices Analyzer...
Best Practices Analyzer completed successfully.
Creating Service Fabric Cluster...
Processing and validating cluster config.
Configuring nodes.
Default installation directory chosen based on system drive of machine '10.0.0.4'.
Copying installer to all machines.
Configuring machine '10.0.0.4'.
Configuring machine '10.0.0.5'.
Configuring machine '10.0.0.6'.
Machine 10.0.0.6 configured.
Machine 10.0.0.5 configured.
Machine 10.0.0.4 configured.
Running Fabric service installation.
Successfully started FabricInstallerSvc on machine 10.0.0.4
Successfully started FabricInstallerSvc on machine 10.0.0.6
Successfully started FabricInstallerSvc on machine 10.0.0.5

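A long pause of several minutes follows, and then a timeout error appears that does not really say why. I have searched the Windows logs on the nodes but could not find any further information. The PowerShell console shows the following error: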

 Timed out waiting for Installer Service to complete for machine 10.0.0.4. Investigation order: FabricInstallerService -> FabricSetup -> FabricDeployer -> Fabric
Timed out waiting for Installer Service to complete for machine 10.0.0.6. Investigation order: FabricInstallerService -> FabricSetup -> FabricDeployer -> Fabric
Timed out waiting for Installer Service to complete for machine 10.0.0.5. Investigation order: FabricInstallerService -> FabricSetup -> FabricDeployer -> Fabric
CreateCluster Error: System.AggregateException: One or more errors occurred. ---> System.ServiceProcess.TimeoutException: Timed out waiting for Installer Service to complete for machine 10.0.0.5. Investigation order: FabricInstallerService -> FabricSetup -> FabricDeployer -> Fabric
at Microsoft.ServiceFabric.DeploymentManager.DeploymentManagerInternal.StartAndValidateInstallerServiceCompletion(String machineName, ServiceController installerSvc)
at System.Threading.Tasks.Parallel.<>c__DisplayClass17_0`1.<ForWorker>b__1()
at System.Threading.Tasks.Task.InnerInvokeWithArg(Task childTask)
at System.Threading.Tasks.Task.<>c__DisplayClass176_0.<ExecuteSelfReplicating>b__0(Object )
--- End of inner exception stack trace ---
at System.Threading.Tasks.Task.ThrowIfExceptional(Boolean includeTaskCanceledExceptions)
at System.Threading.Tasks.Task.Wait(Int32 millisecondsTimeout, CancellationToken cancellationToken)
at System.Threading.Tasks.Parallel.ForWorker[TLocal](Int32 fromInclusive, Int32 toExclusive, ParallelOptions parallelOptions, Action`1 body, Action`2 bodyWithState, Func`4 bodyWithLocal, Func`1 localInit, Action`1 localFinally)
at System.Threading.Tasks.Parallel.ForEachWorker[TSource,TLocal](IEnumerable`1 source, ParallelOptions parallelOptions, Action`1 body, Action`2 bodyWithState, Action`3 bodyWithStateAndIndex, Func`4 bodyWithStateAndLocal, Func`5 bodyWithEverything, Func`1 localInit, Action`1 localFinally)
at System.Threading.Tasks.Parallel.ForEach[TSource](IEnumerable`1 source, Action`1 body)
at Microsoft.ServiceFabric.DeploymentManager.DeploymentManagerInternal.RunFabricServices(List`1 machines, FabricPackageType fabricPackageType)
at Microsoft.ServiceFabric.DeploymentManager.DeploymentManagerInternal.<CreateClusterAsyncInternal>d__7.MoveNext()
---> (Inner Exception #0) System.ServiceProcess.TimeoutException: Timed out waiting for Installer Service to complete for machine 10.0.0.5. Investigation order: FabricInstallerService -> FabricSetup -> FabricDeployer -> Fabric
at Microsoft.ServiceFabric.DeploymentManager.DeploymentManagerInternal.StartAndValidateInstallerServiceCompletion(String machineName, ServiceController installerSvc)
at System.Threading.Tasks.Parallel.<>c__DisplayClass17_0`1.<ForWorker>b__1()
at System.Threading.Tasks.Task.InnerInvokeWithArg(Task childTask)
at System.Threading.Tasks.Task.<>c__DisplayClass176_0.<ExecuteSelfReplicating>b__0(Object )<---

---> (Inner Exception #1) System.ServiceProcess.TimeoutException: Timed out waiting for Installer Service to complete for machine 10.0.0.6. Investigation order: FabricInstallerService -> FabricSetup -> FabricDeployer -> Fabric
at Microsoft.ServiceFabric.DeploymentManager.DeploymentManagerInternal.StartAndValidateInstallerServiceCompletion(String machineName, ServiceController installerSvc)
at System.Threading.Tasks.Parallel.<>c__DisplayClass17_0`1.<ForWorker>b__1()
at System.Threading.Tasks.Task.InnerInvokeWithArg(Task childTask)
at System.Threading.Tasks.Task.<>c__DisplayClass176_0.<ExecuteSelfReplicating>b__0(Object )<---

---> (Inner Exception #2) System.ServiceProcess.TimeoutException: Timed out waiting for Installer Service to complete for machine 10.0.0.4. Investigation order: FabricInstallerService -> FabricSetup -> FabricDeployer -> Fabric
at Microsoft.ServiceFabric.DeploymentManager.DeploymentManagerInternal.StartAndValidateInstallerServiceCompletion(String machineName, ServiceController installerSvc)
at System.Threading.Tasks.Parallel.<>c__DisplayClass17_0`1.<ForWorker>b__1()
at System.Threading.Tasks.Task.InnerInvokeWithArg(Task childTask)
at System.Threading.Tasks.Task.<>c__DisplayClass176_0.<ExecuteSelfReplicating>b__0(Object )<---

Trace folder already exists. Traces will be written to existing trace folder: C:\StandaloneCluster\DeploymentTraces
Cleaning up faulted installation.
Removing configuration from machine 10.0.0.5
Removing configuration from machine 10.0.0.4
Removing configuration from machine 10.0.0.6

Can any Azure SF enthusiast shed light on this issue, or suggest where I am going wrong?

Best answer

This is the generic failure mode that appears when FabricHost fails to start, and it can have several causes.

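Since you are using raw Azure VMs rather than an SF VMSS deployment, you must also make sure that the upstream ports set under NodeType in the cluster config are open on each machine. To test that this is set up correctly, first try deploying an unsecured cluster to these VMs.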
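To see which ports the NodeType section actually asks for, one option is to list them straight from the cluster config. The following is only a sketch and not part of the Service Fabric tooling: `Get-NodeTypePort` is a hypothetical helper, and it assumes the config keeps its node types under `properties.nodeTypes` with single-port settings whose names end in `Port`. Port ranges (for example application or ephemeral ports) are not listed and would need to be checked separately.

```powershell
# Hypothetical helper: lists every single-port setting found under each node type.
# Assumes the node types live under properties.nodeTypes in the cluster config JSON.
function Get-NodeTypePort {
    param([Parameter(Mandatory)][string]$ConfigJson)

    $config = $ConfigJson | ConvertFrom-Json
    foreach ($nodeType in $config.properties.nodeTypes) {
        foreach ($prop in $nodeType.PSObject.Properties) {
            # Keep settings named "...Port" whose value is a plain number.
            if ($prop.Name -like '*Port' -and $prop.Value -match '^\d+$') {
                [pscustomobject]@{
                    NodeType = $nodeType.name
                    Setting  = $prop.Name
                    Port     = [int]$prop.Value
                }
            }
        }
    }
}

# Usage: run from the folder that holds the cluster config used in the question.
$configPath = '.\ClusterConfig.X509.MultiMachine.json'
if (Test-Path $configPath) {
    Get-NodeTypePort -ConfigJson (Get-Content -Path $configPath -Raw) | Format-Table
}
```

The resulting list can then be compared against the inbound rules that apply to each VM.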

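If the unsecured deployment works, investigate the secured one by running the deployment with the -NoCleanupOnFailure flag, then check the event logs under "Applications and Services Logs > Microsoft-Service Fabric > Admin" on one of the failed machines.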
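The same check can be scripted instead of browsing Event Viewer. This is a rough sketch under a few assumptions: the channel name `Microsoft-ServiceFabric/Admin` is assumed to correspond to the Event Viewer path mentioned above (`Get-WinEvent -ListLog '*ServiceFabric*'` can confirm what exists on the node), and `Select-ProblemEvent` and `Get-FabricAdminProblem` are hypothetical helper names.

```powershell
# Step 1 (run from the standalone package folder): keep the failed install in place.
# .\CreateServiceFabricCluster.ps1 -ClusterConfigFilePath .\ClusterConfig.X509.MultiMachine.json -NoCleanupOnFailure

# Hypothetical filter: keeps only Critical (1), Error (2) and Warning (3) records.
filter Select-ProblemEvent {
    if ($_.Level -ge 1 -and $_.Level -le 3) { $_ }
}

# Hypothetical helper: reads the most recent records from the Service Fabric admin channel.
# The log name is an assumption; verify it on the node before relying on it.
function Get-FabricAdminProblem {
    param(
        [string]$ComputerName = 'localhost',
        [int]$MaxEvents = 200
    )

    Get-WinEvent -LogName 'Microsoft-ServiceFabric/Admin' -ComputerName $ComputerName -MaxEvents $MaxEvents |
        Select-ProblemEvent |
        Select-Object TimeCreated, LevelDisplayName, Id, Message
}
```

For example, `Get-FabricAdminProblem -ComputerName 10.0.0.4 | Format-List` would print the recent warnings and errors from the first node in the question.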

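The error and warning entries should indicate whether there was a problem reading the certificate or some other blocking issue. Check that the certificate is ACL'd to give NETWORK SERVICE access on each machine, since this is one of the requirements listed in the doc.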

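Another common failure occurs when the certificate thumbprint contains invalid characters. A bug in the Windows certificate management tool causes the displayed thumbprint to include hidden invalid characters, which cause deployment problems when the value is copied directly into the config. Use a hex editor (for example HxD) to verify that the thumbprint in the config contains only valid characters.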
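As an alternative to a hex editor, the thumbprints in the config can be scanned with a few lines of PowerShell. This is a minimal sketch, not an official tool: `Get-InvalidThumbprintChar` is a hypothetical helper, the regular expression simply looks for any JSON setting whose name contains `Thumbprint`, and spaces are reported as invalid on the assumption that the value should be one contiguous hex string.

```powershell
# Hypothetical helper: reports every character in a thumbprint that is not a hex digit.
function Get-InvalidThumbprintChar {
    param([Parameter(Mandatory)][AllowEmptyString()][string]$Thumbprint)

    $chars = $Thumbprint.ToCharArray()
    for ($i = 0; $i -lt $chars.Length; $i++) {
        if ($chars[$i] -notmatch '^[0-9A-Fa-f]$') {
            [pscustomobject]@{
                Index     = $i
                CodePoint = 'U+{0:X4}' -f [int]$chars[$i]
            }
        }
    }
}

# Usage: scan every "...Thumbprint..." setting in the cluster config from the question.
$configPath = '.\ClusterConfig.X509.MultiMachine.json'
if (Test-Path $configPath) {
    $raw = Get-Content -Path $configPath -Raw -Encoding UTF8
    foreach ($m in [regex]::Matches($raw, '"(\w*Thumbprint\w*)"\s*:\s*"([^"]*)"')) {
        $bad = @(Get-InvalidThumbprintChar -Thumbprint $m.Groups[2].Value)
        [pscustomobject]@{
            Setting = $m.Groups[1].Value
            Clean   = ($bad.Count -eq 0)
            Invalid = ($bad.CodePoint -join ', ')
        }
    }
}
```

Any entry with `Clean` set to `False` lists the code points of the offending characters, so an invisible character shows up explicitly instead of hiding inside the string.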

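If this does not give you enough information to find the problem, run the log collector tool from Tools\Microsoft.Azure.ServiceFabric.WindowsServer.SupportPackage.zip, which is included in the Standalone package, and upload the collected logs to a storage location of your choice so they can be shared with our team. You can email the link to [email protected] and we can help you investigate the issue.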

Regarding windows - Service Fabric multi-node X509 cluster - Timed out waiting for Installer Service to complete, a similar question can be found on Stack Overflow: https://stackoverflow.com/questions/44281825/
