Solved: Subject: ALL_SERVICES_ALERTS Danger Event

BillW · ‎04-08-2016

Hi All,

I received this alert and can't find anything in the logs that would back this alert up. I know this is a canned watch. The only thing that comes close is that it seems the CMS auto restarted for some reason. As the Date Modified shows that in the CMC. If the CMS restarts then everything on the server would also restart. The other servers have older Date Modified entries.

What would cause this to happen?

Subject: ALL_SERVICES_ALERTS Danger Event

Danger Rule evaluated to true for "ALL_SERVICES_ALERTS" watch.

Danger Rule: BOProdCluster.APS.Visualization$'Health State'==0 || BOProdCluster.APS.Visualization$'Health State'==5 || BOProdCluster.APS.Analysis$'Health State'==0 || BOProdCluster.APS.Analysis$'Health State'==5 || BOProdCluster.APS.Auditing$'Health State'==0 || BOProdCluster.APS.Auditing$'Health State'==5 || BOProdCluster.APS.Connectivity$'Health State'==0 || BOProdCluster.APS.Core$'Health State'==0 || BOProdCluster.APS.Core$'Health State'==0 || BOProdCluster.APS.DF$'Health State'==0 || BOProdCluster.APS.LCM$'Health State'==0 || BOProdCluster.APS.Monitoring$'Health State'==0 || BOProdCluster.APS.Search$'Health State'==0 || BOProdCluster.APS.WEBI$'Health State'==0 || BOProdCluster.APS.WEBIDSLBridge$'Health State'==0 || BOProdCluster.APS.WEBIDSLBridge1$'Health State'==0 || BOProdCluster.AdaptiveJobServer$'Health State'==0 || BOProdCluster.CentralManagementServer$'Health State'==0 || BOProdCluster.ConnectionServer$'Health State'==0 || BOProdCluster.ConnectionServer32$'Health State'==0 || BOProdCluster.ConnectionServer32$'Health State'==0 || BOProdCluster.InputFileRepository$'Health State'==0 || BOProdCluster.OutputFileRepository$'Health State'==0 || BOProdCluster.WebApplicationContainerServer$'Health State'==0 || BOProdCluster.WebIntelligenceProcessingServer$'Health State'==0 || BOProdCluster.WebIntelligenceProcessingServer1$'Health State'==0 || BOProdCluster.WebIntelligenceProcessingServer2$'Health State'==0 || BOProdCluster.WebIntelligenceProcessingServer3$'Health State'==0 || Cluster58.APS.Analysis$'Health State'==0 || Cluster58.APS.Auditing$'Health State'==0 || Cluster58.APS.Connectivity$'Health State'==0 || Cluster58.APS.Core$'Health State'==0 || Cluster58.APS.DF$'Health State'==0 || Cluster58.APS.LCM$'Health State'==0 || Cluster58.APS.Search$'Health State'==0 || Cluster58.APS.Visualization$'Health State'==0 || Cluster58.APS.WEBI$'Health State'==0 || Cluster58.APS.WEBIDSLBridge$'Health State'==0 || Cluster58.APS.WEBIDSLBridge1$'Health State'==0 || Cluster58.AdaptiveJobServer$'Health State'==0 || Cluster58.CentralManagementServer$'Health State'==0 || Cluster58.ConnectionServer$'Health State'==0 || Cluster58.ConnectionServer32$'Health State'==0 || Cluster58.DashboardsCacheServer$'Health State'==0 || Cluster58.DashboardsProcessingServer$'Health State'==0 || Cluster58.EventServer$'Health State'==0 || Cluster58.InputFileRepository$'Health State'==0 || Cluster58.OutputFileRepository$'Health State'==0 || Cluster58.WebApplicationContainerServer$'Health State'==0 || Cluster58.WebIntelligenceProcessingServer$'Health State'==0 || Cluster58.WebIntelligenceProcessingServer1$'Health State'==0 || Cluster58.WebIntelligenceProcessingServer2$'Health State'==0 || Cluster58.WebIntelligenceProcessingServer3$'Health State'==0 || BOProdCluster.APS.Connectivity$'Health State'==0 || BOProdCluster.APS.Core$'Health State'==0 || BOProdCluster.APS.Core$'Health State'==0 || BOProdCluster.APS.DF$'Health State'==0 || BOProdCluster.APS.LCM$'Health State'==0 || BOProdCluster.APS.Monitoring$'Health State'==0 || BOProdCluster.APS.Search$'Health State'==0 || BOProdCluster.APS.WEBI$'Health State'==0 || BOProdCluster.APS.WEBIDSLBridge$'Health State'==0 || BOProdCluster.APS.WEBIDSLBridge1$'Health State'==0 || BOProdCluster.AdaptiveJobServer$'Health State'==0 || BOProdCluster.CentralManagementServer$'Health State'==0 || BOProdCluster.ConnectionServer$'Health State'==0 || BOProdCluster.ConnectionServer32$'Health State'==0 || BOProdCluster.ConnectionServer32$'Health State'==0 || BOProdCluster.InputFileRepository$'Health State'==0 || BOProdCluster.OutputFileRepository$'Health State'==0 || BOProdCluster.WebApplicationContainerServer$'Health State'==0 || BOProdCluster.WebIntelligenceProcessingServer$'Health State'==0 || BOProdCluster.WebIntelligenceProcessingServer1$'Health State'==0 || BOProdCluster.WebIntelligenceProcessingServer2$'Health State'==0 || BOProdCluster.WebIntelligenceProcessingServer3$'Health State'==0 || Cluster58.APS.Analysis$'Health State'==0 || Cluster58.APS.Auditing$'Health State'==0 || Cluster58.APS.Connectivity$'Health State'==0 || Cluster58.APS.Core$'Health State'==0 || Cluster58.APS.DF$'Health State'==0 || Cluster58.APS.LCM$'Health State'==0 || Cluster58.APS.Search$'Health State'==0 || Cluster58.APS.Visualization$'Health State'==0 || Cluster58.APS.WEBI$'Health State'==0 || Cluster58.APS.WEBIDSLBridge$'Health State'==0 || Cluster58.APS.WEBIDSLBridge1$'Health State'==0 || Cluster58.AdaptiveJobServer$'Health State'==0 || Cluster58.CentralManagementServer$'Health State'==0 || Cluster58.ConnectionServer$'Health State'==0 || Cluster58.ConnectionServer32$'Health State'==0 || Cluster58.DashboardsCacheServer$'Health State'==0 || Cluster58.DashboardsProcessingServer$'Health State'==0 || Cluster58.EventServer$'Health State'==0 || Cluster58.InputFileRepository$'Health State'==0 || Cluster58.OutputFileRepository$'Health State'==0 || Cluster58.WebApplicationContainerServer$'Health State'==0 || Cluster58.WebIntelligenceProcessingServer$'Health State'==0 || Cluster58.WebIntelligenceProcessingServer1$'Health State'==0 || Cluster58.WebIntelligenceProcessingServer2$'Health State'==0 || Cluster58.WebIntelligenceProcessingServer3$'Health State'==0

The metrics that have crossed their respective thresholds:

BOProdCluster.CentralManagementServer$'Health State'

Appreciate any suggestions.

BW

Toby_Johnston · ‎04-08-2016

Hey Bill,

If the CMS restarted then it would trigger this alert since BOProdCluster.CentralManagementServer$'Health State' watch would be triggered if the server is stopped.

One thing you can do is edit the watch and change the threshold to only trigger the watch if it has been in danger state for > 10 minutes for example. This way, if the server is simply restarted the watch won't give an alert.

Cheers

Toby

Subject: ALL_SERVICES_ALERTS Danger Event

Accepted Solutions (1)

Accepted Solutions (1)

Answers (0)

Re: Integrate an external task system to Cloud ALM...

Re: error during install SAP S/4HANA Server 2022

Re: Mass Update Zfield in Std Table with Split & J...

Re: Unable to connect to Datasphere using the CLI

Re: Mass Update Zfield in Std Table with Split & J...