I'm hoping that one of the resident SW gurus can help me out with setting up an alert that is absolutely giving me fits.
I have a pair of Linux servers running a Java-based program that monitors six services and outputs the results to a webpage. I have set up an HTTP QoE monitor that checks the webpage to see if any of the services are not available. The HTTP monitor is set up to go to the URL and then search the returned page for the string NOT_AVAILABLE.
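In case it helps to see the check spelled out, here is a rough Python sketch of what I understand the monitor to be doing. This is only an illustration, not how SW implements it; the function names and the timeout value are made up for the example.

```python
from urllib.request import urlopen

MARKER = "NOT_AVAILABLE"  # the string the monitor searches the page for


def page_reports_outage(body: str) -> bool:
    """Return True if the status page text contains the marker string."""
    return MARKER in body


def check_status_page(url: str, timeout: float = 10.0) -> bool:
    """Fetch the status page and report whether any service looks unavailable.

    Hypothetical helper: urlopen raises urllib.error.HTTPError for a
    response such as 500, so in that case the body is never searched.
    """
    with urlopen(url, timeout=timeout) as resp:
        body = resp.read().decode("utf-8", errors="replace")
    return page_reports_outage(body)
```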
I then set up an alert based on this HTTP monitor, which looks like this:
Because the alert was triggering on literally every poll, I changed the polling frequency on the application monitor to every 10 minutes, and as you can see, the condition must exist for more than 20 minutes. The alert only seems to trigger on one of the two servers, which tells me that I set the monitor and alert up correctly and that the problem therefore lies with the server that is alerting. I have brought this up with the server's owners (along with the fact that all the services they are monitoring can be monitored in SW), but until they get it figured out, I was hoping that someone could explain how I can set this to only send an alert if the check fails for 3 consecutive polls.
Finally, and I'm not sure that this has anything to do with it, when I do get application down alerts they show up as a 500 Internal Server Error.
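To be clear about the behavior I'm after, here is a minimal Python sketch of "alert only after 3 consecutive failed polls". It is not SW configuration, just the logic; the class and method names are hypothetical.

```python
class ConsecutiveFailureAlert:
    """Fire once when a check has failed `threshold` polls in a row."""

    def __init__(self, threshold: int = 3):
        self.threshold = threshold
        self.failures = 0  # current run of consecutive failed polls

    def record_poll(self, failed: bool) -> bool:
        """Record one poll result; return True only on the poll that
        brings the failure streak up to the threshold."""
        if failed:
            self.failures += 1
        else:
            self.failures = 0  # any good poll resets the streak
        return self.failures == self.threshold


# Example: two failures, a recovery, then four failures in a row.
alert = ConsecutiveFailureAlert(threshold=3)
polls = [True, True, False, True, True, True, True]
fired = [alert.record_poll(p) for p in polls]
```

With the polls above, only the sixth poll should fire, since the good poll in the middle resets the count. With a 10 minute polling interval, the third consecutive failure lands 20 minutes after the first one.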
Bueller....Bueller.....