Two servers, one of which is within the core and one of which is entirely separate, monitor
the Checkbridge infrastructure. The internal and external monitoring servers are used to
monitor each other so that Checkbridge support teams are aware of any failure.
The internal server monitors and restarts critical services, and uses SNMPv3 to monitor
available disk space, CPU load, open TCP sockets and the number of processes on each server
within the infrastructure, including the satellites. On the other devices, network usage, port
errors, TCP sessions and throughput are monitored. In addition, the following are checked:

Each server has a 'scand' port open.
All interfaces are pinged every three minutes.
Each web server is running a svscan process.
Each web server and reverse proxy can serve a test page within an acceptable amount of time.
Each MySQL server can perform a specified operation within an acceptable amount of time.
Mail servers are monitored to ensure they can deliver email and serve DNS records
within an acceptable amount of time.
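
The port and response-time checks listed above can be sketched as follows. This is a minimal
illustration only, not the checks Checkbridge actually runs: the function names port_open and
timed_check, the default timeout and the idea of passing the operation in as a callable are all
assumptions made for the example.

```python
import socket
import time


def port_open(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout.

    Hypothetical helper: the host, port and timeout values are supplied
    by the caller and are not taken from the source document.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and name-resolution failures.
        return False


def timed_check(operation, max_seconds):
    """Run operation() and report whether it completed in acceptable time.

    Returns an (ok, elapsed_seconds) tuple. The check fails if the
    operation raises an exception or takes longer than max_seconds.
    """
    start = time.monotonic()
    try:
        operation()
    except Exception:
        return False, time.monotonic() - start
    elapsed = time.monotonic() - start
    return elapsed <= max_seconds, elapsed
```

The same timed_check wrapper could cover the test-page, MySQL and mail checks by passing a
different callable for each, for example one that fetches the test page or runs the specified
database operation.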

The external monitoring server ensures that each web server is able to serve a web page,
each mail server will accept and deliver mail, and DNS records are available. Automated daily
vulnerability tests provide reports to Checkbridge staff to ensure the infrastructure remains secure.
The SNMP data is compiled by Nagios, which also monitors thresholds and generates alerts.
The alerts are displayed to Checkbridge support staff, emailed to a ticketing system and
sent via SMS.
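
The threshold-and-alert flow described above can be illustrated with a short sketch. It is not
Nagios code or configuration. The state numbers follow the Nagios plugin exit-code convention
(0 OK, 1 WARNING, 2 CRITICAL); everything else, including the function names, host names,
metric names and threshold values, is hypothetical.

```python
OK, WARNING, CRITICAL = 0, 1, 2  # Nagios plugin exit-code convention
STATE_NAMES = {OK: "OK", WARNING: "WARNING", CRITICAL: "CRITICAL"}


def classify(value, warn, crit):
    """Map a reading to a state using 'greater than or equal' thresholds."""
    if value >= crit:
        return CRITICAL
    if value >= warn:
        return WARNING
    return OK


def build_alerts(readings, thresholds):
    """Return alert messages for every reading that is not OK.

    readings:   {(host, metric): value}
    thresholds: {metric: (warn, crit)}
    """
    alerts = []
    for (host, metric), value in sorted(readings.items()):
        warn, crit = thresholds[metric]
        state = classify(value, warn, crit)
        if state != OK:
            alerts.append(f"{STATE_NAMES[state]}: {host} {metric}={value}")
    return alerts


def dispatch(alerts, channels):
    """Send every alert to every channel (display, ticket email, SMS, ...).

    Each channel is any callable that accepts the alert text.
    """
    for alert in alerts:
        for send in channels:
            send(alert)
```

In this sketch a channel is simply a function, so a dashboard, a ticketing mailbox and an SMS
gateway would each be represented by one callable passed to dispatch.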
© Checkbridge 2010