Network Monitoring and Management

Performance Monitoring

AggreGate Network Manager provides the 360° network performance visibility required for capacity planning and for proactively diagnosing host/application downtime. Key network performance indicators include:

  • Router/switch bandwidth usage
  • Server/router CPU load and memory utilization
  • Storage space usage for servers, disk arrays, tape libraries, and other NAS devices
  • Application/service response times
  • Server uptime
  • Network link quality (latency, jitter, packet loss rate, etc.)
  • Custom performance metrics (e.g. wireless signal strength)

Every performance indicator is backed by numerous analysis tools, such as alerts, charts or reports. Performance metrics history is stored in the server database for long-term trending.

CPU Load Monitoring

The Network Manager tracks CPU load of your servers, routers, managed switches, and any other devices supporting CPU utilization reporting via SNMP. The utilization percentage is tracked individually for every processor or processor core.

An alert may be raised when a device's average CPU load exceeds a configurable threshold for longer than a configurable period of time. Alert settings, notification procedures and corrective actions may be shared between devices or set up for every device individually.

It's also possible to track CPU usage of individual processes running on remote machines.

Storage Space and Memory Utilization Monitoring

The disk space of your servers and NAS devices is also monitored via SNMP. AggreGate Network Manager tracks and displays labels and free/used space for every disk, partition and filesystem. Customizable disk space and memory utilization charts are provided out of the box.

Storage space charts may be accompanied by linear regression trends that predict when utilization will approach 100%.
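As an illustration of how such a trend can be computed, here is a minimal Python sketch. It is not the Network Manager's own implementation: the function name predict_full_time and the sample values are hypothetical, and the model is a plain least-squares line fitted to (time, percent used) samples.

```python
from typing import Optional, Sequence


def predict_full_time(times: Sequence[float], used_pct: Sequence[float],
                      limit_pct: float = 100.0) -> Optional[float]:
    """Fit used_pct = slope * t + intercept by least squares and return the
    time at which the fitted line reaches limit_pct.

    Returns None when usage is flat or falling (no crossing in the future).
    """
    n = len(times)
    if n < 2 or n != len(used_pct):
        raise ValueError("need at least two matching samples")
    mean_t = sum(times) / n
    mean_u = sum(used_pct) / n
    var_t = sum((t - mean_t) ** 2 for t in times)
    if var_t == 0:
        raise ValueError("timestamps must not all be equal")
    cov = sum((t - mean_t) * (u - mean_u) for t, u in zip(times, used_pct))
    slope = cov / var_t
    intercept = mean_u - slope * mean_t
    if slope <= 0:
        return None
    return (limit_pct - intercept) / slope


# Hypothetical daily samples: day number and percentage of disk space used.
days = [0, 1, 2, 3, 4]
used = [50.0, 52.0, 54.0, 56.0, 58.0]
print(predict_full_time(days, used))  # day on which the trend line hits 100%
```

With these samples the disk grows by 2 percentage points per day from 50%, so the trend line reaches 100% on day 25. Real usage is rarely this linear, so the result is best read as a rough estimate.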

The Network Manager can also track absolute and relative memory usage of individual processes running on remote servers/workstations.

Bandwidth Usage Monitoring

With AggreGate Network Manager, the system administrator is warned once the current traffic of any router/switch interface comes close to the theoretical interface speed (or another user-defined threshold). To help catch a bandwidth hog, the Network Manager uses traffic decomposition, which shows which host and application caused the traffic. The analysis is performed using dedicated traffic analysis protocols (e.g. NetFlow).

See Traffic Monitoring for more information.
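The comparison of current traffic against interface speed can be sketched as follows. This is only an illustration, assuming the traffic figure comes from two successive readings of a per-interface octet counter (such as the ones devices commonly expose over SNMP); the function name, the sample readings and the 90% threshold are hypothetical.

```python
def interface_utilization(prev_octets: int, curr_octets: int,
                          interval_s: float, speed_bps: float,
                          counter_bits: int = 32) -> float:
    """Return link utilization in percent between two octet-counter readings."""
    if interval_s <= 0 or speed_bps <= 0:
        raise ValueError("interval and speed must be positive")
    modulus = 1 << counter_bits
    # Modular subtraction gives the right delta even if the counter
    # wrapped around once between the two readings.
    delta_octets = (curr_octets - prev_octets) % modulus
    bits_per_second = delta_octets * 8 / interval_s
    return 100.0 * bits_per_second / speed_bps


# Hypothetical readings taken 60 seconds apart on a 100 Mbit/s interface.
utilization = interface_utilization(1_000_000, 76_000_000, 60.0, 100e6)
threshold_pct = 90.0  # assumed user-defined warning threshold
print(f"{utilization:.1f}% used, warn={utilization >= threshold_pct}")
```

On fast links a 32-bit counter can wrap more than once per polling interval, in which case the delta is underestimated; 64-bit counters (counter_bits=64) or shorter intervals avoid that.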

Response Time Monitoring

The performance of a networked application may degrade for various reasons, such as a slow network link, low server memory, high CPU usage, or an application-specific problem. A synthetic metric that reflects application health reasonably well is its response time, i.e. the period between sending an application-specific request and receiving the full response.

Application response time examples:

  • Web page load time
  • SQL query execution time
  • Time required for downloading a file from remote FTP server
  • Remote shell script execution time
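Each of these measurements follows the same pattern: start a timer, issue the request, wait for the full response, stop the timer. A minimal Python sketch of that pattern is shown below; measure_response_time and fake_query are hypothetical names, and the sleeping stand-in would be replaced by a real web request, SQL query or file download.

```python
import time
from typing import Callable, Tuple, TypeVar

T = TypeVar("T")


def measure_response_time(request: Callable[[], T]) -> Tuple[T, float]:
    """Run request() and return its result with the elapsed time in seconds."""
    start = time.perf_counter()  # monotonic clock, unaffected by clock changes
    result = request()           # must return only after the full response
    elapsed = time.perf_counter() - start
    return result, elapsed


def fake_query() -> str:
    """Stand-in for an application-specific request (assumption for the demo)."""
    time.sleep(0.05)
    return "ok"


result, seconds = measure_response_time(fake_query)
print(f"{result} in {seconds * 1000:.1f} ms")
```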

In addition to application response times, the Network Manager tracks the response time of whole servers/workstations, measured via ICMP echo (ping). Combined with the packet loss rate, ping response time measurements greatly help with detecting network link degradation.
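To show how ping results turn into link quality figures, the sketch below summarizes a series of round-trip times in which a lost packet is recorded as None. The names are hypothetical, and jitter is taken here as the mean absolute difference between consecutive replies, which is one simple definition among several in use.

```python
from typing import NamedTuple, Optional, Sequence


class LinkStats(NamedTuple):
    loss_pct: float
    avg_rtt_ms: Optional[float]
    jitter_ms: Optional[float]


def summarize_pings(rtts_ms: Sequence[Optional[float]]) -> LinkStats:
    """Summarize ping results; None marks a probe that got no reply."""
    if not rtts_ms:
        raise ValueError("no probes to summarize")
    replies = [r for r in rtts_ms if r is not None]
    loss_pct = 100.0 * (len(rtts_ms) - len(replies)) / len(rtts_ms)
    avg = sum(replies) / len(replies) if replies else None
    # Differences are taken between consecutive received replies,
    # skipping over lost probes.
    diffs = [abs(b - a) for a, b in zip(replies, replies[1:])]
    jitter = sum(diffs) / len(diffs) if diffs else None
    return LinkStats(loss_pct, avg, jitter)


# Hypothetical series of five probes, one of them lost.
stats = summarize_pings([10.0, 12.0, None, 11.0, 15.0])
print(stats)
```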

Custom Performance Metrics

AggreGate Network Manager may collect custom performance indicators from any networked device via SNMP, WMI, CLI, and other remote monitoring protocols and their corresponding device drivers (Modbus, BACnet, OPC, etc.).

Here are some examples:

  • Wireless signal strength (SNMP)
  • Application server thread count (JMX)
  • Windows Paging File Size (WMI)

AggreGate IoT Platform provides rich processing and analysis options for any custom metric. See Technology for more information.

"Top 10" Performance Dashboard

A system administrator's duty starts with an overview of their "empire". If everything runs smoothly, they can enjoy the job right from the morning. The Top 10 dashboard is designed to eliminate numerous checks of different network resources by combining information about problematic and heavily loaded resources in a single place.

Performance Degradation Alerting

The advanced alerting engine provided by the AggreGate Platform handles even the most complicated performance degradation detection tasks. Here are just some examples of complex alert trigger conditions:

  • Detect a DDoS attack by triggering an alert if multiple "overload" events occur within a certain time frame
  • Raise an alert if CPU load exceeds 80% for longer than 5 minutes and deactivate it automatically only if the load goes below 30% for more than an hour
  • Trigger an alert if more than X servers belonging to a cluster become unavailable or fail to meet specific performance criteria
  • Predict a future SLA violation of a multi-component service by analyzing its KPI trend
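The CPU load condition from the list above is a form of hysteresis with hold times. The following Python sketch shows one way such logic can be expressed. It is not the AggreGate alerting engine's actual configuration or API: the HysteresisAlert class and its parameters are hypothetical, with defaults taken from the 80% / 5 minutes / 30% / 1 hour example.

```python
from typing import Optional


class HysteresisAlert:
    """Activate after a sustained high load, deactivate after a sustained low one."""

    def __init__(self, high: float = 80.0, high_hold_s: float = 300.0,
                 low: float = 30.0, low_hold_s: float = 3600.0) -> None:
        self.high, self.high_hold_s = high, high_hold_s
        self.low, self.low_hold_s = low, low_hold_s
        self.active = False
        self._since: Optional[float] = None  # start of the current streak

    def update(self, timestamp_s: float, value: float) -> bool:
        """Feed one sample and return whether the alert is active afterwards."""
        # Inactive: look for a streak above `high`.
        # Active: look for a streak below `low`.
        qualifies = value < self.low if self.active else value > self.high
        hold_s = self.low_hold_s if self.active else self.high_hold_s
        if not qualifies:
            self._since = None  # streak broken, start over
        else:
            if self._since is None:
                self._since = timestamp_s
            if timestamp_s - self._since > hold_s:
                self.active = not self.active
                self._since = None
        return self.active


# Hypothetical one-minute samples: 90% load for a little over five minutes.
alert = HysteresisAlert()
states = [alert.update(t, 90.0) for t in range(0, 361, 60)]
print(states)
```

The alert stays inactive during the first five minutes and activates only on the sample taken after the 300-second hold has been exceeded. Once active, a brief dip below 30% does not clear it: the load has to stay below 30% for more than an hour.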

In addition to notification broadcasting, every alert may be followed by forced operator acknowledgement and automatic/interactive corrective actions. For example, the Network Manager can restart a service or reboot a server upon a performance alert.
