|
Fault Management is arguably the most vital part of Web NMS. Effective fault management makes it possible to identify failures in a networked environment with minimal effort. AdventNet Web NMS Fault Management provides sophisticated features such as filtering of traps, events, and alerts; parsing of traps and events; execution of automated actions; event correlation; and alerting. This section is divided into subtopics by key functionality, which together cover the whole of Fault Management.
Fault Management Overview - This section gives you an overview of Fault Management, explaining its architecture and its general flow.
Fault Management Database Schema - This page lists the Fault Management tables and briefly describes each of them.
Network Notifications - This section explains how the Fault Manager receives notifications about the failure of a node or element in the system. It covers the kinds of notifications that can be received in a networked environment, such as SNMP traps, TL1 messages, and JMX notifications.
Listening for Notifications - This section explains how to configure the trap port for listening to notifications and how to register multiple applications to listen for traps on the same port simultaneously.
Filtering and Processing Notifications - This section covers in detail the filtering and processing of notifications (traps and autonomous messages). It also contains examples of trap filtering.
Observing for Traps - This topic explains how external applications can register themselves to get notified when Web NMS receives traps.
Notification Propagation - This topic explains how data flow and thread flow occur in the Fault module of Web NMS.
Handling Events - This section covers in detail the generation, parsing, filtering, and cleanup of events. It gives a complete description of the Event API and of the different actions that an event can trigger. Examples of Event Filters are also included.
Handling Alerts - This section covers in detail the mechanism, filtering, management, and grouping of alerts. It gives a complete description of the Alert API and of the different actions that can be triggered when an alert satisfies the matching criteria, along with examples. The new, enhanced Alert Listener concept is also explained here.
Policies - This section deals with the policies available in the Fault module.
Print Option - This section details the operating system-specific commands used for server-side printing of Events and Alerts.
Plugging in Protocol - This chapter describes the steps for plugging different protocols into the Fault module.
Developer Tips - This section covers frequently asked questions, troubleshooting tips, and performance-tuning tips.
Monitoring CPU Utilization - A case study on fault management.
|