1. Cisco Confidential 2011 Cisco and/or its affiliates. All
rights reserved. 1 Nagios overview and application on TSTV-YES
Yating Yang May 31, 2013
2. 2011 Cisco and/or its affiliates. All rights reserved. Cisco
Confidential 2 Background Nagios Overview Nagios feature Nagios
Installation Nagios Basic configuration How Checks Work Nagios
application on TSTV-YES testing Reference
3. 2011 Cisco and/or its affiliates. All rights reserved. Cisco
Confidential 3 Uses CGIs written in C for faster response and
scalability. Background
4. 2011 Cisco and/or its affiliates. All rights reserved. Cisco
Confidential 4 Nagios is a powerful monitoring system that enables
organizations to identify and resolve IT infrastructure problems
before they affect critical business processes. With Nagios you
can: Monitor your entire IT infrastructure Spot problems before
they occur Know immediately when problems arise Share availability
data with stakeholders Detect security breaches Plan and budget for
IT upgrades Reduce downtime and business losses
5. 2011 Cisco and/or its affiliates. All rights reserved. Cisco
Confidential 5 Verification of availability is delegated to
plugins: The product's architecture is simple enough that writing
new plugins is fairly easy in the language of your choice. There
are many, many plugins available. Has intelligent checking
capabilities. Attempts to distribute the server load of running
Nagios (for larger sites) and the load placed on devices being
checked. Configuration is done in simple, plain text files, but
that can contain much detail and are based on templates. Nagios
reads it's configuration from an entire directory. You decide how
to define individual files.
6. 2011 Cisco and/or its affiliates. All rights reserved. Cisco
Confidential 6 Utilizes topology to determine dependencies. Nagios
differentiates between what is down vs. what is not available. This
way it avoids running unnecessary checks. Nagios allows you to
define how you send notifications based on combinations of:
Contacts and lists of contacts Devices and groups of devices
Services and groups of services Defined hours by persons or groups.
The state of a service.
7. 2011 Cisco and/or its affiliates. All rights reserved. Cisco
Confidential 7 Installation reference wiki:
http://zed.cisco.com/confluence/display/SPVTG/Videoscape+EPIC+Installation+and+Co
nfiguration
8. 2011 Cisco and/or its affiliates. All rights reserved. Cisco
Confidential 8
9. 2011 Cisco and/or its affiliates. All rights reserved. Cisco
Confidential 9 Define how you are going to test a service. define
command{ command_name check_nrpe command_line $USER1$/check_nrpe -H
$HOSTADDRESS$ -c $ARG1$ } define command { command_name
check_cpu_perf command_line $USER1$/check_cpu_perf.sh -t 30 -a
$ARG1$ $ARG2$ }
13. 2011 Cisco and/or its affiliates. All rights reserved.
Cisco Confidential 13 define service{ use tstv-service
hostgroup_name tstv-vms-servers service_description CPU Usage
check_command check_nrpe!check_cpu_perf }
14. 2011 Cisco and/or its affiliates. All rights reserved.
Cisco Confidential 14 Nodes can have parents. For example, the
parent of a PC connected to a switch would be the switch. This
allows us to specify the network dependencies that exist between
machines, switches, routers, etc. This avoids having Nagios send
alarms when a parent does not respond. A node can have multiple
parents.
15. 2011 Cisco and/or its affiliates. All rights reserved.
Cisco Confidential 15 A node/host/device consists of one or more
service checks (PING, HTTP, MYSQL, SSH, etc) Periodically Nagios
checks each service for each node and determines if state has
changed. State changes are: CRITICAL WARNING UNKNOWN For each state
change you can assign: Notification options (as mentioned before)
Event handlers
16. 2011 Cisco and/or its affiliates. All rights reserved.
Cisco Confidential 16 In this manner it can take some time before a
host change's its state to down as Nagios first does a service
check and then a node check. By default Nagios does a node check 3
times before it will change the nodes state to down. You can, of
course, change all this.
17. 2011 Cisco and/or its affiliates. All rights reserved.
Cisco Confidential 17 NRPE (Nagios Remote Plugin Executor) NSCA
(Nagios Service Check Acceptor) These solutions may be best if you
are able to compile and install software on the other machine. But
in our solution, CDS-IS, VOS, Appsuite are customized linux OS and
have limitation to install above plugin. Need to develop plugin to
access these subsystem.
18. 2011 Cisco and/or its affiliates. All rights reserved.
Cisco Confidential 18
19. 2011 Cisco and/or its affiliates. All rights reserved.
Cisco Confidential 19 Nagios web site http://www.nagios.org/ Nagios
plugins site http://sourceforge.net/projects/nagiosplug/ A Debian
tutorial on Nagios http://www.debianhelp.co.uk/nagios.htm
Commercial Nagios support http://www.nagios.com/ Nagios
http://nagios-cn.sourceforge.net/nagios-cn/index.html