Get the Big Picture! End-to-End Monitoring of Heterogeneous Middleware and Apps

Preview:

DESCRIPTION

SL's CTO, Tom Lubinksi, explains in this webinar how you can learn to provide the most complete picture of the health state of your critical Oracle-centric business applications. Aggregate and consolidate metrics in real time in order to provide visibility across an entire application infrastructure including Oracle WebLogic Server, Oracle Coherence and Oracle databases, as well as other critical middleware components. Quickly provide end-to-end views, intuitive diagnosis and rapid resolution of issues across a broad range of applications, middleware components and monitoring tools. Techniques for the rapid delivery of tailored displays, filtered alerts, deep-dive troubleshooting and historical analysis of system behavior will also be addressed.

Citation preview

© 2012 SL Corporation. All Rights Reserved.

© 2013 SL Corporation. All Rights Reserved.1

Get the Big Picture! End-to-End Monitoring of Heterogeneous Middleware and Apps

Tom Lubinski, CTO, SL Corporation

8/9 October, 2013

© 2012 SL Corporation. All Rights Reserved.

© 2013 SL Corporation. All Rights Reserved.2

RTView – Get the Big Picture !

• What is end-to-end monitoring• Why should you care about it ?

• Traditional system management tools are not enough• Agent-based transaction monitoring tools not enough

• What RTView does differently to address this need• Customer Use Case

© 2012 SL Corporation. All Rights Reserved.

© 2013 SL Corporation. All Rights Reserved.3

What is End-To-End Monitoring ?

• Most people think this is end-to-end monitoring:

A B C D E

A Linear Data Flow …

Where A, B, C are JSPs, Servlets, Topics, Queues, etc.

© 2012 SL Corporation. All Rights Reserved.

© 2013 SL Corporation. All Rights Reserved.4

What is End-To-End Monitoring ?

• Sometimes it can be more complex:

A B C D E

With Loops and Error Paths …but it’s still “one-dimensional”

G F

Error

© 2012 SL Corporation. All Rights Reserved.

© 2013 SL Corporation. All Rights Reserved.5

What is End-To-End Monitoring ?

• It’s even more complex than this

• Two more dimensions must be added to the picture …

© 2012 SL Corporation. All Rights Reserved.

© 2013 SL Corporation. All Rights Reserved.6

What is End-To-End Monitoring ?

• A second dimension to capture the nested levels systems are implemented in – component layering

Host LayerPhysical Servers, Network, Disk, OS

App Server CachingMessaging

Servlet JSP

EJB

Topic Queue

Route

Cache Service

© 2012 SL Corporation. All Rights Reserved.

© 2013 SL Corporation. All Rights Reserved.7

What is End-To-End Monitoring ?

WebLogicServer

CoherenceCache

Load-Balanced Servers backed by Distributed Cache

A third dimension that most people don’t even think of … process distribution

© 2012 SL Corporation. All Rights Reserved.

© 2013 SL Corporation. All Rights Reserved.8

Why Should You Care ?

• These three dimensions create complexity – all going on at same time

• Taken together they generate huge volumes of monitoring data and complex relationships between the application and the underlying components.

• Add in a fourth dimension, time, and the challenge becomes even greater

© 2012 SL Corporation. All Rights Reserved.

© 2013 SL Corporation. All Rights Reserved.9

Why Should You Care ?

• System Mgmt tools give you lots of data, but typically one component at time … only the 2nd dimension

• Transaction monitoring tools only 1st dimension.

• Both are essentially “after the fact” - like alerts … you can tell when a transaction or component has failed but it needs to be correlated with state of entire system

© 2012 SL Corporation. All Rights Reserved.

© 2013 SL Corporation. All Rights Reserved.10

How RTView Addresses This Problem

• RTView collects data from all these dimensions and is aware of the relationships.

• If an app is dependent on a specific msg queue, RTView can map this to the JMS server that contains the queue, as well as the system that hosts the server and correlate the metrics

• RTView is aware of Server load-balancing and the distributed nature of Coherence caches

© 2012 SL Corporation. All Rights Reserved.

© 2013 SL Corporation. All Rights Reserved.11

How RTView Addresses This Problem

www.sl.com

© 2012 SL Corporation. All Rights Reserved.

© 2013 SL Corporation. All Rights Reserved.12

RTView Use Case

Application: On-Line Store

Large on-line store providing product search and ordering services for consumer software products

WebLogic for Application Server

Coherence for Database Caching

Multiple other technologies, including TIBCO EMS for communications services

VMWare Virtual Infrastructure

© 2012 SL Corporation. All Rights Reserved.

© 2013 SL Corporation. All Rights Reserved.13

RTView Use Case

Application Team

Small Group (< 10) responsible for 100+ WebLogic Servers + 100 Coherence nodes replicated in DEV,

TEST, and PROD

Some peripheral monitoring, e.g. Splunk for log files, Omniture for web tracking, OEM for WLS and OC

© 2012 SL Corporation. All Rights Reserved.

© 2013 SL Corporation. All Rights Reserved.14

RTView Use Case

Support Challenges

Difficult to have confidence that store is “OK”Only know when something goes wrong

When it does, difficult to determine the cause

Each subsystem implemented as a WL ClusterWebLogic = OEM only allows them to see one server at a

time, but the store works off of “clusters”

Coherence = complete black box

© 2012 SL Corporation. All Rights Reserved.

© 2013 SL Corporation. All Rights Reserved.15

RTView Use Case

RTView Solution

Hi-Level Overview Diagram showing “health state” – provides confidence that all subsystems are OK

Present WebLogic information in “clusters” – makes it possible to see aggregate metrics and load

balancing for each app

Correlate Coherence metrics with WL to provide confidence that Coherence black box is OK

© 2012 SL Corporation. All Rights Reserved.

© 2013 SL Corporation. All Rights Reserved.16

On-Line Store Overview Diagram

RTView used to create system overview diagram

© 2012 SL Corporation. All Rights Reserved.

© 2013 SL Corporation. All Rights Reserved.17

WebLogic Cluster/Server Summary

All Servers Organized by Cluster, with Health State

© 2012 SL Corporation. All Rights Reserved.

© 2013 SL Corporation. All Rights Reserved.18

WebLogic Cluster App Summary

Each Cluster shown as a unit, with server metrics aggregated

© 2012 SL Corporation. All Rights Reserved.

© 2013 SL Corporation. All Rights Reserved.19

Load Balance Analysis

Load Balance Comparison of multiple metrics across WebLogic and Coherence

© 2012 SL Corporation. All Rights Reserved.

© 2013 SL Corporation. All Rights Reserved.20

Aggregating Other Middleware Information

Health State of each service aggregated from multiple components

© 2012 SL Corporation. All Rights Reserved.

© 2013 SL Corporation. All Rights Reserved.21

Aggregating Other Middleware Information

Health State of each service aggregated from multiple components

© 2012 SL Corporation. All Rights Reserved.

© 2013 SL Corporation. All Rights Reserved.22

Aggregating Other Middleware Information

Including Aggregate Service Alert History over Time

© 2012 SL Corporation. All Rights Reserved.

© 2013 SL Corporation. All Rights Reserved.23

Aggregating Other Middleware Information

Including Detailed History of Coherence Cache Service

© 2012 SL Corporation. All Rights Reserved.

© 2013 SL Corporation. All Rights Reserved.24

The Emerging Standard Oracle Stack

VMware

Oracle Coherence/DatabasesMessaging Middleware

Fusion / TIBCO / MQ

Oracle WebLogic

© 2012 SL Corporation. All Rights Reserved.

© 2013 SL Corporation. All Rights Reserved.25

Conclusion

• 3-Dimensional End-to-end monitoring required for the emerging standard Oracle application stack

• High-level overviews showing “health state” give confidence that all subsystems are OK

• Ability to present WebLogic information in “clusters” makes it possible to see aggregate metrics and load balancing for each app

• Ability to correlate Coherence metrics with WebLogic helps to ensure that Coherence black box is OK

• Include Virtual Infrastructure State and Components from Other Vendors

© 2012 SL Corporation. All Rights Reserved.

© 2013 SL Corporation. All Rights Reserved.26

Thank you!For more information, please visit

www.sl.com

Recommended