22
Connect. Communicate. Collaborate Place your organisation logo in this ar GN2 Multidomain Monitoring Service: Serving IP NOCs Nicolas Simar, DANTE APM Meeting, Utrecht 24 th of November 2006

Connect. Communicate. Collaborate Place your organisation logo in this area GN2 Multidomain Monitoring Service: Serving IP NOCs Nicolas Simar, DANTE APM

Embed Size (px)

Citation preview

Page 1: Connect. Communicate. Collaborate Place your organisation logo in this area GN2 Multidomain Monitoring Service: Serving IP NOCs Nicolas Simar, DANTE APM

Connect. Communicate. Collaborate

Place your organisation logo in this area

GN2 Multidomain Monitoring Service: Serving IP NOCs

Nicolas Simar, DANTE

APM Meeting, Utrecht

24th of November 2006

Page 2: Connect. Communicate. Collaborate Place your organisation logo in this area GN2 Multidomain Monitoring Service: Serving IP NOCs Nicolas Simar, DANTE APM

Connect. Communicate. CollaborateAgenda

• Provide the general concepts of the Multi Domain Monitoring service.– Set the scenes.– You’ll use it soon!– The Support that will be offered to you.

• Demo the visualisations– Provide feedback

• Explain the next steps and what your role will be:– Taking part to the Pilot and Prototype. – Using the tools.– What metric and services will be available – validate the first

choice.

Page 3: Connect. Communicate. Collaborate Place your organisation logo in this area GN2 Multidomain Monitoring Service: Serving IP NOCs Nicolas Simar, DANTE APM

Connect. Communicate. CollaborateWhat is JRA1?

• JRA1 (Performance Measurement and Management) main objective is to build a multi-domain monitoring framework inter-operable across which is the basis to offer a Multi-Domain Monitoring (MDM) Service.

• Consists of the following main parts:• Design and develop the framework (perfSONAR).

• Integrate measurement tools and databases within the perfSONAR framework.

• Build user visualisation tools using the perfSONAR framework.

• There are about 25 participants (12.5 FTE), from 17 organisations.– Main partners are CARNet, CESNET, Cynet, Dante, DFN, NORDUnet,

PSNC.

Page 4: Connect. Communicate. Collaborate Place your organisation logo in this area GN2 Multidomain Monitoring Service: Serving IP NOCs Nicolas Simar, DANTE APM

Connect. Communicate. CollaborateperfSONAR philosophy

Page 5: Connect. Communicate. Collaborate Place your organisation logo in this area GN2 Multidomain Monitoring Service: Serving IP NOCs Nicolas Simar, DANTE APM

Connect. Communicate. CollaborateWhat is perfSONAR?

• perfSONAR is a consortium of organisations who seek to build network performance middleware that is inter-operable across multiple networks.

• perfSONAR is a protocol. – SOAP XML messages and following the Open Grid Forum (OGF)

Network Measurement Working Group (NM-WG).

• perfSONAR is, an example set of code (implementation of web-services using the perfSONAR protocol).

Page 6: Connect. Communicate. Collaborate Place your organisation logo in this area GN2 Multidomain Monitoring Service: Serving IP NOCs Nicolas Simar, DANTE APM

Connect. Communicate. CollaboratePerfSONAR Web-Services

• The framework takes care of the data movement.• It covers the following perfSONAR web-services

– Auth Service (JRA5)

– Autz Service

– Lookup Service (LS)

– Measurement Archives services (MA)• RRD MA, SQL MA, Hades MA

– Measurement Point services (MP)• BWCTL MP, SSH/Telnet MP, CLI MP (I2), L2 status MP (JRA4)

– Topology Service (TopS, cNIS – SA3).

• Allows diversity on the measurement layer and on the visualization layer.

Page 7: Connect. Communicate. Collaborate Place your organisation logo in this area GN2 Multidomain Monitoring Service: Serving IP NOCs Nicolas Simar, DANTE APM

Connect. Communicate. CollaborateperfSONAR philosophy

Page 8: Connect. Communicate. Collaborate Place your organisation logo in this area GN2 Multidomain Monitoring Service: Serving IP NOCs Nicolas Simar, DANTE APM

Connect. Communicate. Collaborate

Multi-Domain Monitoring Service (MDM)

• User : role – group of people making use of a MDM Service. – There may be several categories of users having different needs.

• An MDM service is an access to a set of metrics or functionalities offered to a group of users by several networks using the perfSONAR protocol.

• An MDM service is offered by deploying on a set of perfSONAR web-services and/or visualisations.

• E2E really means Edge to Edge, not End to End (unless end institutions buy into it).

Page 9: Connect. Communicate. Collaborate Place your organisation logo in this area GN2 Multidomain Monitoring Service: Serving IP NOCs Nicolas Simar, DANTE APM

Connect. Communicate. Collaborate

Multi-Domain Monitoring Service

OwnOwnUser

GN2Visualisation

perfSONAR SOAP XML + JRA5 AA

GN2Visualisation

Domain A

BWCTL MPOWD MALookup

Domain B

BWCTL MPOWD MALookup

Domain C

BWCTL MPOWD MALookup

UserVisualisation

perfSONAR SOAP XML + JRA5 AA

Page 10: Connect. Communicate. Collaborate Place your organisation logo in this area GN2 Multidomain Monitoring Service: Serving IP NOCs Nicolas Simar, DANTE APM

Connect. Communicate. Collaborate

Multi-Domain Monitoring Service

• Multi-Domain Monitoring Service– Access to a set of monitoring functionalities (e.g. accessing metric

or performing tests) offered to a group of users accessible directly through an XML SOAP interface (perfSONAR protocol) or through a visualisation tools.

– Based on an underlying set of perfSONAR web-services.

• perfSONAR web-service– Web service (providing data or allowing to perform an action) using

the XML NM-WG. The perfSONAR web-services are the basic building blocs of a MDM service.

Page 11: Connect. Communicate. Collaborate Place your organisation logo in this area GN2 Multidomain Monitoring Service: Serving IP NOCs Nicolas Simar, DANTE APM

Connect. Communicate. CollaborateUsers Segmentation

Advance Trouble-shooting

Trouble-shooting

Service Health Check

Project Trouble-shooting

Project Service Health check

SLA Verificat

ion

Tailored Added Value

Functional

Yes Yes Yes

Yes Yes Yes

[optional] [optional] Yes Yes

[optional] [optional] Yes Yes

[optional] [optional] Yes Yes

Yes Yes Yes

[optional]

Security

Layer3 ProjectPIP Project

End-UserNetwork Researcher

NREN non technical Staff

User group and their Monotoring Data Usage.

PERTNOCLayer2 Project

Page 12: Connect. Communicate. Collaborate Place your organisation logo in this area GN2 Multidomain Monitoring Service: Serving IP NOCs Nicolas Simar, DANTE APM

Connect. Communicate. CollaborateMDM Service Benefits

• For the NOCs – NRENs, EU RENs, GÉANT2 (Abilene(?), ESnet(?), RNP(?), etc). – In DJ1.1.1

• NOCs encounter 5-10% of the problems involving coordination of between multiple domains.

– E2E services/IP packets don’t stop at the boundaries of a domain.– To have an E2E view.

• In particular when offering added value E2E services. • Link capacity, link utilisation, packet drops, topology.

– To have in multiple domain on stand-by tools to perform basic tests.• TCP throughput, link utilisation, delay, looking glass.

– To have the capability of finding out where the tools are located.– To answer the question “End system vs network based problem?”– Send tests results easily.– Save time.

Page 13: Connect. Communicate. Collaborate Place your organisation logo in this area GN2 Multidomain Monitoring Service: Serving IP NOCs Nicolas Simar, DANTE APM

Connect. Communicate. CollaborateMDM Service Benefits

• PERT– Similar than for the NOCs.

• L2 project users (LHC OPN, DEISA, eVLBI).– Can see the health of their service.– Verify SLA.– Integrate the data within their own tools.

• L3 project users (EGEE, eVLBI).– Can see the health of their service.– Verify SLA.– Integrate the data within their own tools– We can provide them added value services (traffic matrix between project

sites).

• End-users when appropriate tools will be made available.– Empowering the network users: indication about the network.– Work not started.

Page 14: Connect. Communicate. Collaborate Place your organisation logo in this area GN2 Multidomain Monitoring Service: Serving IP NOCs Nicolas Simar, DANTE APM

Connect. Communicate. CollaborateGoing Operational• Pre-roll Out – define and set-up support structure now – March 07.• Pilot – April 07 – August 07 – 5 RENs + GÉANT2

– For NOC and PERT (no AA)– Understand the issues of going operational.– Validate the support structure, get feedback for next phase.– Release in January, deployment training in February.

• Prototype – October 07 – February 08 – 11 RENs + GÉANT2– For NOC, PERT and a limited number of projects.– Verify the MDM SLA.– Dedicated support team.– Verify how to provide the service to external parties.– Test the turn key solution.

• Operation – April 08– More RENs, closer to end-institution.– More projects supported.

Page 15: Connect. Communicate. Collaborate Place your organisation logo in this area GN2 Multidomain Monitoring Service: Serving IP NOCs Nicolas Simar, DANTE APM

Connect. Communicate. CollaborateMDM Service Pilot portfolio

MetricperfsonarUI

CNM NEMOVisualperfSON

AR

JRA4 L2

visualisation

XML access

L3 link utilisation Historical Yes Yes Yes Yes

L3 link capacity Historical Yes Yes Yes Yes

L2 circuit status Latest Yes Yes

L2 circuit status Historical Yes Yes

OWD, IPDV, OWPL, Historical Yes Yes Yes

traceroute Historical Yes

Delay RTT On-demand Yes Yes

show command On-demand Yes Yes

Traceroute On-demand Yes Yes

Achievable throughput (TCP) On-demand Yes Yes

UDP throughput On-demand Yes Yes

Service discovery Yes Yes Yes

Service

Telnet/SSH MP

BWCTL MP

Lookup Service

L2 status MP(*)

RRD MA or SQL MA

SQL MA (*)

Hades MA

(*) L2 status MP or SQL MA

Page 16: Connect. Communicate. Collaborate Place your organisation logo in this area GN2 Multidomain Monitoring Service: Serving IP NOCs Nicolas Simar, DANTE APM

Connect. Communicate. Collaborate

MDM Service Prototype portfolio

Prototype/Functionalityperfsonar

UICNM NEMO

VisualperfSONAR

JRA4 L2 visualisati

on

XML access

L3 link utilisation Historical Yes Yes Yes Yes YesL3 link capacity Historical Yes Yes Yes Yes YesL3 Interface Output drops Historical Yes Yes Yes Yes YesL3 Interface Input drops Historical Yes Yes Yes Yes Yes

L2 status MP L2 circuit status Latest Yes YesL2 circuit status Historical Yes YesOWD, IPDV, OWPL Historical Yes Yes Yes

Achievable throughput (TCP) Historical Yes Yes YesUDP throughput Historical Yes Yes Yestraceroute Historical Yes Yes Yes

Delay RTT On-demand Yes Yesshow command On-demand Yes Yes

Traceroute On-demand Yes YesAchievable throughput (TCP) On-demand Yes YesUDP throughput On-demand Yes YesOWD, IPDV, OWPL On-demand Yes Yes

Lookup Service Service discovery Yes Yes Yes Yes Yes

Topology Serivce Topology information Yes Yes Yes YesAuthentication Service Yes Yes Yes Yes YesAuthorisation Service Yes Yes Yes Yes Yes

RRD MA or SQL MA

SQL MA

Hades MA

Telnet/SSH MP

Service

BWCTL MP

Hades MP

AutzAuth or GiDP

In orange, the additional foreseen functionality from the prototype over the Pilot.

Page 17: Connect. Communicate. Collaborate Place your organisation logo in this area GN2 Multidomain Monitoring Service: Serving IP NOCs Nicolas Simar, DANTE APM

Connect. Communicate. CollaborateTaking part to the Pilot

• Deploy the web-services and provide the appropriate data.• Set-up an MDM Level2 support, provide an operational

service.– Ensuring availability of the web-services (Monitor the web-

services), reporting problems following the MDM service procedures.

• Having the NOC and PERT using the infrastructure, solving issues thanks to it and providing feedback.– Training the NOC and PERT.

• Validate the Service at the end of the phase.– Tools, metrics, services.

Page 18: Connect. Communicate. Collaborate Place your organisation logo in this area GN2 Multidomain Monitoring Service: Serving IP NOCs Nicolas Simar, DANTE APM

Connect. Communicate. Collaborate

MDM Web-services (Pilot phase)

Metric/Functionality

L3 link utilisation HistoricalL3 link capacity Historical

L2 circuit status Latest L2 circuit status Historical

OWD, IPDV, OWPL Historical

traceroute Historical

Delay RTT On-demand

show command On-demand

Traceroute On-demand

Achievable throughput (TCP) On-demand

UDP throughput On-demandService discovery

Service

Telnet/SSH MP

BWCTL MP (3 instances per REN)

Lookup Service

L2 status MP(*)

RRD MA or SQL MA

SQL MA (*)Hades MA (3 tool deployment per REN)

(*) To offer L2 status information, you can either chose the L2 status MP or SQL MA.An NRENs will only provide L2 status information when offering L2 circuits to LHC and DEISA.

Page 19: Connect. Communicate. Collaborate Place your organisation logo in this area GN2 Multidomain Monitoring Service: Serving IP NOCs Nicolas Simar, DANTE APM

Connect. Communicate. CollaborateMDM Service Support

• Infrastructure to support the perfSONAR web-services and the visualisation tools used by the MDM will be set-up.– For the deployers: installation, configuration, incident, monitoring.– For users: installation, utilisation.

Users(NOC, PERT,

Projects)

Deployers(RENs)

SLA

User Service DeskISS

SLADeployer Service

Desk

Page 20: Connect. Communicate. Collaborate Place your organisation logo in this area GN2 Multidomain Monitoring Service: Serving IP NOCs Nicolas Simar, DANTE APM

Connect. Communicate. CollaborateMDM Service Support• Level1 – Service Desk (ISS)

– Help to install, configure the tools, run reachability tests, help on usability, track the RFE, forward problem to proper person, log the requests, update the documentation, track bugs. This is a central function (rotating member or group of people - ownership).

• Level2 – Administrator (RENs)– Administrator of the machines where the services are installed. The

function lies within the providers. They are in charge of taking care of the security of the services, of their availability (up) and reachability (no firewall, etc). The service should be available 24/7.

• Level3 – Developers (3 years subcontract).– The JRA1 developers who have build the services. They are in charge of

implementing new features and fixing bugs and of answering the query forwarded by level1.

• The three levels of support will be available to both the users and the deployers.

Page 21: Connect. Communicate. Collaborate Place your organisation logo in this area GN2 Multidomain Monitoring Service: Serving IP NOCs Nicolas Simar, DANTE APM

Connect. Communicate. CollaborateMDM Service Support

• A turn key solution service could be provided for the web-services of a MDM service or part of it.– HW bought.– Web-services installed, monitored and managed on the REN

behalves. – REN would still have to do a little bit.

• More information about the MDM service in January– Transition to Service session on Tuesday afternoon. – What question have you got to be answered during that session?

Page 22: Connect. Communicate. Collaborate Place your organisation logo in this area GN2 Multidomain Monitoring Service: Serving IP NOCs Nicolas Simar, DANTE APM

Connect. Communicate. CollaborateVisualisations

Advance Trouble-shooting

Trouble-shooting

Service Health Check

Project Trouble-shooting

Project Service Health check

SLA Verificati

on

Tailored Added Value

Functional

Yes Yes Yes Yes Yes v

Layer2 ProjectLayer3 ProjectPIP Project

Yes Yes [o]+[a] Yes v Yes v

Yes Yes Yes Yes Yes Yes

User group and their Monotoring Data Usage.

PERTNOC

perfsonarUICNM

Visualisation Tools… … and their usages.

NEMO

VisualperfSONARJRA4 E2E L2 visualisation

In Red: Targeted for the Pilot. In Orange: Probably targeted for the Prototype (in addition to the Pilot ones)To find out what user group will use as visualisation tool, chose one type of usageand find out, in the same column in the second table the tools available for this usage.