21
Application of Fault Injection to Globus Grid Middleware Nik Looker & Jie Xu University of Leeds, Leeds. LS2 9JT, UK Tianyu Wo & Jinpeng Huai Beihang University, Beijing 100083, PRC School of Computing FACULTY OF ENGINEERING

Application of Fault Injection to Globus Grid Middleware

Embed Size (px)

DESCRIPTION

School of Computing FACULTY OF ENGINEERING. Application of Fault Injection to Globus Grid Middleware. Nik Looker & Jie Xu University of Leeds, Leeds. LS2 9JT, UK Tianyu Wo & Jinpeng Huai Beihang University, Beijing 100083, PRC. 1. A Historical Perspective. Dependability & Security. - PowerPoint PPT Presentation

Citation preview

Page 1: Application of Fault Injection to Globus Grid Middleware

Application of Fault Injection to Globus Grid Middleware

Nik Looker & Jie Xu

University of Leeds, Leeds. LS2 9JT, UK

Tianyu Wo & Jinpeng Huai

Beihang University, Beijing 100083, PRC

1

School of ComputingFACULTY OF ENGINEERING

Page 2: Application of Fault Injection to Globus Grid Middleware

A Historical Perspective

Page 3: Application of Fault Injection to Globus Grid Middleware

Dependability & Security

• To understand dependability it is important to understand the three main concepts that it utilises:

• Attributes• Measurements of how Dependable and Secure a system is

• Threats• Things that may affect the Dependability and Security of a system

• Means• Ways of increasing the Dependability and Security of a system

Page 4: Application of Fault Injection to Globus Grid Middleware

Attributes

• Availability• The probability that a service is present and ready for use

• Reliability• The capability of maintaining the service and service quality

• Safety• The absence of catastrophic consequences

• Confidentiality• Information is accessible only to those authorised to use it

• Integrity• The absence of improper system alterations

• Maintainability• To undergo modifications and repairs

Page 5: Application of Fault Injection to Globus Grid Middleware

Threats

• Fault• A fault is a defect in a system

• Error• An error is a discrepancy between the behaviour of a system and

its specified behaviour within the system boundary• i.e. it enters an unspecified state

• Failure• A failure is an instance in time when a system displays behaviour

that is contrary to its specification at the system boundary

Page 6: Application of Fault Injection to Globus Grid Middleware

Fault-Error-Failure Chains

• As a general rule:

• A fault, when activated, can lead to an error • An error is an invalid state

• An invalid state generated by an error may lead to either another error or a failure

• A generated error can be treated as another fault• A failure is an observable deviation from the specified

behaviour at the system boundary

Page 7: Application of Fault Injection to Globus Grid Middleware

Means

• Dependability means are ways of breaking fault-error-failure chains.

• Four main classifications:• Fault Prevention• Fault Removal• Fault Forecasting• Fault Tolerance

Page 8: Application of Fault Injection to Globus Grid Middleware

Fault Injection

• Fault Injection• MTBF may be very large

• Attempt to speed up this process by injecting faults • Cause the execution of seldom used control pathways within a system

• Either• A failure may occur• System’s fault tolerance mechanism will handle the fault• or the failure will go undetected and uncorrected :-(

• Network Level Fault Injection• Corrupt• Drop• Reorder

Page 9: Application of Fault Injection to Globus Grid Middleware

Network Level Fault Injection

Physical Network Hardware

Physical

Data Link

Network

Transport

Application

Middleware

Client

Middleware Transport

Physical Network Hardware

Physical

Data Link

Network

Transport

Application

Middleware

Service

Middleware Transport

Physical Medium

Network Protocol Stack

Middleware

Service/Client

Fault Injector

Page 10: Application of Fault Injection to Globus Grid Middleware

Modified Network Level Fault Injection

Network Protocol Stack

Middleware

Service/Client

Network Protocol Stack

Middleware

Client

Middleware Transport

Security

Processing and Dispatch

Fault Injector Hook Code

Network Protocol Stack

Middleware

Service

Middleware Transport

Security

Processing and Dispatch

Fault Injector Hook Code

Fault Injector

This allows a fault injector to intercept an entire middleware message, and thus we can decode it and modify specific parts of it.

Page 11: Application of Fault Injection to Globus Grid Middleware

Grid-FIT

Network Protocol Stack

Middleware

Service/Client

Network Protocol Stack

Axis 1.1

Client

org.apache.axis.transport

Signing and Encryption

Processing and Dispatch

Fault Injector Hook Code

Network Protocol Stack

Axis 1.1

Service

org.apache.axis.transport

Signing and Encryption

Processing and Dispatch

Fault Injector Hook Code

Grid-FIT

Page 12: Application of Fault Injection to Globus Grid Middleware

Injecting Faults in a Production Environment

Network Protocol Stack

Apache Axis 1.1

Service

Mac OS X Platform

Network Protocol Stack

Apache Axis 1.1

Service

Linux Platform

Network Protocol Stack

Apache Axis 1.1

Service

Solaris Platform

Network Protocol Stack

Apache Axis 1.1

Client

Windows Platform

Network Protocol Stack

Apache Axis 1.1

Service

Linux Platform

SOAP Messages over HTTP

Grid-FIT

org.apache.axis.transport

Signing and Encryption

Processing and Dispatch

Fault Injector Hook Code

org.apache.axis.transport

Signing and Encryption

Processing and Dispatch

Fault Injector Hook Code 1 2 4

3 5

Page 13: Application of Fault Injection to Globus Grid Middleware

System Model

Parameters

Messages

Methods

Services

System

wsdl:message

wsdl:part

wsdl:operation wsdl:portType

System Composed of Services

wsdl:message

wsdl:part

Page 14: Application of Fault Injection to Globus Grid Middleware

Extended Fault Model

Fault

Models

Sub-

Groups

High

Level

Groups

High

Level

EFM

Classify fault models according to high-level criteria

Classify according to type of Perturbation

Classify according to data

Specific Fault Models

Page 15: Application of Fault Injection to Globus Grid Middleware

Extended Failure Model

Failure

Mode

Sub-

Groups

High

Level

Groups

EFAM

Classify according to high-level criteria

Classify according to data

Specific Failure Modes

Page 16: Application of Fault Injection to Globus Grid Middleware

Failure Detection

System Model

EFM

EFAM

Page 17: Application of Fault Injection to Globus Grid Middleware

Application to Globus

• Initial experiments were based around Web Services• This resulted in the WS-FIT tool

• (Web Service - Fault Injection Technology)

• Ultimate aim was to apply this method to Grids• This has resulted in the Grid-FIT tool• Modifications and initial experiments have been conducted

• Modified hooks to work with Globus• Adapted FIT decoding to Globus message structure• Repeated an earlier set of experiments rewritten for Globus 4

Page 18: Application of Fault Injection to Globus Grid Middleware

Test Case

Client

Trading Service

Bank Service

Quote Service

Page 19: Application of Fault Injection to Globus Grid Middleware

Results

Page 20: Application of Fault Injection to Globus Grid Middleware

Future Work

• Apply Grid-FIT to complex systems• CoLaB

• Short for Collaboration of Leeds and Beihang, is a joint laboratory founded by the Beihang University, PRC & University of Leeds, UK. in 2005.

• The primary mission of CoLaB is research in Software and Security, each linked through a common objective

• To support the needs of the next generation of Internet computing.• CROWN

• Short for China Research and Development environment Over Wide-area Network, is a grid test bed to facilitate scientific activities in different disciplines.

• We are currently working on integrating Grid-FIT with CROWN• This will give Grid-FIT a large test bed to refine its method and models• This will give CROWN a native Dependability Assessment method• Part of the integration will be to integrate Grid-FIT as an Eclipse plug-in

Page 21: Application of Fault Injection to Globus Grid Middleware

Demonstrations & Workshop

• Demonstrations• Venue: White Rose Grid Stall• Wednesday 20th September 13:45 – 14:30• Thursday 21st September 10:00 - 10:45

• CROWN Tianyu Wo [email protected]• FT-Grid Paul Townend [email protected]• Grid-FIT Nik Looker [email protected]

• Mini-Workshop on UK-China e-Science Collaborations• Venue: Conference Room 1• Wednesday 20th September 17:00 - 19:00