Deploying Server-side File System Monitoring at NERSC · Late Breaking News Acknowledgements and...

Preview:

Citation preview

DeployingServer-side File

System Monitoring atNERSC

Andrew Uselton

The Franklin Cray XT4Cerebro

The Lustre Monitoring Tool

The Lustre Dashboard

Data AnalysisMonitoring Specific Tests orIntervals

Data Mining for Averageand Aggregate Behavior

A Simple Model for I/OPoisson Distributions

Franklin’s ActualDistribution

Late Breaking News

Acknowledgements andReferences

1.1

Deploying Server-side File SystemMonitoring at NERSCCray Users Group Proceedings May 7, 2009

Andrew UseltonNational Energy Research Scientific Computing Center

Lawrence Berkeley National Lab

DeployingServer-side File

System Monitoring atNERSC

Andrew Uselton

The Franklin Cray XT4Cerebro

The Lustre Monitoring Tool

The Lustre Dashboard

Data AnalysisMonitoring Specific Tests orIntervals

Data Mining for Averageand Aggregate Behavior

A Simple Model for I/OPoisson Distributions

Franklin’s ActualDistribution

Late Breaking News

Acknowledgements andReferences

1.2

Contents

1 The Franklin Cray XT4CerebroThe Lustre Monitoring ToolThe Lustre Dashboard

2 Data AnalysisMonitoring Specific Tests or IntervalsData Mining for Average and Aggregate Behavior

3 A Simple Model for I/OPoisson DistributionsFranklin’s Actual DistributionLate Breaking NewsAcknowledgements and References

DeployingServer-side File

System Monitoring atNERSC

Andrew Uselton

The Franklin Cray XT4Cerebro

The Lustre Monitoring Tool

The Lustre Dashboard

Data AnalysisMonitoring Specific Tests orIntervals

Data Mining for Averageand Aggregate Behavior

A Simple Model for I/OPoisson Distributions

Franklin’s ActualDistribution

Late Breaking News

Acknowledgements andReferences

1.3

Monitoring the I/O Subsystem

CN

network10.0

networkNERSC

Cerebro/LMT

Liberty

switch

fc

RAID

RAID

OST

OSS

OST

OST

OST

OSS

OST

OST

OST

OSTswitch

CN

CNfc

MDS

Net

Almanack

DeployingServer-side File

System Monitoring atNERSC

Andrew Uselton

The Franklin Cray XT4Cerebro

The Lustre Monitoring Tool

The Lustre Dashboard

Data AnalysisMonitoring Specific Tests orIntervals

Data Mining for Averageand Aggregate Behavior

A Simple Model for I/OPoisson Distributions

Franklin’s ActualDistribution

Late Breaking News

Acknowledgements andReferences

1.4

Cerebro

OST

OST

cerebro_metric_lmt_mds.so

cerebro_metric_lmt_ost.so

cerebro_monitor_lmt.so

OSS

/usr/lib/cerebro/*

cerebro_metric_lmt_oss.so

OST

OST

DeployingServer-side File

System Monitoring atNERSC

Andrew Uselton

The Franklin Cray XT4Cerebro

The Lustre Monitoring Tool

The Lustre Dashboard

Data AnalysisMonitoring Specific Tests orIntervals

Data Mining for Averageand Aggregate Behavior

A Simple Model for I/OPoisson Distributions

Franklin’s ActualDistribution

Late Breaking News

Acknowledgements andReferences

1.5

LMT

stats

uuid

OSS

OST

/proc/meminfo

/proc/stat

/proc/fs/lustre/obdfilter/*/

OST

OST

filesfree

OST

filestotal

kbytesfree

kbytestotal

numrefs

DeployingServer-side File

System Monitoring atNERSC

Andrew Uselton

The Franklin Cray XT4Cerebro

The Lustre Monitoring Tool

The Lustre Dashboard

Data AnalysisMonitoring Specific Tests orIntervals

Data Mining for Averageand Aggregate Behavior

A Simple Model for I/OPoisson Distributions

Franklin’s ActualDistribution

Late Breaking News

Acknowledgements andReferences

1.6

An OSS Tuple

Cerebro Protocol VersionHost NameCPU UtilizationMemory Utilization

1.0;nid04187;4.990020;39.303989

DeployingServer-side File

System Monitoring atNERSC

Andrew Uselton

The Franklin Cray XT4Cerebro

The Lustre Monitoring Tool

The Lustre Dashboard

Data AnalysisMonitoring Specific Tests orIntervals

Data Mining for Averageand Aggregate Behavior

A Simple Model for I/OPoisson Distributions

Franklin’s ActualDistribution

Late Breaking News

Acknowledgements andReferences

1.7

OST Data Values

Cerebro Protocol VersionHost NameUUIDBytes ReadBytes WrittenKbytes FreeKbytes UsedInodes FreeInodes Used

DeployingServer-side File

System Monitoring atNERSC

Andrew Uselton

The Franklin Cray XT4Cerebro

The Lustre Monitoring Tool

The Lustre Dashboard

Data AnalysisMonitoring Specific Tests orIntervals

Data Mining for Averageand Aggregate Behavior

A Simple Model for I/OPoisson Distributions

Franklin’s ActualDistribution

Late Breaking News

Acknowledgements andReferences

1.8

MDS Operations

mysql> select * from OPERATION_INFO;OPERATION_NAME UNITS OPERATION_NAME UNITS

req_waittime usec mds_getattr_lock usecreq_qdepth reqs mds_close usecreq_active reqs mds_reint usec

reqbuf_avail bufs mds_readpage usecost_reply usec mds_connect usec

ost_getattr usec mds_disconnect usecost_setattr usec mds_getstatus usecost_read bytes mds_statfs usecost_write bytes mds_pin usec

ost_create usec mds_unpin usecost_destroy usec mds_sync usecost_get_info usec mds_done_writing usecost_connect usec mds_set_info usec

ost_disconnect usec mds_quotacheck usecost_punch usec mds_quotactl usecost_open usec mds_getxattr usecost_close usec mds_setxattr usecost_statfs usec ldlm_enqueue usec

...

DeployingServer-side File

System Monitoring atNERSC

Andrew Uselton

The Franklin Cray XT4Cerebro

The Lustre Monitoring Tool

The Lustre Dashboard

Data AnalysisMonitoring Specific Tests orIntervals

Data Mining for Averageand Aggregate Behavior

A Simple Model for I/OPoisson Distributions

Franklin’s ActualDistribution

Late Breaking News

Acknowledgements andReferences

1.9

The Lustre Dashboard

DeployingServer-side File

System Monitoring atNERSC

Andrew Uselton

The Franklin Cray XT4Cerebro

The Lustre Monitoring Tool

The Lustre Dashboard

Data AnalysisMonitoring Specific Tests orIntervals

Data Mining for Averageand Aggregate Behavior

A Simple Model for I/OPoisson Distributions

Franklin’s ActualDistribution

Late Breaking News

Acknowledgements andReferences

1.10

Four IOR Tests

0

2000

4000

6000

8000

10000

1200022

:45

22:4

8

22:5

1

22:5

4

22:5

7

23:0

0

23:0

3

23:0

6

23:0

9

23:1

2

23:1

5

23:1

8

Dat

a R

ate

(MB/

s)

Time (PDT)

Aggregate OST rates from 2008-07-28 22:45:00

read ratewrite rate

DeployingServer-side File

System Monitoring atNERSC

Andrew Uselton

The Franklin Cray XT4Cerebro

The Lustre Monitoring Tool

The Lustre Dashboard

Data AnalysisMonitoring Specific Tests orIntervals

Data Mining for Averageand Aggregate Behavior

A Simple Model for I/OPoisson Distributions

Franklin’s ActualDistribution

Late Breaking News

Acknowledgements andReferences

1.11

24 Hours of LMT Data

0

1000

2000

3000

4000

5000

6000

7000

8000

9000

1000008

:00

10:0

0

12:0

0

14:0

0

16:0

0

18:0

0

20:0

0

22:0

0

00:0

0

02:0

0

04:0

0

06:0

0

Dat

a R

ate

(MB

/s)

Time (PDT)

read ratewrite rate

DeployingServer-side File

System Monitoring atNERSC

Andrew Uselton

The Franklin Cray XT4Cerebro

The Lustre Monitoring Tool

The Lustre Dashboard

Data AnalysisMonitoring Specific Tests orIntervals

Data Mining for Averageand Aggregate Behavior

A Simple Model for I/OPoisson Distributions

Franklin’s ActualDistribution

Late Breaking News

Acknowledgements andReferences

1.12

Daily Averages

0

0.5

1

1.5

2

2.5

3

3.5

4

4.5

07/01 08/01 09/01 10/01 11/01 12/01 01/01 02/01 03/01

Dat

a R

ate

(GB/

s)

Time (PDT)

Average daily rates

ReadWrite

DeployingServer-side File

System Monitoring atNERSC

Andrew Uselton

The Franklin Cray XT4Cerebro

The Lustre Monitoring Tool

The Lustre Dashboard

Data AnalysisMonitoring Specific Tests orIntervals

Data Mining for Averageand Aggregate Behavior

A Simple Model for I/OPoisson Distributions

Franklin’s ActualDistribution

Late Breaking News

Acknowledgements andReferences

1.13

http://en.wikipedia.org/wiki/Poisson_distribution:

fλ(k) =λk e−λ

k !

•C(m) = N × fλ(int(m/M))

DeployingServer-side File

System Monitoring atNERSC

Andrew Uselton

The Franklin Cray XT4Cerebro

The Lustre Monitoring Tool

The Lustre Dashboard

Data AnalysisMonitoring Specific Tests orIntervals

Data Mining for Averageand Aggregate Behavior

A Simple Model for I/OPoisson Distributions

Franklin’s ActualDistribution

Late Breaking News

Acknowledgements andReferences

1.13

http://en.wikipedia.org/wiki/Poisson_distribution:

fλ(k) =λk e−λ

k !

•C(m) = N × fλ(int(m/M))

DeployingServer-side File

System Monitoring atNERSC

Andrew Uselton

The Franklin Cray XT4Cerebro

The Lustre Monitoring Tool

The Lustre Dashboard

Data AnalysisMonitoring Specific Tests orIntervals

Data Mining for Averageand Aggregate Behavior

A Simple Model for I/OPoisson Distributions

Franklin’s ActualDistribution

Late Breaking News

Acknowledgements andReferences

1.14

Poisson Distribution: λ = 2

10

100

1 K

10 K

100 K

1 M

10 M

100 M

0 0.2 GB 0.4 GB 0.6 GB 0.8 GB 1 GB 1.2 GB 1.4 GB 1.6 GB 1.8 GB 2.0 GB

coun

t

m - The amount of data transferred during 5 second interval.

Poisson distribution

lambda = 2, M = 125MB, N = 250M

DeployingServer-side File

System Monitoring atNERSC

Andrew Uselton

The Franklin Cray XT4Cerebro

The Lustre Monitoring Tool

The Lustre Dashboard

Data AnalysisMonitoring Specific Tests orIntervals

Data Mining for Averageand Aggregate Behavior

A Simple Model for I/OPoisson Distributions

Franklin’s ActualDistribution

Late Breaking News

Acknowledgements andReferences

1.15

Poisson Distribution: λ = 20

10

100

1 K

10 K

100 K

1 M

10 M

100 M

0 0.2 GB 0.4 GB 0.6 GB 0.8 GB 1 GB 1.2 GB 1.4 GB 1.6 GB 1.8 GB 2.0 GB

coun

t

m - The amount of data transferred during 5 second interval.

Poisson distribution

lambda = 20, M = 40MB, N = 250M

DeployingServer-side File

System Monitoring atNERSC

Andrew Uselton

The Franklin Cray XT4Cerebro

The Lustre Monitoring Tool

The Lustre Dashboard

Data AnalysisMonitoring Specific Tests orIntervals

Data Mining for Averageand Aggregate Behavior

A Simple Model for I/OPoisson Distributions

Franklin’s ActualDistribution

Late Breaking News

Acknowledgements andReferences

1.16

250 M LMT Observations

10

100

1 K

10 K

100 K

1 M

10 M

0 500 1000 1500 2000 2500

Cou

nt

MB

Distribution of LMT observed rates

readwrite

DeployingServer-side File

System Monitoring atNERSC

Andrew Uselton

The Franklin Cray XT4Cerebro

The Lustre Monitoring Tool

The Lustre Dashboard

Data AnalysisMonitoring Specific Tests orIntervals

Data Mining for Averageand Aggregate Behavior

A Simple Model for I/OPoisson Distributions

Franklin’s ActualDistribution

Late Breaking News

Acknowledgements andReferences

1.17

Two weeks of recent observations

10

100

1 K

10 K

100 K

1 M

0 500 1000 1500 2000 2500

Cou

nt

MB

Distribution of LMT observed rates

readwrite

DeployingServer-side File

System Monitoring atNERSC

Andrew Uselton

The Franklin Cray XT4Cerebro

The Lustre Monitoring Tool

The Lustre Dashboard

Data AnalysisMonitoring Specific Tests orIntervals

Data Mining for Averageand Aggregate Behavior

A Simple Model for I/OPoisson Distributions

Franklin’s ActualDistribution

Late Breaking News

Acknowledgements andReferences

1.18

I would like to acknowledge and thank:

Al Chu The author of Cerebro.Herb Wartens The author of the Lustre Monitorining Tool

plug-ins.

Both work at Lawrence Livermore National Lab, whichsupported the development of these tools. Both were verygenerous with their time as I deployed the software on Franklin.

DeployingServer-side File

System Monitoring atNERSC

Andrew Uselton

The Franklin Cray XT4Cerebro

The Lustre Monitoring Tool

The Lustre Dashboard

Data AnalysisMonitoring Specific Tests orIntervals

Data Mining for Averageand Aggregate Behavior

A Simple Model for I/OPoisson Distributions

Franklin’s ActualDistribution

Late Breaking News

Acknowledgements andReferences

1.19

The software is available from:

Both applications are open source and available fromSourceforge.

Cerebro http://sourceforge.net/projects/cerebro

LMT http://sourceforge.net/projects/lmt/

If you would like hints and encouragement withgetting this software deployed, contact me:Andrew Uselton (acuselton@lbl.gov)If you get results from your deployment that youwould like to share, please do so.

Recommended