dCache. Patrick Fuhrmann, presented by Peter van der Reest, Tigran Mkrtchyan. HEPiX Fall 2003, TRIUMF, Vancouver

dCache

Patrick Fuhrmann

presented by Peter van der Reest

Tigran Mkrtchyan

HEPiX Fall 2003, TRIUMF, Vancouver


Overview

dCache basics
- Goals
- Modes of operation
- Attraction Model
- Dataset Location Management
- dCache Components (Overview)
- Native dCache access method (dCap)

The GRID Storage Resource Manager (SRM)
- The Goal of the SRM Initiative
- The Storage System Abstraction
- An SRM initiated transfer example
- Major SRM Topics

P. Fuhrmann, T. Mkrtchyan, v.d. Reest Hepix Fall 2003


dCache : Goals

Robot related
- Reducing robot activities
- Optimizing access to high capacity (slow) tape systems

- Unique access methods to a highly distributed data repository
- Hiding possible media transfers from clients (Tape <-> Disk)
- Providing common access protocols (Kerberos, Grid FTP)
- Fault tolerant regarding storage nodes (failover of disk storage)


dCache : Modes of Operation

Used as an HSM frontend, the dCache provides standard caching mechanisms to optimize tape accesses:
- Transfer speed adaption
- Tunable deferred HSM stores (space, time)
- Automatic staging
- Continuous garbage collection (no thresholds)
- Fetch ahead (from HSM) [in preparation]
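The "tunable deferred HSM stores (space, time)" item can be illustrated with a minimal sketch. It assumes that a pool collects newly written files and flushes them to tape once either a pending-data limit or a maximum waiting time is reached. The class `PendingStore` and its two parameters are hypothetical names for illustration only; they are not taken from dCache.

```python
from dataclasses import dataclass, field


@dataclass
class PendingStore:
    """Files written to a pool but not yet copied to the HSM (hypothetical model)."""

    max_pending_bytes: int    # "space" tunable: flush once this much data is waiting
    max_age_seconds: float    # "time" tunable: flush once the oldest file is this old
    files: list = field(default_factory=list)  # (arrival_time, size_bytes) tuples

    def add(self, arrival_time: float, size_bytes: int) -> None:
        """Register a newly written file that still has to go to tape."""
        self.files.append((arrival_time, size_bytes))

    def should_flush(self, now: float) -> bool:
        """True when either the space or the time limit has been reached."""
        if not self.files:
            return False
        pending = sum(size for _, size in self.files)
        oldest = min(arrival for arrival, _ in self.files)
        return (pending >= self.max_pending_bytes
                or now - oldest >= self.max_age_seconds)
```

Deferring stores this way lets many small writes be bundled into fewer tape mounts, which matches the stated goal of reducing robot activity.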


dCache : Modes of Operation

- dCache pools without HSM backend can hold:
  - Precious datasets: files are never automatically removed. The system can run out of disk space.
  - Volatile datasets: unused files are automatically removed. The system won't run out of disk space.
- The dCache can be operated in hybrid mode, running HSM and non-HSM pools.
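The difference between precious and volatile pools can be sketched as follows. This is an illustration under stated assumptions, not dCache code: the class `DiskPool` is hypothetical, and "unused" is approximated by least-recently-used order, which the slides do not specify.

```python
from collections import OrderedDict


class DiskPool:
    """Toy disk pool without HSM backend (hypothetical model)."""

    def __init__(self, capacity: int, volatile: bool):
        self.capacity = capacity
        self.volatile = volatile
        self.files = OrderedDict()  # name -> size, least recently used first

    def used(self) -> int:
        return sum(self.files.values())

    def read(self, name: str) -> int:
        """Touch a file so that it counts as recently used; return its size."""
        self.files.move_to_end(name)
        return self.files[name]

    def write(self, name: str, size: int) -> None:
        if size > self.capacity:
            raise OSError("file is larger than the pool")
        while self.used() + size > self.capacity:
            if not self.volatile:
                # Precious pool: nothing is removed automatically.
                raise OSError("pool full")
            # Volatile pool: drop the least recently used file.
            self.files.popitem(last=False)
        self.files[name] = size
```

A volatile pool always makes room for a new file that fits, while a precious pool reports an error once it is full.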


dCache : Modes of Operation

No HSM backend but central ReplicaManager
- Stores precious files on cheap, non-RAID disks
- Ensures min, max replica count per file
- Automatically adjusts min, max count on pool failures
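The replica-count rule can be sketched as a small planning function. The function name `plan_replicas` and the data layout are assumptions made for illustration; the actual ReplicaManager interface is not described in the slides.

```python
def plan_replicas(locations, online_pools, min_copies, max_copies):
    """Return {file: (action, count)} for files outside the [min, max] range.

    locations    -- mapping of file name to the set of pools holding a copy
    online_pools -- set of pools that are currently reachable
    """
    online = set(online_pools)
    plan = {}
    for name, pools in locations.items():
        live = len(set(pools) & online)  # copies on failed pools do not count
        if live < min_copies:
            plan[name] = ("replicate", min_copies - live)
        elif live > max_copies:
            plan[name] = ("reduce", live - max_copies)
    return plan
```

Calling the function again after a pool has dropped out of `online_pools` yields the extra copies that are needed to get back above the minimum.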


dCache : The Attraction Model

Files (resp. store/retrieve requests) are attracted by pools, based on:
- Statically configured parameters, e.g.:
  - Client host IP or subnet numbers
  - HSM groups
  - Subdirectory trees
- Parameters taken dynamically from the live system, e.g.:
  - Pool CPU cost (number of active movers)
  - Pool space cost (space left, age of datasets)
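A minimal sketch of such a selection: static parameters filter the candidate pools, and dynamic costs rank the remaining ones. All names, the cost formulas and the equal weighting are assumptions for illustration. The age of datasets and HSM groups are left out, and the subdirectory check is a plain string-prefix test.

```python
import ipaddress
from dataclasses import dataclass


@dataclass
class Pool:
    name: str
    subnets: list        # static: client subnets served, e.g. ["192.0.2.0/24"]
    path_prefix: str     # static: subdirectory tree served
    active_movers: int   # dynamic: transfers currently running
    max_movers: int
    free_bytes: int      # dynamic: space left
    total_bytes: int

    def matches(self, client_ip: str, path: str) -> bool:
        """Static part: does this pool accept the client and the directory?"""
        ip = ipaddress.ip_address(client_ip)
        in_subnet = any(ip in ipaddress.ip_network(s) for s in self.subnets)
        return in_subnet and path.startswith(self.path_prefix)

    def cost(self, cpu_weight: float = 1.0, space_weight: float = 1.0) -> float:
        """Dynamic part: lower is better (assumed formula)."""
        cpu_cost = self.active_movers / self.max_movers
        space_cost = 1.0 - self.free_bytes / self.total_bytes
        return cpu_weight * cpu_cost + space_weight * space_cost


def select_pool(pools, client_ip, path):
    """Return the cheapest pool that matches the static rules, or None."""
    candidates = [p for p in pools if p.matches(client_ip, path)]
    return min(candidates, key=Pool.cost, default=None)
```

Separating the static filter from the dynamic ranking keeps site policy (who may use which pool) independent of load balancing.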


dCache : Dataset Location Management

- On increasing load of pool nodes, the cache creates dataset duplicates on moderately used nodes to smooth out hot spots.
- Decreasing load marks dataset duplicates for removal in case space is running short.
- Datasets can be defined 'sticky', independently of their status: CACHED, DUPLICATED or ON TAPE ONLY.


dCache Access Scheme

[Diagram. Labels: Application, dCap Lib, Control Line, Data Lines, I/O Door Nodes. Protocols shown: GFtp, (K)Ftp, dCap (Krb5, ssl), Http.]


dCache Components

[Diagram. Labels: I/O Door Nodes (SRM, GFtp, dCap (Krb5, ssl), (K)Ftp, Http), Admin Doors, Pool Manager, Pool Nodes, File Name Space Provider, File Name Space Database, Nfs Server, pNfs, HSMs (OSM, Enstore, Tsm, Hpss ?).]


d2Cache : stacked dCache

[Diagram. Labels: Clients, H1, Zeus, Others, Experiment specific Policies, Level II cache (IT), IT specific Policies, HSMs (OSM, Tsm).]


dCache : Native Access Method (dCap)

Besides supporting Ftp, GFtp, KerberosFtp and Http, dCache defines a native access protocol (dCap), allowing POSIX-like file operations.

dCache provides a dCap C-language implementation:
- As shared object or preload library
- For Linux, Solaris, IRIX and Windows XP
- Supporting automatic reconnect on network or server problems
- Interfacing ROOT
- Providing security tunnels for Kerberos and ssl


dCache : DESY setup and statistics

- 30 TBytes first level cache (owned by experiments)
- 20 TBytes second level cache (owned by IT)
- 8 TBytes write cache (owned by IT)
- organized in 30 + 25 + 6 hosts
- currently 4 files / second ~ 1.6 GBytes/sec sustained
- 450 TBytes in HSM storage (STK and ADIC robotics)
- 50,000 – 160,000 files / day or 2 – 20 TBytes / day


Moderate Fermi dCache users

- MiniBooNE (Neutrino) -> GFtp and dCache
- Auger (Cosmic Rays) -> GFtp from Argentina and France
- Grid Condor Project (High Throughput Computing): NeST -> SRM - dCache -> Enstore
- Minos (Neutrino) -> GFtp from Soudan mine
- D0 (Tevatron) -> dccp, SAM

Grid KA (Karlsruhe) in preparation


Heavy Fermi dCache users

- Installations at Fermi, San Diego, Italy and CERN
- 100 TBytes of disk space (with storage backend, Enstore)
- CMS – US Groups
- CDF (Tevatron) Experiment
- Measured up to 50 TBytes transferred per day
- Approaching 500 TBytes of disk space end of next year
- 20 TBytes of scratch pool space (without storage backend)


The Storage Resource Manager Initiative

In order to make site-local storage resources like disk space, tertiary storage space and large quantities of HEP information globally available, e.g. in the GRID context, an initiative has been set up by JLAB, FermiLab, LBNL and CERN. It defines an abstract storage system, covering methods for:
- Storing and retrieving datasets
- Obtaining status information about datasets
- Pinning datasets (guarantee of availability)
- Negotiating data transfer protocols
- Defining dataset lifetimes

HEP sites intending to access remote storage resources are assumed to implement the SRM protocol in their local storage system. For the dCache, FermiLab took over this task.


Storage System Abstraction

[Diagram. Labels: SRM Client, SRM, Virtual Storage Layer, dCache, Cache, HSM, OSM, Enstore, Tsm, Jasmine, HPSS. Functions shown: Store/Retrieve; Prestaging, Space allocation, Pinning; Transfer Protocol Negotiation; Data Transfer (e.g. GFtp). Interactions: Client <-> System, System <-> System.]


SRM Initiated Transfer

[Diagram. Participants: Replica Catalog, Any SRM, FNAL SRM, dCache, Enstore (STAGE). Messages:]
- map LFN -> SURL: CERN-dataset/user/p/file1
- REPLY (SURL): srm://srm.fnal.gov/cern/user/p/file1
- GET (SURL): srm://srm.fnal.gov/cern/user/p/file1
- REPLY (Request ID): #-12345
- STATUS (Request ID): #-12345
- REPLY (TURL): gsiftp://pnfs/fnal.gov/cern/user/p/file1
- GET (TURL): gsiftp://pnfs/fnal.gov/cern/user/p/file1
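The message sequence of this slide (catalog lookup, GET on the SURL, polling STATUS with the request ID until a TURL is returned) can be sketched with in-memory stand-ins. Everything here is hypothetical: `ReplicaCatalog`, `MockSrm` and `fetch_turl` are illustrative names, the mock builds its TURL by simply swapping the URL scheme, and no real SRM client API is used.

```python
import itertools


class ReplicaCatalog:
    """Maps logical file names (LFN) to site URLs (SURL)."""

    def __init__(self, mapping):
        self.mapping = mapping

    def lookup(self, lfn):
        return self.mapping[lfn]


class MockSrm:
    """In-memory stand-in for an SRM endpoint that needs time to stage a file."""

    def __init__(self, polls_until_staged=2):
        self._ids = itertools.count(12345)
        self._requests = {}
        self._polls = polls_until_staged

    def get(self, surl):
        """GET (SURL): register the request and reply with a request ID."""
        request_id = next(self._ids)
        self._requests[request_id] = {"surl": surl, "polls_left": self._polls}
        return request_id

    def status(self, request_id):
        """STATUS (Request ID): None while staging, otherwise the TURL."""
        request = self._requests[request_id]
        if request["polls_left"] > 0:
            request["polls_left"] -= 1
            return None
        # Assumed mapping for the mock: same host and path, gsiftp scheme.
        return "gsiftp://" + request["surl"][len("srm://"):]


def fetch_turl(catalog, srm, lfn, max_polls=10):
    """Resolve an LFN to a transfer URL, polling until the file is staged."""
    surl = catalog.lookup(lfn)
    request_id = srm.get(surl)
    for _ in range(max_polls):
        turl = srm.status(request_id)
        if turl is not None:
            return turl
    raise TimeoutError("request %d not ready" % request_id)
```

The final GET on the TURL would then be an ordinary transfer (e.g. GFtp) and is left out of the sketch.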


SRM Topics (SRM 2.1)

- Storing and retrieving datasets
- Obtaining dataset status information
- Pinning datasets (make DS available)
- Transfer protocol negotiation (includes direct I/O methods rfio, dCap, globus-xio)
- File space allocation / reservation
- Dataset / space lifetime definitions (volatile, durable, permanent)
- SRM to SRM third party transfers
- Directory support (mkdir / rmdir)
- Security (SRM will support GSI over HTTP)
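Transfer protocol negotiation can be illustrated with a minimal sketch. It assumes that the client sends its protocols in order of preference and that the server answers with the first one it supports. The function `negotiate_protocol` is a hypothetical helper, not part of any SRM specification or library.

```python
def negotiate_protocol(client_preferences, server_supported):
    """Return the first client-preferred protocol the server supports, or None."""
    supported = set(server_supported)
    for protocol in client_preferences:
        if protocol in supported:
            return protocol
    return None
```

For example, a client preferring `["dcap", "gsiftp", "http"]` against a server offering `["gsiftp", "http"]` would end up with `gsiftp`.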


For Details

- dCache: www.dCache.org
- SRM: http://sdm.lbl.gov/srm-wg/
