17
03/17/2014 Data Management System -Data Services- Temporary Experiment Data Brief Data Services News MUSTANG update- QC on PASSCAL Data PASSCAL SC Spring 2014 1

03/17/2014 Data Management System -Data Services- Temporary Experiment Data Brief Data Services News MUSTANG update- QC on PASSCAL Data PASSCAL SC Spring

Embed Size (px)

Citation preview

Page 1: 03/17/2014 Data Management System -Data Services- Temporary Experiment Data Brief Data Services News MUSTANG update- QC on PASSCAL Data PASSCAL SC Spring

03/17/2014

Data Management System -Data Services-

•Temporary Experiment Data Brief•Data Services News•MUSTANG update- QC on PASSCAL Data

PASSCAL SC Spring 2014 1

Page 2: 03/17/2014 Data Management System -Data Services- Temporary Experiment Data Brief Data Services News MUSTANG update- QC on PASSCAL Data PASSCAL SC Spring

Temporary DataIs 26% of DS Holdings

03/17/2014 PASSCAL SC Spring 2014 2

Page 3: 03/17/2014 Data Management System -Data Services- Temporary Experiment Data Brief Data Services News MUSTANG update- QC on PASSCAL Data PASSCAL SC Spring

Temporary Experiment DataIs Large, Important

03/17/2014 PASSCAL SC Spring 2014 3

Page 4: 03/17/2014 Data Management System -Data Services- Temporary Experiment Data Brief Data Services News MUSTANG update- QC on PASSCAL Data PASSCAL SC Spring

_PASSCAL Virtual Network 9577 Stations to date

03/17/2014 PASSCAL SC Spring 2014 4

http://www.iris.edu/gmap/_PASSCAL

Page 5: 03/17/2014 Data Management System -Data Services- Temporary Experiment Data Brief Data Services News MUSTANG update- QC on PASSCAL Data PASSCAL SC Spring

Notable DS Activities:

03/17/2014 PASSCAL SC Spring 2014 5

• The Livermore Auxiliary Data Center is operational• Currently being used by LLNL staff to access data using existing web services, was delivered to them on time• This ADC is used like its locally located on the same LAN, since it has 10Gb connectivity:

• Archiving of data automatically multicasts to both LLNL and local storage in Seattle; no need to cache

• Can service breq_fast and web service requests, not currently done routinely. (just weekly as function test)

• We anticipate installing a global load balancer to accommodate traffic to be sent to ADC when load is high in Seattle

Page 6: 03/17/2014 Data Management System -Data Services- Temporary Experiment Data Brief Data Services News MUSTANG update- QC on PASSCAL Data PASSCAL SC Spring

Notable DS Activities:(Cont’d)

03/17/2014 PASSCAL SC Spring 2014 6

• Data Services has begun integrating a 10% budget decrease

• We will migrate large, write-once read never data like PASSCAL Flex Array “RAW” data to lower-cost tape for example.

• I am currently undergoing an extensive audit to prioritize storage strategies, perform de-duplication, etc

Page 7: 03/17/2014 Data Management System -Data Services- Temporary Experiment Data Brief Data Services News MUSTANG update- QC on PASSCAL Data PASSCAL SC Spring

MUSTANG- QC Across All Data

03/17/2014 7PASSCAL SC Spring 2014

Page 8: 03/17/2014 Data Management System -Data Services- Temporary Experiment Data Brief Data Services News MUSTANG update- QC on PASSCAL Data PASSCAL SC Spring

What does MUSTANG Stand For?

I’ll only say this once: MUSTANG is an acronym violation that stands for:

Modular Utility for Statistical kNowl- edge Gathering

03/17/2014 8PASSCAL SC Spring 2014

Page 9: 03/17/2014 Data Management System -Data Services- Temporary Experiment Data Brief Data Services News MUSTANG update- QC on PASSCAL Data PASSCAL SC Spring

Data CoverageNetwork Start End # Records

Per Metric

_GSN 2010-02-26 2013-03-12 3.2 M

_PASSCAL 2007-01-01 2013-11-02 225 K

_OBSIP 2011-11-23 2012-05-20 103 K

_CASCADIA 2011-01-04 2013-03-12 125 K

TA 2013-01-07 2013-01-16 278 K

II 2011-01-04 2013-03-10 580 K

IU 2010-02-27 2013-03-10 789 K

(representative)

Latency measurements (all networks) are currently working on >36,000 channels!

http://www.iris.edu/files/MUSTANG/reports/_PASSCAL.metrics.txt

03/17/2014 9PASSCAL SC Spring 2014

Page 10: 03/17/2014 Data Management System -Data Services- Temporary Experiment Data Brief Data Services News MUSTANG update- QC on PASSCAL Data PASSCAL SC Spring

March 2014: Decision to use Livermore (ADC) for MUSTANG

Since FDSN web services are currently installed and running at the offsite Auxiliary Data Center at LLNL, we’ll utilize the server and storage VM environment and update the RDBMS in Seattle simultaneously.

This will act as a test bed, but in addition will offload Seattle resources which are currently at maximum I/O

03/17/2014 10PASSCAL SC Spring 2014

Page 11: 03/17/2014 Data Management System -Data Services- Temporary Experiment Data Brief Data Services News MUSTANG update- QC on PASSCAL Data PASSCAL SC Spring

Accessing MUSTANGMetrics

There will be a “live” web service front end soon: http://service.iris.edu/mustangbeta/measurement

s/1 is main landing page, with help

A URL builder to help construct the query and get correct syntax is directly reached using:

http://service.iris.edu/mustangbeta/measurements/docs/1/builder/

03/17/2014 11PASSCAL SC Spring 2014

Page 12: 03/17/2014 Data Management System -Data Services- Temporary Experiment Data Brief Data Services News MUSTANG update- QC on PASSCAL Data PASSCAL SC Spring

Quick Look at the Interface; Similar to all current web services:

03/17/2014 12PASSCAL SC Spring 2014

Page 13: 03/17/2014 Data Management System -Data Services- Temporary Experiment Data Brief Data Services News MUSTANG update- QC on PASSCAL Data PASSCAL SC Spring

Available Soon: “Visualizing” Metrics There is a beta version of a web service front end

that can access the stored metrics that will be similar to this unreleased version:

http://mazamascience.com/MUSTANGDatabrowser/

03/17/2014 13PASSCAL SC Spring 2014

Page 14: 03/17/2014 Data Management System -Data Services- Temporary Experiment Data Brief Data Services News MUSTANG update- QC on PASSCAL Data PASSCAL SC Spring

We are committed to PH5 Format Metadata Exposure

We will enable parsing of station/site metadata that is currently stored in PH5 so that utilities like http://www.iris.edu/gmap can display locations and enable increased awareness

NOTE: This is for PH5 data sets only and will not include “assembled” data sets, as these are in no standardized or parse-able format

03/17/2014 14PASSCAL SC Spring 2014

Page 15: 03/17/2014 Data Management System -Data Services- Temporary Experiment Data Brief Data Services News MUSTANG update- QC on PASSCAL Data PASSCAL SC Spring

We have begun the migration out of Oracle RDBMS We have just entered into an agreement to

migrate our 2.1Tb Oracle RDBMS to Postgres. (EnterpriseDB)

We will not renew FY15 Oracle in October but can still use it, without support, so we are in a hurry

We currently have a solid list of known unknowns and workarounds, and Oracle-specific procedures that we’ll have to work on.

Consider this a traveler’s advisory, but we intend it to be transparent externally

03/17/2014 15PASSCAL SC Spring 2014

Page 16: 03/17/2014 Data Management System -Data Services- Temporary Experiment Data Brief Data Services News MUSTANG update- QC on PASSCAL Data PASSCAL SC Spring

We have begun the migration out of Oracle RDBMS (cont’d)

We will have support and professional help We currently have 9 Postgres databases in

operation so we have PL/PGSQL tribal knowledge that we can leverage.

Not totally new, but PostgreSQL dialects are different.

We hope to have it majorly done in 3 months

03/17/2014 16PASSCAL SC Spring 2014

Page 17: 03/17/2014 Data Management System -Data Services- Temporary Experiment Data Brief Data Services News MUSTANG update- QC on PASSCAL Data PASSCAL SC Spring

That’s all for now

Questions? Comments? Requests?

03/17/2014 PASSCAL SC Spring 2014 17