1 Future Directions in HST Data Processing 19 November 2004

Future Directions in HST Data Processing

19 November 2004

History (2002-2003)

• Several parts of the system exhibited unacceptable performance
  • Processing of data from HST to the Archive
  • Response time to user requests for data from Archive
• Several specific causes
  • NFS mount problems
  • Disk corruption in OPUS
  • Jukebox problems
  • Other specific hardware problems
• Symptomatic of more general problems with the data processing systems

History (2002-2003)

2002 Architecture

[Diagram labels: TRU64, VMS, SOLARIS]

Goals for re-implementation (2003-2004)

• Reduce total cost of data processing activities
• Simplify hardware and network architecture
  • Reduce Operating Systems from 3 to 1
  • Consolidate many boxes into two highly reliable boxes
  • Flexible allocation of computing resources
• Provide common development, test, and operational environments
  • Development and test systems could not replicate load of operational systems
• Improve ability to capture performance data, metrics, etc.

Goals for re-implementation (2003-2004)

• Continue planned pipeline evolution
  • DADS Distribution redesign provides more flexibility to users and operators
  • Storing copy of Raw Data on EMC will dramatically reduce load and reliance on Jukeboxes
  • Ingest redesign provides opportunity to finally end the arbitrary boundary between OPUS and DADS

New Architecture

[Diagram. Labeled components: HST; FUSE Engineering Team; FROM: Science Community (thru StarView); TO: Science Community; SUN FIRE 15K; EMC Storage Array; 2GB Switch; EMC Backup Manager; SUN Backup Manager; Fatkat; Imagemaker; ARCHC; SCARAB; Corsair; Ops-Jukeboxes; Dev-Jukeboxes; DVD Drives (W DVD); CD Burners. Labeled links: Fiber Channel; Ethernet 10/100 Connections to Institute Network and Internet.]

SUN FIRE 15K Domain Config

[Diagram: 7 Dynamically Re-Configurable Domains, shown with EMC storage labels: OPUS/Archive OPS; Databases OPS (listed twice); Code Development; System Test; Database Test; OS/Security Test. Other labels: Opus/Archive; To Test and Development Environment.]

Re-implementation completed in 2004

• New Archive Ingest system installed June 2
  • Removes artificial boundary between OPUS pipeline and Archive
  • Removes VMS systems from input side of the Archive
• Upgraded to 64 bit compiler on June 21
• Tuned the EMC disk system over the summer
• Multidrizzle installed in ACS pipeline Sept. 22
  • Extra compute load is well within system capacity

Current Retrieval Performance

Near-term Plans

• Clean up remnants of old architecture
  • Totally decouple Jukeboxes from retrieval/ingest
  • Determine future of safestore
    • Media Jukeboxes (?)
• Science value-added improvements
  • Multidrizzle
  • Accuracy of coordinates
  • Calibration upgrades (e.g. STIS close-out)

Multidrizzle

• Dither Working Group is leading development of Multidrizzle (A. Koekemoer et al.)
• Recent steps:
  • Multidrizzle implemented in Pipeline for ACS associations
  • Same version released last week in STSDAS
• Next steps – more Instruments:
  • STSDAS version can handle STIS imaging, WFPC2 data
  • Extend association definitions for ACS, WFPC2 to cover “pos targs” and other data groups
  • Implement WFPC2 and STIS imaging in pipeline (mostly requires additional scientific verification and validation)
  • Use for NICMOS and STIS spectroscopy will require some additional software development

Multidrizzle Next steps – Tweakshift

• Currently:
  • Pipeline Multidrizzle uses headers for offsets between images
  • STSDAS version can accept measured offsets
• Prototype Tweakshift (in STSDAS) to measure offsets from images
  • Via object identification
  • Via cross-correlations
  • Via wavelets (TBD)
• After evaluation via STSDAS release, move to pipeline processing
  • Eliminates small pointing errors between images and orbits
  • Provides opportunity to improve absolute position reference
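To make the "via cross-correlations" option concrete, here is a minimal sketch of measuring an integer-pixel offset between two images by brute-force cross-correlation. It is not the Tweakshift implementation; the names `point_source_image` and `measure_offset`, the search window `max_shift`, and the plain list-of-lists image format are all assumptions made for illustration. A real tool would likely use FFTs and fit the correlation peak to sub-pixel precision.

```python
def point_source_image(shape, sources):
    """Build a zero-background test image (hypothetical helper).

    shape is (rows, cols); sources maps (row, col) -> flux.
    """
    rows, cols = shape
    image = [[0.0] * cols for _ in range(rows)]
    for (y, x), flux in sources.items():
        image[y][x] = flux
    return image


def measure_offset(ref, img, max_shift=5):
    """Return (dy, dx) such that ref[y][x] best matches img[y + dy][x + dx].

    Both images must have the same shape. Every integer shift within
    +/- max_shift pixels is tried, and the shift with the largest
    mean-subtracted cross-correlation over the overlap region is returned.
    """
    n_rows, n_cols = len(ref), len(ref[0])
    ref_mean = sum(map(sum, ref)) / (n_rows * n_cols)
    img_mean = sum(map(sum, img)) / (n_rows * n_cols)
    best_score, best_shift = float("-inf"), (0, 0)
    for dy in range(-max_shift, max_shift + 1):
        for dx in range(-max_shift, max_shift + 1):
            score = 0.0
            # Only visit pixels where both ref[y][x] and img[y+dy][x+dx] exist.
            for y in range(max(0, -dy), min(n_rows, n_rows - dy)):
                for x in range(max(0, -dx), min(n_cols, n_cols - dx)):
                    score += (ref[y][x] - ref_mean) * (img[y + dy][x + dx] - img_mean)
            if score > best_score:
                best_score, best_shift = score, (dy, dx)
    return best_shift
```

The measured shift could then replace the header-derived offset when combining the images, which is the role the slide describes for measured offsets.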

Accuracy of Coordinates

• Accuracy of WCS coordinates is ~ 1-2 arc-seconds
  • Original Guide Star Catalog limits accuracy
  • Slow Drift of FGSs relative to SIs has not been accurately calibrated
• Imaging data
  • Tying GSCII to Hipparcos frame provides opportunity to improve accuracy
  • Use objects found by Tweakshift to directly determine WCS coordinates – not affected by drift of FGSs relative to SIs
  • Evaluations have been successful
• Small FOV data sets (e.g. STIS)
  • Retroactively calibrate drift of apertures
  • Imaging data can provide FGS-to-FGS drifts
• Goal is accuracies of ~0.1-0.2 arc-seconds
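As an illustration of using matched objects to correct WCS coordinates, the sketch below estimates a mean pointing offset from objects whose image-derived positions have been matched to catalog positions. It is an assumption-laden example, not the procedure used in the pipeline: the function name `mean_wcs_offset` and its inputs are hypothetical, positions are (RA, Dec) pairs in degrees, and a small-angle approximation is used.

```python
import math


def mean_wcs_offset(image_positions, catalog_positions):
    """Mean (catalog - image) offset in arcseconds, returned as (east, north).

    Both arguments are equal-length lists of (ra_deg, dec_deg) tuples for
    the same objects, in the same order. Small-angle approximation: the RA
    difference is scaled by cos(Dec) to get an on-sky eastward offset.
    """
    if not image_positions or len(image_positions) != len(catalog_positions):
        raise ValueError("need equal-length, non-empty lists of matched positions")
    east_sum = north_sum = 0.0
    for (ra_img, dec_img), (ra_cat, dec_cat) in zip(image_positions, catalog_positions):
        # Wrap the RA difference into [-180, 180) degrees to handle RA = 0/360.
        d_ra = (ra_cat - ra_img + 180.0) % 360.0 - 180.0
        east_sum += d_ra * math.cos(math.radians(dec_cat)) * 3600.0
        north_sum += (dec_cat - dec_img) * 3600.0
    n = len(image_positions)
    return east_sum / n, north_sum / n
```

An offset of 1-2 arc-seconds from such a comparison would correspond to the current accuracy quoted above; the stated goal is to bring the residual down to ~0.1-0.2 arc-seconds.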

Longer-Term Plans

• January, 2004 decision re SM4 led us to look for ways to augment the scientific value of the HST data processing and archive services
  • Stimulate an even broader re-use of the Hubble data
  • Support a more aggressive archival research program, in the event there is a major gap between HST and JWST
• Timed well with respect to STScI planning for other missions we support (Kepler, JWST, NVO, etc.)
• We will be developing a proposal for enhancing the Hubble data services
  • Minimal disruption to the core architecture we have just finished
  • Add an outer “layer” that builds on the core
    • Higher level science products
    • Faster response time
    • NVO compatible

Hubble Data Archive
Current Situation

[Diagram components: Archive User; Safe Store; Core Archive]

• Process data from HST
• Re-process requested data for user
• All data is available

• Retrieval Times: Hours

Hubble Data Archive
Potential Enhancements

[Diagram components: Archive User; Safe Store; Core Archive; Extended Archive (Calibrated Data, Catalogs, Derived Products)]

• Cutouts
• URL-based access
• Retrieval Times: Seconds

• Most calibrated data online
• Automatic updates from core
• Improved WCS coordinates
• Object catalogs
• Space/time “footprint” service
• Other high-level products…

• Process data from HST
• Re-process data for extended archive
• All data is available
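As a rough illustration of the "Cutouts" item, a cutout service returns only a small region of a stored image rather than the full frame. The sketch below shows the core clipping step on a plain list-of-lists image. The function name `cutout` and its parameters are hypothetical, and nothing here reflects an actual archive interface.

```python
def cutout(image, center_row, center_col, half_size):
    """Return the square sub-image around a pixel, clipped at the image edges.

    image is a list of rows; the result spans up to 2 * half_size + 1
    pixels per side and is smaller where the window runs off the image.
    """
    n_rows, n_cols = len(image), len(image[0])
    row_lo = max(0, center_row - half_size)
    row_hi = min(n_rows, center_row + half_size + 1)
    col_lo = max(0, center_col - half_size)
    col_hi = min(n_cols, center_col + half_size + 1)
    return [row[col_lo:col_hi] for row in image[row_lo:row_hi]]
```

Returning a few hundred pixels instead of a full calibrated frame is one way retrieval times could drop from hours toward seconds, provided the calibrated data are already online.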

Next steps

• Formulate specific plans and schedules for near-term (~ next year) upgrades
  • Multidrizzle
  • Coordinate accuracies
• Document our concept for enhanced HST data services for HSTP and HQ
• We welcome STUC input to both processes