13
1 VO-lec 1-2 Edwin A. Valentijn Edwin A. Valentijn VO-lec 1-2 This course This course • A novum • Programme http://www.astro.rug.nl/~valentyn/VO.html – Basis • intro • concepts – systems • Astrometry – photometry – CDS – VO • EURO-VO tools – E – Lofar – AstroWise – AstroGRID – Students VO • A novum • Programme http://www.astro.rug.nl/~valentyn/VO.html – Basis • intro • concepts – systems • Astrometry – photometry – CDS – VO • EURO-VO tools – E – Lofar – AstroWise – AstroGRID – Students VO VO-lec 1-2 E-science E-science • Beyond “workstation science” of the 80-90’s • Distributed services • Distributed communities • Distributed archives • p2p networks – KAZAA- NAPSTAR – Share cpu – Share storage – Share info / meta data /knowledge • Beyond “workstation science” of the 80-90’s • Distributed services • Distributed communities • Distributed archives • p2p networks – KAZAA- NAPSTAR – Share cpu – Share storage – Share info / meta data /knowledge VO-lec 1-2 different views on different views on • Surveys • Templates • Pipelines • Virtual Observatories • Surveys • Templates • Pipelines • Virtual Observatories VO-lec 1-2 Surveys Surveys • Defined area on sky • Homogeneous – Survey limit • Flux (magnitude) • Size • Surface brightness • distance • Quality control • Defined area on sky • Homogeneous – Survey limit • Flux (magnitude) • Size • Surface brightness • distance • Quality control VO-lec 1-2 Templates Templates Standards very important for VO • Observing templates – Astronomical Observing Templates at ESA – Templates / Template signature files at ESO Standards very important for VO • Observing templates – Astronomical Observing Templates at ESA – Templates / Template signature files at ESO

E-sciencesciencevalentyn/presentations/VO-lecture1-2.pdfQuality control Calibration procedures Image pipeline Source pipeline VO-lec 1 -2 4 Integrated archive and Large Data Volume

  • Upload
    others

  • View
    5

  • Download
    0

Embed Size (px)

Citation preview

Page 1: E-sciencesciencevalentyn/presentations/VO-lecture1-2.pdfQuality control Calibration procedures Image pipeline Source pipeline VO-lec 1 -2 4 Integrated archive and Large Data Volume

1

VO-lec 1 -2

Edwin A. ValentijnEdwin A. Valentijn

VO-lec 1 -2

This courseThis course

• A novum• Programme

http://www.astro.rug.nl/~valentyn/VO.html– Basis

• intro• concepts – systems• Astrometry – photometry

– CDS – VO• EURO-VO tools

– E – Lofar– AstroWise– AstroGRID– Students VO

• A novum• Programme

http://www.astro.rug.nl/~valentyn/VO.html– Basis

• intro• concepts – systems• Astrometry – photometry

– CDS – VO• EURO-VO tools

– E – Lofar– AstroWise– AstroGRID– Students VO

VO-lec 1 -2

E-scienceE-science

• Beyond “workstation science” of the 80-90’s

• Distributed services• Distributed communities• Distributed archives• p2p networks – KAZAA- NAPSTAR

– Share cpu– Share storage– Share info / meta data /knowledge

• Beyond “workstation science” of the 80-90’s

• Distributed services• Distributed communities• Distributed archives• p2p networks – KAZAA- NAPSTAR

– Share cpu– Share storage– Share info / meta data /knowledge

VO-lec 1 -2

different views on different views on

• Surveys• Templates• Pipelines• Virtual Observatories

• Surveys• Templates• Pipelines• Virtual Observatories

VO-lec 1 -2

SurveysSurveys

• Defined area on sky• Homogeneous

– Survey limit•Flux (magnitude)•Size•Surface brightness•distance

• Quality control

• Defined area on sky• Homogeneous

– Survey limit•Flux (magnitude)•Size•Surface brightness•distance

• Quality control

VO-lec 1 -2

TemplatesTemplates

Standards very important for VO• Observing templates

– Astronomical Observing Templates at ESA– Templates / Template signature files at ESO

Standards very important for VO• Observing templates

– Astronomical Observing Templates at ESA– Templates / Template signature files at ESO

Page 2: E-sciencesciencevalentyn/presentations/VO-lecture1-2.pdfQuality control Calibration procedures Image pipeline Source pipeline VO-lec 1 -2 4 Integrated archive and Large Data Volume

2

VO-lec 1 -2

ESO parse info via headersESO parse info via headers

VO-lec 1 -2

pipelinespipelines

• Workflow• What triggers a pipeline?

– Data items– Operators

– users

• Workflow• What triggers a pipeline?

– Data items– Operators

– users

VO-lec 1 -2

Data Model / flowData Model / flow

Sanity checksSanity checks

Quality controlQuality controlCalibration proceduresCalibration procedures

Image pipelineImage pipeline

Source pipelineSource pipeline

VO-lec 1 -2

Astro-Wise PipelinesAstro-Wise Pipelines

Photometric pipelinePhotometric pipeline

Bias pipeline

Flatfield pipeline

Image pipeline

Source pipeline

VO-lec 1 -2

Virtual observatoriesVirtual observatories

• ????????????• ????????????

GigaPort seminar for astronomers

link to VO

lecture by EV

GigaPort seminar for astronomers

link to VO

lecture by EV VO-lec 1 -2

Virtual observatoriesVirtual observatories

• Broad VOs– IVOA– Euro VO

• EuroVO DataCenter Alliance • Focussed VOs

– AstroWise

• Broad VOs– IVOA– Euro VO

• EuroVO DataCenter Alliance • Focussed VOs

– AstroWise

Page 3: E-sciencesciencevalentyn/presentations/VO-lecture1-2.pdfQuality control Calibration procedures Image pipeline Source pipeline VO-lec 1 -2 4 Integrated archive and Large Data Volume

3

VO-lec 1 -2 VO-lec 1 -2

multi-wavelength view of a galaxy merger

multi-wavelength view of a galaxy merger

John Hibbard

http://www.cv.nrao.edu/~jhibbard/n4038/n4038.html

Radio

NASA/CXC/SAO/G. Fabbiano et al.

X-RayOptical

VO-lec 1 -2

SDSS

ROSAT

2MASS

FIRST GAIA

the problemthe problem

VO-lec 1 -2

ROSATFIRST GAIA

SDSS2MASS

work on a solution

VO-lec 1 -2

AVO-> Euro-VOAVO-> Euro-VO

• Standards– FITS– Universal Content descriptor UCD– VO table – VOT

• Communities – workshops/training• Registry• Connecting archives

– Cone search– X-match– All kinds of tools/ web services

• Relatively static

• Standards– FITS– Universal Content descriptor UCD– VO table – VOT

• Communities – workshops/training• Registry• Connecting archives

– Cone search– X-match– All kinds of tools/ web services

• Relatively static

VO-lec 1 -2

standards: Universal content Descriptor - UCD

standards: Universal content Descriptor - UCD

• http://www.ivoa.net/Documents/REC/UCD/UCD-20050812.html#ToC8

• http://www.ivoa.net/Documents/REC/UCD/UCD-20050812.html#ToC8

http://www.ivoa.net/Documents/PR/UCD/UCDlist-20061006.html

http://www.ivoa.net/Documents/PR/UCD/UCDlist-20061006.html

Page 4: E-sciencesciencevalentyn/presentations/VO-lecture1-2.pdfQuality control Calibration procedures Image pipeline Source pipeline VO-lec 1 -2 4 Integrated archive and Large Data Volume

4

VO-lec 1 -2

Standards:VO table VOTStandards:VO table VOT

• VOtable_SourceList.xml

• http://www.ivoa.net/twiki/bin/view/IVOA/IvoaVOTable

• VOtable_SourceList.xml

• http://www.ivoa.net/twiki/bin/view/IVOA/IvoaVOTable

VO-lec 1 -2

Finding information in the

VO

Registries are here• multiple interfaces

- human readable- machine readable

• simple/advanced search

VO-lec 1 -2VOIndia VOPLot

query lan

guage

keyword

identifier

VO-lec 1 -2

Examples

CDS Aladin: combines registry discovery with other access methods

VO-lec 1 -2

Data available at selected point are highlighted in tree

Field of view outlines are plotted automatically

Image metadata

HST-ACS

DSS

VLT -ISAAC

Chandra

ESO-WFI

2MASS

My Data

AVO prototype based on CDS AladinVO-lec 1 -2

èè ManipulationManipulation

èè XX--matchmatch

èè VisualizationVisualization

èè Direct links to:Direct links to:

CDS Aladin

ALADINALADIN

Page 5: E-sciencesciencevalentyn/presentations/VO-lecture1-2.pdfQuality control Calibration procedures Image pipeline Source pipeline VO-lec 1 -2 4 Integrated archive and Large Data Volume

5

VO-lec 1 -2

Orionis

cluster

Orionis

cluster

VO-lec 1 -2

Multi archive spectra

emerging conventions and standards

ESA VOSpec

VO-lec 1 -2

VO applications and analysis toolsVO applications and analysis tools• VOSED: a Spectral Energy distribution builder and model fitting.

VO-lec 1 -2VOIndia VOPLot

VOPlot

VO-lec 1 -2

TOPCAT VOTable viewTOPCAT VOTable view

VO-lec 1 -2VOIndia VOPLot

• Astronomy functionality•coordinate units and projections

Page 6: E-sciencesciencevalentyn/presentations/VO-lecture1-2.pdfQuality control Calibration procedures Image pipeline Source pipeline VO-lec 1 -2 4 Integrated archive and Large Data Volume

6

VO-lec 1 -2

Astronomical archives: take-up of VO standards.

VO-lec 1 -2

Theory in the VOTheory in the VO• Theoretical model web server + TSAP

VO-lec 1 -2

ASTRID -CM: VO ScienceASTRID -CM: VO Science

• From Class 0 to Class III:Search and classification.

VO-lec 1 -2

SVO Thematic Network: VO ScienceSVO Thematic Network: VO Science• Discovery of ultracool (T-type) brown dwarfs.

(IAC+LAEFF)

VO-lec 1 -2

applicationsapplications

• Astro-GRID à Euro-VO– Workflow

Taverna 2 (Bioinformatics-life sciences)client side workflow editor and engine

– Connecting webservices – grid– my space

• Astro-GRID à Euro-VO– Workflow

Taverna 2 (Bioinformatics-life sciences)client side workflow editor and engine

– Connecting webservices – grid– my space

VO-lec 1 -2

Euro –VOconnect and mine archives

Euro –VOconnect and mine archives

Technology-development

Technology-development registryregistry

Data centers take-upData centers take-up

Page 7: E-sciencesciencevalentyn/presentations/VO-lecture1-2.pdfQuality control Calibration procedures Image pipeline Source pipeline VO-lec 1 -2 4 Integrated archive and Large Data Volume

7

VO-lec 1 -2

Data Center AllianceData Center Alliance

- Consortium AgreementF, I, G, UK, Sp, NL, ESO, ESA

– Now EU funded: FP6 1.5 M Euro COMMUNICATION NETWORK DEVELOPMENTCOORDINATION ACTION•start 1/9/2006 2.5 year (startup project)•~ 1 fte NL, + travel

- NOVA -NL

- Consortium AgreementF, I, G, UK, Sp, NL, ESO, ESA

– Now EU funded: FP6 1.5 M Euro COMMUNICATION NETWORK DEVELOPMENTCOORDINATION ACTION•start 1/9/2006 2.5 year (startup project)•~ 1 fte NL, + travel

- NOVA -NL

VO-lec 1 -2

DCA tasks – WPstake-up VO technology

DCA tasks – WPstake-up VO technology

• WP3: support of take-up and implementation of VO technology– NL: AstroWise/OmegaCAM, Lofar– Census

• WP 4: inclusion theoretical data VObs• WP 5: coord with computational GRIDS

– NL: I-GrID- F.Pasian

• WP 6:support other EU countries

• WP3: support of take-up and implementation of VO technology– NL: AstroWise/OmegaCAM, Lofar– Census

• WP 4: inclusion theoretical data VObs• WP 5: coord with computational GRIDS

– NL: I-GrID- F.Pasian

• WP 6:support other EU countries

VO-lec 1 -2

Theory in the VO: issuesGerard Lemson

Theory in the VO: issuesGerard Lemson

• Simulations not so simple– complex observables– no standardisation (not even HDF5)– archiving ad hoc, for local use

• Moore’s law makes useful lifetime relatively short: few years later can do better

• Current IVOA standards somewhat irrelevant– no common sky– no common objects– requires data models for content, physics, code

• Simulations not so simple– complex observables– no standardisation (not even HDF5)– archiving ad hoc, for local use

• Moore’s law makes useful lifetime relatively short: few years later can do better

• Current IVOA standards somewhat irrelevant– no common sky– no common objects– requires data models for content, physics, code

VO-lec 1 -2

“Moore’s law” for N-body simulations

“Moore’s law” for N-body simulations

Courtesy Simon White

VO-lec 1 -2 VO-lec 1 -2

Detailed predictionsDetailed predictions

Courtesy Volker Springel

Page 8: E-sciencesciencevalentyn/presentations/VO-lecture1-2.pdfQuality control Calibration procedures Image pipeline Source pipeline VO-lec 1 -2 4 Integrated archive and Large Data Volume

8

VO-lec 1 -2

The Virgo consortium’s Millennium simulation

The Virgo consortium’s Millennium simulation

• Millennium simulation– 10 billion particles, dark matter only– 500 Mpc (~2Gly) periodic box– “concordance model” (as of 2004) initial conditions– 64 snapshots– 350000 CPU hours– O(30Tb) raw + post-processed data

• Postprocessing:– dark matter density fields smoothed at various

scales (45 * 2563 grid cells)– dark matter cluster merger trees (~750 million)– galaxy merger trees (~1 billion/catalogue)

• DeLucia & Balizot, 2006• Bower et al, 2006

• Millennium simulation– 10 billion particles, dark matter only– 500 Mpc (~2Gly) periodic box– “concordance model” (as of 2004) initial conditions– 64 snapshots– 350000 CPU hours– O(30Tb) raw + post-processed data

• Postprocessing:– dark matter density fields smoothed at various

scales (45 * 2563 grid cells)– dark matter cluster merger trees (~750 million)– galaxy merger trees (~1 billion/catalogue)

• DeLucia & Balizot, 2006• Bower et al, 2006

VO-lec 1 -2

VO-lec 1 -2

EvolutionEvolution

VO-lec 1 -2

Dark matter and galaxiesDark matter and galaxies

VO-lec 1 -2

Halos and galaxiesHalos and galaxies

VO-lec 1 -2

the Millenniumdatabase + web server

the Millenniumdatabase + web server

• Post-processing results only• SQLServerdatabase• Web application (Java in Apache tomcat web

server)– portal: http://www.mpa-garching.mpg.de/millennium/– public DB access: http://www.g-vo.org/Millennium– private access: http://www.g-vo.org/MyMillennium– MyDB

• Access methods– browser with plotting capabilities through VOPlot applet– wget + IDL, R– TOPCAT plugin

• Post-processing results only• SQLServerdatabase• Web application (Java in Apache tomcat web

server)– portal: http://www.mpa-garching.mpg.de/millennium/– public DB access: http://www.g-vo.org/Millennium– private access: http://www.g-vo.org/MyMillennium– MyDB

• Access methods– browser with plotting capabilities through VOPlot applet– wget + IDL, R– TOPCAT plugin

Page 9: E-sciencesciencevalentyn/presentations/VO-lecture1-2.pdfQuality control Calibration procedures Image pipeline Source pipeline VO-lec 1 -2 4 Integrated archive and Large Data Volume

9

VO-lec 1 -2 VO-lec 1 -2

Mock Map Making FacilityMock Map Making Facility

Blaizot , J. et alMon.Not.Roy.Astron.Soc. 360 (2005) 159-175

VO-lec 1 -2 VO-lec 1 -2

VO-lec 1 -2 VO-lec 1 -2

Page 10: E-sciencesciencevalentyn/presentations/VO-lecture1-2.pdfQuality control Calibration procedures Image pipeline Source pipeline VO-lec 1 -2 4 Integrated archive and Large Data Volume

10

VO-lec 1 -2 VO-lec 1 -2

VO-lec 1 -2 VO-lec 1 -2

VO-lec 1 -2

The observer >2006The observer >2006

• End-user, astronomer, research– Statistical approach– Individual result

• Overload of data:– Hubble ACS: 16 Mpix/image– Gaia: 109 stars– Ground based: OmegaCAM, MegaCAM 256Mpix/image à

105 galaxies/imageà100 Terabyte - Petabyte regime

• Where do we stop? How do we handle this?• New approaches- paradigm- relation to system design

• End-user, astronomer, research– Statistical approach– Individual result

• Overload of data:– Hubble ACS: 16 Mpix/image– Gaia: 109 stars– Ground based: OmegaCAM, MegaCAM 256Mpix/image à

105 galaxies/imageà100 Terabyte - Petabyte regime

• Where do we stop? How do we handle this?• New approaches- paradigm- relation to system design

VO-lec 1 -2

Page 11: E-sciencesciencevalentyn/presentations/VO-lecture1-2.pdfQuality control Calibration procedures Image pipeline Source pipeline VO-lec 1 -2 4 Integrated archive and Large Data Volume

11

VO-lec 1 -2

LofarLofar

IBM- Blue Gene/LIBM- Blue Gene/L

VO-lec 1 -2

VO-lec 1 -2

from Design-> deliverfrom Design-> deliver

• Scientific requirements - SRD• User requirements - URD• Architectural design - ADD• Detailed design - DDD

Implementation• Quantify • Build• Qualify – unit tests

• Scientific requirements - SRD• User requirements - URD• Architectural design - ADD• Detailed design - DDD

Implementation• Quantify • Build• Qualify – unit tests

VO-lec 1 -2

New approachesnew balances

New approachesnew balances

Anarchy ç àcoordinatedFreedom ß à fixed system

Anarchy ç àcoordinatedFreedom ß à fixed system

Standard data products çàuser tuned products

Data releases ß à user defined huntingStandard data products çàuser tuned products

Data releases ß à user defined hunting

DESIGN

5 Essential STEPS:

DESIGN

5 Essential STEPS:

VO-lec 1 -2

1- calibration plan integrated up-link /down link

1- calibration plan integrated up-link /down link

VO-lec 1 -2

Page 12: E-sciencesciencevalentyn/presentations/VO-lecture1-2.pdfQuality control Calibration procedures Image pipeline Source pipeline VO-lec 1 -2 4 Integrated archive and Large Data Volume

12

VO-lec 1 -2 VO-lec 1 -2

2 -Procedurizing2 -Procedurizing

• Procedurizing– Data taking at telescope for both science and

calibration data - Templates

• Observing Modes: —Stare—Jitter —Dither —SSO

• Observing Strategies: —Stan —Deep —Freq —Mosaic

– Full integration with data reduction– Design- ADD– Data model (classes) defined for data reduction and

calibration– View pipeline as an administrative problem

• Procedurizing– Data taking at telescope for both science and

calibration data - Templates

• Observing Modes: —Stare—Jitter —Dither —SSO

• Observing Strategies: —Stan —Deep —Freq —Mosaic

– Full integration with data reduction– Design- ADD– Data model (classes) defined for data reduction and

calibration– View pipeline as an administrative problem

VO-lec 1 -2

3 Data Model3 Data Model

Sanity checksSanity checks

Quality controlQuality controlCalibration proceduresCalibration procedures

Image pipelineImage pipeline

Source pipelineSource pipeline

VO-lec 1 -2

4 Integrated archiveand Large Data Volume4 Integrated archive

and Large Data Volume

• Handling of the data is non-trivial– Pipeline data reduction– Calibration with very limited resources– Things change in time:

–Physical changes (atmosphere, various gains)–Code (new methods, bugs)–Human insight in changes

– Working with source lists

Science can only be archive based

• Handling of the data is non-trivial– Pipeline data reduction– Calibration with very limited resources– Things change in time:

–Physical changes (atmosphere, various gains)–Code (new methods, bugs)–Human insight in changes

– Working with source lists

Science can only be archive based

VO-lec 1 -2

4 archive andVirtual Survey System

4 archive andVirtual Survey System

• Environment that provides systematic and controlled– Access to all raw and calibration data– Execution and modification image/calibration pipelines– Execution of source extraction algorithms- catalogues– Archiving or regenerate on the fly dynamically – Paradigm: no static data releases but dynamic on request data– federated to link different data centers

• Environment that provides systematic and controlled– Access to all raw and calibration data– Execution and modification image/calibration pipelines– Execution of source extraction algorithms- catalogues– Archiving or regenerate on the fly dynamically – Paradigm: no static data releases but dynamic on request data– federated to link different data centers

• Dynamical archive continuously grows, can be used for – small or large science projects– generating and checking calibration data– exchanging methods , scripts and configuration

• Dynamical archive continuously grows, can be used for – small or large science projects– generating and checking calibration data– exchanging methods , scripts and configuration

raw pixel data ⇔ pipelines/cal files ⇔ cataloguesraw pixel data ⇔ pipelines/cal files ⇔ catalogues

VO-lec 1 -2

5 Intra-operability peer to peer5 Intra-operability peer to peer

• CVS

• “Advanced Replication” evolving to pointers

• CVS

• “Advanced Replication” evolving to pointers

WRITE

–READ

-ONLY

– READ- ONLY

–R

EAD-

ON

LY

–R

EA

D -O

NLY

WRITE

–REPLIC

ATION

–REPLICATION

– RE

PLIC

AT

ION

–R

EP

LIC

AT

ION

Page 13: E-sciencesciencevalentyn/presentations/VO-lecture1-2.pdfQuality control Calibration procedures Image pipeline Source pipeline VO-lec 1 -2 4 Integrated archive and Large Data Volume

13

VO-lec 1 -2

Case: morphologies parameters vs theory -degeneracy

Case: morphologies parameters vs theory -degeneracy

• U, B, V, R , I, Z, K– m_tot, m_25, m_26, D_26, D_25, D_90%, r_eff, sb_0, sb_eff

• Structure/radial– a/b, pa, B/D, N, exp scale length , delta (B-R)

• colours– Central (B-R), (B_R)_tot,,

• U, B, V, R , I, Z, K– m_tot, m_25, m_26, D_26, D_25, D_90%, r_eff, sb_0, sb_eff

• Structure/radial– a/b, pa, B/D, N, exp scale length , delta (B-R)

• colours– Central (B-R), (B_R)_tot,,

Hubble `Tuning fork’ diagram

Morphologies- Hubble, de Vaucouleurs , CorwinMorphologies- Hubble, de Vaucouleurs , Corwin

VO-lec 1 -2