View
214
Download
0
Category
Preview:
Citation preview
http://www.ogsadai.org.uk
OGSA-DAI: Future Work and Wrap-up
The OGSA-DAI Team
info@ogsadai.org.uk
2http://www.ogsadai.org.uk
Goals for DAIT
Aim to deliver application mechanisms that:– Meet the data requirements of Grid applications
• Functionally, performance and reliability
• Reduce development cost of data centric Grid applications
• Provide consistent interfaces to data resources
– Acceptable and supportable by database providers• Trustable, imposed demand is acceptable, etc.
• Provide a standard framework that satisfies standard requirements
A base for developing higher-level services– Data federation– Distributed query processing– Data mining– Data visualisation
3http://www.ogsadai.org.uk
Current Release
R4 April 2004– Provides Data Access components, an extensible framework for
building applications and some integration components– Built on top of Globus Toolkit 3.2– Supports relational, xml and some files
• MySQL, Oracle, DB2, SQL Server, Postgres, XIndice, CSV
– Supports various delivery options• SOAP, FTP, GridFTP, HTTP, files, email, inter-service
– Supports various transforms• XSLT, ZIP, GZip
– Supports message level security using X509 certificates– Client Toolkit library for application developers– GUI data browser (contributed by FirstDIG project)– Separate Distributed Query Processing components– Comprehensive documentation and tutorials
4http://www.ogsadai.org.uk
Roadmap / Workplan
Roadmap document available for comment:– http://www.ogsadai.org.uk/docs/OtherDocs/OGSA-DAIRoadmapV2.0.pdf
– User feedback required to drive this document
Integrate parts of DQP into OGSA-DAI core– Addressing platform dependencies– Want to include XML data resources
Undertaking data integration requirements capture process– Talked to 11 UK and US projects so far
Move Computation to Data– Java mobile code
5http://www.ogsadai.org.uk
Release 5
R5 October 2004 (Interfaces) – OGSI– Built on Globus Toolkit 3.2.1– Re-engineered to separate interface layer from core
OGSA-DAI functionality– Improved dependability and security integration– New file data resources representing flat files queried using
full text searches (e.g. EMBL format)– Installation and Configuration Wizard, including “all-in-one
installer”– Improved Data Browser which allows XPath querying.– Set of standard benchmarks.– JSP Quick View interface.– Support for other databases (e.g. Access, Exist, HSQL)
6http://www.ogsadai.org.uk
Release 5
Major refactoring exercise– Improve distinction between interface and core– Allow future support for different interfaces
Listened to major user concerns– Adding indexed text file access to support the
bioinformatics community– Improved robustness and usability– Addressed bugs as they were reported– Hope to add Activity Configuration, requested by projects
developing custom activities
Last release primarily targeted at the OGSI/GT3 platform
7http://www.ogsadai.org.uk
WS-I Technical Preview
A limited functionality evaluation version– An OGSA-DAI “Data Service” combining the metadata, configuration
and perform document capabilities of the OGSI-based GDSF and GDS services.
– Access to service metadata provided by a partial implementation of the WS-ResourceProperties specification.
– Example clients are provided for testing and coding reference.
Caveats/Issues:– No registry component, no support for third party delivery.– Security may be available (based on OMII WS-Security plug-in for
Axis).– Document schema and interfaces WILL change.– The WSDL is based on the OGSI-based WSDL from OGSA-DAI – Will not be supported to same level as main release for now.
Also released with OMII middleware distribution (Oct)
8http://www.ogsadai.org.uk
WS-RF Technical Preview
An evaluation version OGSA-DAI based on the Globus Toolkit 4.0 beta implementation of WSRF.
– Provides an amalgamation of the capabilities of the OGSI-based GDSF and GDS services (the metadata and configuration aspects of the GDSF and the metadata and perform document processing aspects of the GDS).
– Access to multiple data resources from a single service provided by data resource identifiers specified by a client within the WS-Addressing endpoint reference to a data service.
– Access to service metadata (database schemas, request status, etc) provided by an implementation of the WS-ResourceProperties specification.
– A WSRF version of the GridDataTransport portType supporting asynchronous data delivery between data services.
Caveats/Issues:– This preview of OGSA-DAI WSRF does not support data service security.– Document schema and interfaces WILL change.– Will not be supported to same level as main release.
Will also be released as part of the Globus Toolkit 4 beta
9http://www.ogsadai.org.uk
OGSA-DAI Project Webpage
http://www.ogsadai.org.uk
Background
News & Events
Software Releases
Documentation
Support
Training Courses
Links
10http://www.ogsadai.org.uk
FAQ, Support, Mailing List
Frequently Asked Questions– http://www.ogsadai.org.uk/support/faq.php– Updated as common problems become clear
Support for OGSA-DAI releases– http://www.ogsadai.org.uk/support– support@ogsadai.org.uk– Use to report problems
Users mailing list– users@ogsadai.org.uk– http://www.ogsadai.org.uk/support/list.php– General discussion of OGSA-DAI, data and the Grid
11http://www.ogsadai.org.uk
Second Users Group Meeting
Held alongside GGF12 in Brussels– ~40 attendees, 12+ projects represented
Users Group Chair appointed– Prof. Beth Plale, Indiana University
User presentations from– AstroGrid (UK), GeneGrid (UK), eDiamond (UK), INWA (UK),
LEAD (USA)– Also representatives from BioGrid (Japan), BRIDGES (UK),
caBIG (USA), GEDDM (UK), GEON (USA), myGrid (UK), SIMDAT (EU),…
A separate independent body to engage with users and feedback to developers in a formal way– Mailing list and website being setup
12http://www.ogsadai.org.uk
Take Home Messages
There are plenty of Research Challenges– Data integration and workflow on a global scale– Data resource and schema heterogeneity– Dynamic variability
• Authorisation, Resources, Data & Schema, Performance
– Some massive data– Metadata for discovery– Provenance tracking
Grasp the theoretical & practical challenges– Working in open & dynamic systems– Incorporate all computation– Welcome “code” visiting your data
13http://www.ogsadai.org.uk
Future DAI requires fundamental CS
What architecture best enables integration of data & computation?– Common conceptual models– Common planning & optimisation– Common enactment of workflows– Common debugging
What Fundamental CS is needed?– Trustworthy code & trustworthy evaluators– Decomposition and recomposition of applications– Metadata
Is there an evolutionary path? Are web services a distraction?
14http://www.ogsadai.org.uk
Conclusions
Still early days– Standardisation process not stabilising quickly enough– Infrastructure still developing and prone to change
OGSA-DAI acting as an enabler– Showing people what can be done– However is it cracking a nut with a sledge hammer?– Evolving and improving with each release
Usage patterns are similar– Call for people to work together to solve similar problems
Some problems are not OGSA-DAI specific– Metadata, time zones, security, …
Data discovery perceived to be important– Is this in the scope of what OGSA-DAI should be doing?
Please try it out!– It’s free and supported– Make suggestions, extend functionality, contribute to DAIS-WG
15http://www.ogsadai.org.uk
Links
OGSA-DAI Webpage– http://www.ogsadai.org.uk/
Globus Toolkit 3.2– http://www.globus.org/ogsa
Database Access and Integration Services (DAIS-WG)– https://forge.gridforum.org/projects/dais-wg
Grid Technology Repository– http://gtr.globus.org
ELDAS - Enterprise-Level Data Access Services (Eldas) – http://www.edikt.org/eldas
Web Services Choreography– http://www.w3.org/2002/ws/chor
Recommended