24
OpenDSP - Condor to DRMAA layer (DRMAA to Condor ;-) Krzysztof Kurowski Poznan Supercomputing and Networking Center Poland European Condor Week , Milan, Italy 26-29 June 2006

OpenDSP - Condor to DRMAA layer (DRMAA to Condor ;-) Krzysztof Kurowski Poznan Supercomputing and Networking Center Poland European Condor Week, Milan,

  • View
    214

  • Download
    1

Embed Size (px)

Citation preview

Page 1: OpenDSP - Condor to DRMAA layer (DRMAA to Condor ;-) Krzysztof Kurowski Poznan Supercomputing and Networking Center Poland European Condor Week, Milan,

OpenDSP - Condor to DRMAA layer

(DRMAA to Condor ;-)

Krzysztof Kurowski

Poznan Supercomputing and Networking CenterPoland

European Condor Week , Milan, Italy 26-29 June 2006

Page 2: OpenDSP - Condor to DRMAA layer (DRMAA to Condor ;-) Krzysztof Kurowski Poznan Supercomputing and Networking Center Poland European Condor Week, Milan,

Introduction to PSNC• PSNC was officially established as a non-profit

institution affiliated to the Institute of Bioorganic Chemistry Polish Academy of Science in 1993

• A few years later, a critical mass has been achieved among PSNC and MANs in Poland… and today:

• PSNC is an SUN Excellence Center in Portals, New Generation Networks and Grid Technologies and well known worldwide research lab

• PSNC has been actively involved in 20 EU-funded IST projects

• PSNC is a technology and knowledge transfer center to academia as well as to business (spin-off, start-up, …)

• PSNC is an operator of PIONIER: Polish Optical Internet

European Condor Week , Milan, Italy 26-29 June 2006

Page 3: OpenDSP - Condor to DRMAA layer (DRMAA to Condor ;-) Krzysztof Kurowski Poznan Supercomputing and Networking Center Poland European Condor Week, Milan,

PIONIER Strategic Research Objectives• Construction of the broadband optical Internet, • Development and verification of pilot services and applications for

the Information Society, creating a base for new developments in: eScience, eLearning, eHealth, government and local administration as well as industry and business services,

• Creation of competitive conditions for active software development for new applications and services in the Information Society.

European Condor Week , Milan, Italy 26-29 June 2006

Page 4: OpenDSP - Condor to DRMAA layer (DRMAA to Condor ;-) Krzysztof Kurowski Poznan Supercomputing and Networking Center Poland European Condor Week, Milan,

PIONIER: the current status

You are here

European Condor Week , Milan, Italy 26-29 June 2006

Page 5: OpenDSP - Condor to DRMAA layer (DRMAA to Condor ;-) Krzysztof Kurowski Poznan Supercomputing and Networking Center Poland European Condor Week, Milan,

Grid R&D at PSNC

European Condor Week , Milan, Italy 26-29 June 2006

Page 6: OpenDSP - Condor to DRMAA layer (DRMAA to Condor ;-) Krzysztof Kurowski Poznan Supercomputing and Networking Center Poland European Condor Week, Milan,

iGrid 2005 GridLab demo

Gridge middleware

Computing Nodes

Computing Node

Computing Nodes

Computing Nodes

ExtractingEncoding

ExtractingEncoding

Merging

Displaying

Data flowData flowData flow

European Condor Week , Milan, Italy 26-29 June 2006

Page 7: OpenDSP - Condor to DRMAA layer (DRMAA to Condor ;-) Krzysztof Kurowski Poznan Supercomputing and Networking Center Poland European Condor Week, Milan,

New Generation Mutimedia Services and Portals

• Virtual Laboratory is a distributed workgroup environment, with the main task of providing a remote access to the various kind of rare and expensive scientific laboratory equipment and distributed computational and data resources.

• iTV platform (TV/multimedia on demand) enables secure and scalable digital content provisioning, which paves a new path for large scale interactive services deployment used by various eScience communities.

• eHealth Telemedicine project provides a secure web collaboration platform for surgery and on-line consultation in the hospital treatment process together with mobile services for remote and easy-to-use access to medical databases from mobile devices and PDAs.

• EU eHealth ACGT project brings together internationally recognized leaders, including PSNC, in their respective fields, with the aim to deliver to the cancer research community an integrated Clinico-Genomic ICT environment enabled by a powerful Grid infrastructure.

European Condor Week , Milan, Italy 26-29 June 2006

Page 8: OpenDSP - Condor to DRMAA layer (DRMAA to Condor ;-) Krzysztof Kurowski Poznan Supercomputing and Networking Center Poland European Condor Week, Milan,

New Generation Mutimedia Services and Portals

European Condor Week , Milan, Italy 26-29 June 2006

Page 9: OpenDSP - Condor to DRMAA layer (DRMAA to Condor ;-) Krzysztof Kurowski Poznan Supercomputing and Networking Center Poland European Condor Week, Milan,

• Existing approaches (e.g. Globus) in a relatively simple, script-based and often insecure way try to integrate capabilities provided by Distributed Resource Management (DRM) systems with both end users and administrators applications.

• DRMAA is a GGF API specification for "the submission and control of jobs to one or more Distributed Resource Management (DRM) systems".

• Providing a single interface to DRM resources, DRMAA substantially eliminates a problem currently facing commercial software developers - proprietary interfaces which make it difficult to integrate applications with DRM systems.

• DRMAA substantially accelerates the adoption of direct interfaces to DRM systems, where adoption of proprietary interfaces has traditionally been low.

• DRMAA provides a standard on which people can create new applications using a "write-once" interface and makes grid deployment and management easier for systems administrators.

Ideas behind DRMAA

European Condor Week , Milan, Italy 26-29 June 2006

Page 10: OpenDSP - Condor to DRMAA layer (DRMAA to Condor ;-) Krzysztof Kurowski Poznan Supercomputing and Networking Center Poland European Condor Week, Milan,

DRMAA in DRMs

European Condor Week , Milan, Italy 26-29 June 2006

Page 11: OpenDSP - Condor to DRMAA layer (DRMAA to Condor ;-) Krzysztof Kurowski Poznan Supercomputing and Networking Center Poland European Condor Week, Milan,

• Currently, for each end user a physical local account must be set up in advance to allow him access to operations provided by DRMAA.

• End user has to login first to a submission or execution DRMs host before any routine from DRMAA is invoked.

• It is possible to use DRMAA APIs only locally.• Communication processes between end users' applications

using DRMAA are not encrypted and usually separated from the Internet by firewalls and appropriate Intranet configuration.

• Administrators have limited or no control over DRMAA routines used by end users.

What is possible now?

European Condor Week , Milan, Italy 26-29 June 2006

Page 12: OpenDSP - Condor to DRMAA layer (DRMAA to Condor ;-) Krzysztof Kurowski Poznan Supercomputing and Networking Center Poland European Condor Week, Milan,

Customers and Enterprises needs…

Currently, customers main concerns are security and interoperability. This is the reason why I asked about DRMAA and Web Services. I have no doubt that your openDSP will address a lot of our interoperability issues.This is great stuff!

Melvin Koh Chee Kian Asia Pacific Science & Technology Center Sun Microsystems Inc.

European Condor Week , Milan, Italy 26-29 June 2006

Page 13: OpenDSP - Condor to DRMAA layer (DRMAA to Condor ;-) Krzysztof Kurowski Poznan Supercomputing and Networking Center Poland European Condor Week, Milan,

• Reference implementations of the DRMAA specification available today

(e.g in Sun Grid Engine v6 and Condor) allow much better integration of applications with DRMs, but …

• In addition to DRMAA APIs, consistent, remote and multi-user access, and policy-based job control over various DRMs are needed.

• Based on our experiences and best practice efforts, we have taken seriously security (Authentication Authorization Accounting), performance, interoperability, easy of use, and flexibility requirements in order to design a set of modules based on DRMAA and its current implementations to provide multi-user access to operations on jobs in DRMs, (in particular in SGE and Condor).

• Support for standards accepted and widely used by customers (Web Services, SAML, etc.) in order to allow interoperability between business applications and DRMs.

• Thus, one of the main motivations for designing and implementing DRMAA Service Provider (DSP) is a lack of such solutions today (existing ones do not meet mentioned requirements).

What is still missing?

European Condor Week , Milan, Italy 26-29 June 2006

Page 14: OpenDSP - Condor to DRMAA layer (DRMAA to Condor ;-) Krzysztof Kurowski Poznan Supercomputing and Networking Center Poland European Condor Week, Milan,

OpenDSP architecture

Open DRMAA Service Provider (openDSP) and its modules

Existing DRMAA routines provided

by DRMs

European Condor Week , Milan, Italy 26-29 June 2006

Page 15: OpenDSP - Condor to DRMAA layer (DRMAA to Condor ;-) Krzysztof Kurowski Poznan Supercomputing and Networking Center Poland European Condor Week, Milan,

- Built-in support of DRMAA job submission, control, monitoring and error handling over DRMs - Authentication and encryption mechanisms based on SSL or GSI - Successfully tested with external services based on Globus Toolkit

2.4.x, 3.2.x and 4.0 - Authorization callouts and external access control list - External accounting and auditing database available - Stage-in/out operations supported over DIME/MIME attachments - JSDL 1.0 POSIX based job description - Successfully tested with the following resource/cluster management systems (where is Condor?): SGE (SGEv6u4, SGE v6u5, SGEv6u7 recommended) over the following OS systems:     - Linux x86, kernel 2.4, glibc >= 2.2     - Linux AMD64 (Opteron), kernel 2.4, glibc >= 2.3     - Solaris (Sparc) 9  64-bit     - Solaris (x86) 9     - Apple Mac OS/X - Java, .NET and C-based example clients and SDKs available - Distributed under the GPL license

Release notes (OpenDSP 0.1)

European Condor Week , Milan, Italy 26-29 June 2006

Page 16: OpenDSP - Condor to DRMAA layer (DRMAA to Condor ;-) Krzysztof Kurowski Poznan Supercomputing and Networking Center Poland European Condor Week, Milan,

On Jun 21, 2006, at 3:39 PM, Peter Troeger wrote:>> Hi Becky,>>>> I received a big improvement patch for the Condor DRMAA library by a >> research project in Poland. The people are interested when they could >> see the changes in the official Condor release. Is there a timeframe >> for a new DRMAA library version ?>>>> Best regards,>> Peter.

That's great! I'll add the patch to the next Condor release. Is thepatch currently in SourceForge, or should I obtain it from anotherlocation? I'll pull the latest files when I hear from you as to wherethey are.

Thanks,

Becky

Collaboration with Condor

European Condor Week , Milan, Italy 26-29 June 2006

Page 17: OpenDSP - Condor to DRMAA layer (DRMAA to Condor ;-) Krzysztof Kurowski Poznan Supercomputing and Networking Center Poland European Condor Week, Milan,

>> The Condor people will provide your patched

>> version with Condor 6.7.21. Since 6.7.20

>> is out since yesterday, I expect the next

>> version in 4-6 weeks at the earliest. Would that

>> be enough for you ?

Once a new Condor version is released please visit:

Collaboration with Condor

http://sourceforge.net/projects/opendsp

European Condor Week , Milan, Italy 26-29 June 2006

Page 18: OpenDSP - Condor to DRMAA layer (DRMAA to Condor ;-) Krzysztof Kurowski Poznan Supercomputing and Networking Center Poland European Condor Week, Milan,

What’s next?FedStage Computing Provider

• FedStage Computing Provider is a generic job execution, monitoring and management HPC service provider accessible through a well-defined Web Service interface based on DRMAA specification and its reference implementations supported by local cluster/resource management systems, such as Sun

Grid Engine, Condor or PBS/Torque.• Remote Interfaces: Web Service over SSL, GSI and SSL/SAML2.0 with JSDL 1.0 support• AAA Module: Tested and integrated with internal and external AAA modules (FedStage Identity

Manager and FedStage Authorization Manager)• Distributed Resource Manager: Sun Grid Engine (SGEv6u4, SGE v6u5, SGEv6u7, SGEv6u8),

Condor (6.7.21 and higher), PBS/Torque(2.0 and higher)• Computing Node: Solaris (Sparc), Solaris (x86), FreeBSD, Linux x86, Linux AMD64, 9/10 Mac X OS,

Windows 2000/XP, • Available: commercial license and open source under GPL (OpenDSP project)

European Condor Week , Milan, Italy 26-29 June 2006

Page 19: OpenDSP - Condor to DRMAA layer (DRMAA to Condor ;-) Krzysztof Kurowski Poznan Supercomputing and Networking Center Poland European Condor Week, Milan,

FedStage: a new wave of HPC solutions? • Done from scratch based on our experiences and lessons learned• Interoperability, Peformance, Security are the key issues and therefore FedStage AAA-based solutions are of primary importance for the cross-organization sharing of applications, data, and computational services in a collaborative business environment today. • FedStage products follow Liberty Alliance and WS-I

• W3C standards:  XML, SOAP• OASIS standards: WS-Security, SAML2.0 WS-Notification,

WS-Reliability, • GGF standards(recommendations): DRMAA, JSDL• IETF standards: SSL/TLS

European Condor Week , Milan, Italy 26-29 June 2006

Page 20: OpenDSP - Condor to DRMAA layer (DRMAA to Condor ;-) Krzysztof Kurowski Poznan Supercomputing and Networking Center Poland European Condor Week, Milan,

• The convergence of Web Services, Grid technologies and SOA/ESAs that currently occurs among enterprise, media, mobile and Internet domains requires a standard and secure approach for virtualization of IT resources and technology independent interoperability across platforms, services and systems

• FedStage products are loosely coupled services (SOA) which may be individually useful, or they can be composed to be used in specific business processes.

• FedStage products may be completely self-contained, or they may depend on the availability of other enterprise services.

• FedStage products capabilities as well as communication and data channels among them and consumers are protected by transport and message level security mechanisms.

• To deliver the best-quality software to the market we combine innovations with best practices, and thus follow:

– industry specifications, – security standards and recommendations, – pure Web Services requirements, – WS-* extensions.

European Condor Week , Milan, Italy 26-29 June 2006

FedStage: a new wave of HPC solutions?

Page 21: OpenDSP - Condor to DRMAA layer (DRMAA to Condor ;-) Krzysztof Kurowski Poznan Supercomputing and Networking Center Poland European Condor Week, Milan,

European Condor Week , Milan, Italy 26-29 June 2006

Enterprise Access Management And Control Products:FedStage Identity ManagerFedStage Authorization ManagerEnterprise Infrastructure Service Providers:FedStage Computing ProviderFedStage Storage ProviderFedStage Notification Provider Official release: 3Q 2006

Page 22: OpenDSP - Condor to DRMAA layer (DRMAA to Condor ;-) Krzysztof Kurowski Poznan Supercomputing and Networking Center Poland European Condor Week, Milan,

How old is this picture?

European Condor Week , Milan, Italy 26-29 June 2006

Page 23: OpenDSP - Condor to DRMAA layer (DRMAA to Condor ;-) Krzysztof Kurowski Poznan Supercomputing and Networking Center Poland European Condor Week, Milan,

Collaboration with Condor• Peter Troeger <[email protected]>

• Dan Bradley <[email protected]>

• Becky Gietzel <[email protected]>

• Alan De Smet <[email protected]>

• R. Kent Wenger <[email protected]>

European Condor Week , Milan, Italy 26-29 June 2006

Page 24: OpenDSP - Condor to DRMAA layer (DRMAA to Condor ;-) Krzysztof Kurowski Poznan Supercomputing and Networking Center Poland European Condor Week, Milan,

Thank [email protected]

European Condor Week , Milan, Italy 26-29 June 2006