View
212
Download
0
Embed Size (px)
Citation preview
GLIFLinking the Globe with
LIGHT
Gigi Karmous-EdwardsPrincipal Scientist
APAN 2008, Hawaii
Aloha Kakahiaka !
1. What is GLIF?2. Why does GLIF exists?3. How Does GLIF function?4. What has GLIF accomplished?5. Virtualization6. The Many Challenges ahead7. Conclusions
Agenda
Global Lambda Integrated Facilitywww.glif.is
• GLIF is an international virtual organization that promotes the paradigm of lambda networking
• GLIF participants jointly make lambdas available as an integrated global facility for use in data-intensive research
• GLIF brings together leading networking engineers and researchers worldwide, who collaborate to identify and solve challenges for a Global facility
What is GLIF?
• GLIF is an international virtual organization managed as a cooperative activity with ‘participants’ rather than ‘members’ with a lightweight governance structure.
• Open to anybody sharing the vision of optical interconnection of different facilities, who voluntarily contributes network resources (e.g. equipment, lambdas) or actively participates in relevant activities.
• Please join the mailing list if you have an interest in being part of the solution for facilitating global lambda networks for research and education.
What is GLIF?
GLIF …. More resources are now available, next version in two weeks!
• Researchers need to do their work globally
• E-science: global, large scale scientific collaborations enabled through distributed computational and communication infrastructure
• Combines scientific instruments and sensors, distributed data archives, computing resources and visualization to solve complex scientific problems
• In physics, molecular biology, environmental, Health, Entertainment, etc.
• Future - this facility will be useful for K-20 education not just E-Scientist
Why GLIF exists? … E-science
• Korea’s HVEM • One of a kind in the world • Provide global access to unique
instruments for the purpose of advancing science for humanity • WEB service interface• High capacity optical network for
output
Developing a Global E-science Laboratory (GEL)
• Viewing the real-time video from the CCD camera• Accessing or manipulating the 2-D or 3-D images• Generating the workflow specification and requesting the
workflow to be executed• Searching the images or video files, papers, and
experiments in the databases or storages
Hyuck Han, Hyungsoo Jung, Heon Y. Yeom, Hee S. Kweon, and Jysoo Lee ”HVEM Grid: Experiences in Constructing an Electron Microscopy Grid”
• Need high Capacity - 1Gbs - 10Gbs or more• Need QoS - difficult to guarantee w/ routed
network• Cannot disrupt current users with their large
flowsSo… We need Hybrid Networking (IP + lambda
networking)
• Lightpath: high quality and high capacity optical end-to-end network connection
• Lightpaths provide applications with dedicated bandwidth with fixed characteristics at relatively low costs and with added security
Accommodating Researchers
• September 2001: first Lambda Workshop in Amsterdam followed by open Lambda Workshop organized by TERENA
• Second Lambda Workshop in 2002 in Amsterdam was attached to iGrid2002, hosted by Science Park Amsterdam
• August 2003: third Lambda Workshop in Reykjavik hosted by NORDUnet and attached to the NORDUnet 2003 Conference -GLIF name created
www.GLIF.is
The GLIF Story …
There are Four working groups:– Governance– Research and Applications– Technical – Control Plan
•Secretariat functions by TERENA•Holds Annual meeting– Next Meeting - 8th Annual Global LambdaGrid Workshop,
Seattle, USA, 1-2 October 2008
•Tech and Control working groups also hold semi-annual meetings (past weekend)
How GLIF functions?
Governance and Growth (GOV) Working GroupChair: Kees Neggers (SURFnet)Goals: To identify future goals in terms of
lambdas, connections and applications support, and to decide what cross-domain policies need to be put in place.
Research and Applications (RAP) Working GroupChair: Maxine Brown (UIC) & Larry Smarr (UCSD)Goals: To train a new generation of scientists on
the use of super-networks.
GLIF Working Groups
Technical Issues (Tech) Working GroupCo-Chairs: Erik-Jan Bos (SURFnet) & René Hatem (CANARIE)Goals: To design and implement an international LambdaGrid infrastructure, identify which equipment is being used, what connection requirements are required, and which functions and services should be provided.
Control Plane and Grid Integration Middleware Working GroupChair: Gigi Karmous-Edwards (MCNC)Goals: To agree on the interfaces and protocols that talk to each other on the control planes of the contributed Lambda resources.
GLIF Working Groups
• Documented enabling technologies (middleware, control plane software) and what applications they enable (e.g., DRAGON, UCLP, etc)
• Documented countries’ activities (feedback to NRENs)• Helped applications get started• Provides a resource for groups trying to get funding for GLIF-
related activities; GLIF “branding” adds credibility• Document applications (brief descriptions with URL pointers) (I
will create template and forward to RAP email list)• Developed a GLIF primer (how to find, educate, promote
applications)• Provided PR: What can GLIF do for you?• Provided PR: Promote domain-specific applications (eVLBI,
CineGrid, etc) (provide inspiration and motivation to potential new applications within countries)
GLIF RAP working group Accomplishments
Chairs: Erik-Jan Bos and Rene Hatem, Secretary: Kevin Meynell• Developed concept of GOLEs
• Documented in a centralized database all technical information on contributed resources
• Developed best practices and issues document for Hybrid Networking
• Developed best practices document for fault resolutions
• Hold monthly resource update calls
• Share Open source toolkits such as TL1 toolkit
• And more…
GLIF Tech working group Accomplishments
GLIF Open Lightpath Exchanges
• GLIF lambdas are interconnected through established lightpath exchange points known as GOLEs
• GOLEs are comprised of equipment capable of terminating lambdas and performing lightpath switching, allowing end-to-end connections
• GOLEs have an open connection policy
GOLES
GOLES, example of a GOLE, NetherLight
• AMPATH - Miami• CERN - Geneva• CzechLight - Prague• HKOEP - Hong Kong• KRLight - Daejoen• MAN LAN - New York• MoscowLight - Moscow• NetherLight - Amsterdam• NGIX-East - Washington
DC• NorthernLight -
Copenhagen
Current GLIF Resources • Pacific Wave (Los
Angeles)• PacificWave (Seattle)
PacificWave(Sunnyvale) • StarLight - Chicago• T-LEX - Tokyo• TaiwanLight - Taipei• UKLight - London
• AARNet, • US LHCNet
Chair: Gigi Karmous-Edwards, Sectretary: Licia Florio
• Virtualization of Networking resources as well as other key resources (compute, storage, instruments, etc) via “on-demand” and “advanced reservations”
• Agreed to adopt Network Description Language (NDL) based on RDF
• Work closely with two OGF working groups for standardization
– Grid High Performance Networking wg– Network Markup Language wg
• Shared current research experiments and open source code for controlling lightpaths
• Developed an architecture for next generation lambda resources coordinated with other key resources
• Agreed to focus on Generic Network Interface (GNI)• Comparing existing APIs similar to GNI• Will have an initial GNI specification by October meeting
GLIF Control Plane and Grid Middleware Integration wg
Keep it Simple and
Smart!----------------
Akamai
KISS
GLIF Grid Resource Registry
RB-A
NRM-A
Network-A
RB-B
NRM-B
Network-B
GNI
GAI
CRM-AIRM-B
ResourceRegistry
GC
I
Grid Administrative Domain - A
RB: Resource BrokerDNRM: Domain Network Resource ManagerCRM: Compute Resource ManagerIRM: Instrument Resource ManagerSRM: Storage Resource Manager
User
GN
I
GAI: Grid Application InterfaceGNI: Grid Network InterfaceGCI: Grid Compute InterfaceGSI: Grid Storage InterfaceGII: Grid Instrument Interface
SRM-A
CRM-B
GCIResourceRegistry
Grid Administrative Domain - B
GC
I
GN
I
Publish Resource InformationPublish/Subscribe Broker + Resource Information / References
GII
GNI
RB
Publish/SubscribeGAI
GSI, GII, GxI, etc
Publish/Subscribe
Multi-domainPath Computation
RMsHARC RMs
- Fault Mgmt - Performance
Resource Meta-scheduler
ResourceRegistry
Security/AAA
Policy Engine
RequestProcessor
MonitoringDiscovery
Static Information(Policy, etc)
GLIF Grid Resource Registry
Resource Co-allocation
HARC Acceptors
GNI GNI
NRM
Publish Information to ERB
Topology/Discovery
MonitoringDiscovery
GNI
Static Information(Policy, etc)
Path Computation
Reservationtimetable
ResourceAllocation
Publish/Subscribe
NetworkManagement:- Fault Mgmt- Performance
e.g. TL1, SNMP, XML,MDS, etc.
Domain A
Security/AAA
Policy Engine
RequestProcessor
ResourceRepository
e.g. TL1, SNMP
• MCNC experimenting with new Virtual Compute Services for NC’s K-20 community
• Reservation and Provisioning system– Allocates nodes to users on a reservation basis– Can be now (on-demand) or future (schedule in
advance)– Can allocate both single nodes and clusters of
nodes– Reservation lengths are policy driven
• selection of 1-4 hours• Or open end time allow a month or more
NCSU’s Virtual Computing Lab (VCL) vcl.ncsu.edu
• Will host for NCSU 1000 nodes at MCNC this year• Pilots are under way with K-20 type users• IBM BladeCenter Blade Servers
• Housed in a datacenter - IBM’s Energy efficiency doors
• Standalone workstations• Housed anywhere; we include our lab machines
when the labs are closed
• Working on Sun Blade servers– VCI partners are working Dell and HP blades– Can easily be moved between HPC cluster and VCL
system
– We move nodes to HPC during student breaks
Virtual Computing Lab
Hardware Blades, servers, desktops, storage
OS:
Apps
Win Linux Other …
VirtualLayer
OS: Win Linux …
Apps
e.g.,
WebSphere
e.g., Web
Sphere
…RDP,VNC,
…
e.g., VMWare,
XEN, MSVS2500,..
X-WinClient
Apps.WorkFlow
Services
End-UserAccess
VisServices Other …
Middlewaree.g. LSF
VCLManager
“Applic
ati
on”
Imag
e S
tack
xCAT VCL code IBM TM
WebServer DataBase Etc.
Users“Images”
H/W ResourcesUndifferentiated Local or distributed
Differentiator: User to Image to Resource Mapping, Management & Provenance
Simplicity, Flexibility, ReliabilityScalability, Economy
Image
• About 1000 blades (cca 140 used for VCL individual seats, the rest for VCL HPC cycles), plus several hundred idle student laboratory machines.
• Environment base-lines are typically Windows and Linux with a variety of applications. Depending on how demanding an application is, service may be virtualized (VMWare) or bare-metal.
• About 70,000 single-seat image reservations per semester. Fall 2007, peaked at about 2,500 reservations per day.
• Serving population of 30,000 students (in a semester there may be about 6,000 unique users).
• Most of the “individual seat” requests are on-demand “Now” reservations: cca 90% of requests
• System availability: about 99%
Some Stats
Issues and Challenges
• Key Challenges with Hybrid networking - effect on IP while having dynamic lambdas
• Coordination of network resources and other Grid resources
• Two phase commit for all involved resources - KISS• Topology Abstractions - including end points - or
services• Monitoring - MonALISA, PerfSONAR….• Advertising resources globally - agree on what and
how to represent resources… NDL etc.• Policy• Different implementations of each component (no
need to standardize on how things are done - just interfaces)
• Agree on Functional components • Focus on a couple of KEY interfaces (low set of
options - use lowest common denominator) Prioritize - GNI …
Conclusions
Conclusions• A Global Integrated Facility is necessary for the support of
both Scientific Research, Education, and networking research. Everyday there are more requests for use and more resources contributed.
• GLIF currently behaves as a Global collaborative testbed
• Our goal is to provide Global virtualization of shared resources , including network lambdas, compute, storage, instruments, etc.
• Next Generation Networks will be a hybrid of of routed and lambda switched networks. (not just for high-end research)
• The Research networks (NRENs and Gov sponsored testbeds) are taking these bold steps on GLIF, testbed infrastructures… apply lessons learned to production quickly.
• International Collaboration is a very Key ingredient for the future of Scientific discovery and education - The Optical network plays the most critical role in achieving this!