Upload
miranda-perkins
View
214
Download
1
Embed Size (px)
Citation preview
Scientific Advisory Committee – September 2011
COLA Information Systems
COLA’s Information Systems
2011
Scientific Advisory Committee – September 2011
COLA Information Systems
Computing at COLAMaximizing Productivity and Collaboration
• IT culture of service to scientists• Focus is on in-house data analysis– Large spinning disk volumes, ~400TB – Shared access to data– Servers for data analysis
• GrADS– Primary tool for data analysis and visualization
Scientific Advisory Committee – September 2011
COLA Information Systems
atlas11 node @ 24 cores
256 GB
/homes8 TB
/shared100 TB
cpuX6 nodes @ 8 cores
24-32 GB
colaXX2 nodes @ 8 cores
32 GB
COLA Data Center
Backups
Infiniband (DDR)
Internet 21Gb/s
Internet 120Mb/s
Ethernet 1Gb/s
Quantum i6000 libraryNightly backups~50 TB tape archive
/project33 TB
/data250 TB
Scientific Advisory Committee – September 2011
COLA Information Systems
Offsite Resources• NCAR (CISL)
– Bluefire, Mirage– 20TB project space– 448TB tape archive
• Oak Ridge, NICS (XSEDE)– Kraken, Nautilus– Scratch only disk space– 1.2PB tape archive
• NASA Ames (NAS)– Pleiades– 1TB project disk space– 325TB Tape archive
Total: ~10 millionCPU hours
Scientific Advisory Committee – September 2011
COLA Information Systems
Offsite Computing Trade-offs• Resources are shared, not dedicated
(multi-project/multi-purpose systems)• Relatively small amounts of online disk space• Disk and tape throughput issues– Islands of disk
• Lack of cross-site data analysis capabilities
Scientific Advisory Committee – September 2011
COLA Information Systems
Onsite Computing Benefits• Complete autonomy and flexibility– Scientists have easy access to IS staff– IS staff can quickly respond to science needs
• Fast, cost-effective, large spinning disk volume• Subsets from all remote projects stored and
analyzed locally• Shared data and analysis methods support broad
COLA-wide collaborations
Scientific Advisory Committee – September 2011
COLA Information Systems
Data Management
• New, more scalable design – builds on collaboration with NCAR-CISL
• Isolate and curate frequently used static data (shared)– Active oversight by Data Management Committee– Scientists make data management decisions
• Organize remaining data files and scripts (project)• Tape archive and retrieval• Catalog shared data
/project33 TB
/shared100 TB
Scientific Advisory Committee – September 2011
COLA Information Systems
Future of COLA Computing
• We are anticipating challenges from thenext generation of data sets:– From community: CFSRR, CMIP5, etc.– COLA-generated: ISI, Decadal, …
• Athena project foreshadowed many obstacles• 3-year plan now in place to meet these challenges – expecting to learn valuable lessons– GrADS is an essential component of strategy