Winds of Change: a taxonomy of clouds - oclc.org · Dewey Decimal Classification 23rd Edition...

Preview:

Citation preview

Winds of Change:

a taxonomy of clouds

for libraries

Jay JordanPresident and CEO

OCLC

Matt GoldnerProduct & Technology Advocate

OCLC

IFLA

Puerto Rico

16 August 2011

IFLA President 2009–2011

Ellen R. Tise

Senior Director

Library and Information Services

Stellenbosch University

Stellenbosch, South Africa

Jay Jordan IFLA/OCLC FELLOWS 2011

Cloud Computing and Libraries

Almost all of us use cloud computing

Shifting to the cloud

• Businesses are moving to the cloud

• Success stories

• Some questions still

• A great deal of debate about what is cloud computing

• Consider libraries and the cloud

• How they can benefit from it

• What to consider before moving

What is cloud computing

A style of computing in which scalable and elastic IT-

enabled capabilities are delivered as a service to external

customers using Internet technologies. – Gartner Group

Infrastructure Platform Applications Services

KPMG

How is cloud computing different

• PC world

• Monolithic proprietary operating systems (OS) and

programs

• Long development cycles

• Software design isolated to single application

• Cloud world

• Hardware and functionality on the network

• Cloud becomes development platform and OS

• Reusable constantly updated software components

• Can be embedded or loosely coupled

Not new for libraries

• Large union catalogs

• Online databases

• But a look outside libraries is warranted

Why businesses adopt cloud computing

70% 30%INFRASTRUCTURE INITIATIVE

Amazon.com: http://www.slideshare.net/goodfriday/amazon-web-services-building-a-webscale-computing-architecture

Before cloud computing…After cloud computing

Minnesota Online High School (MNOHS)

• Distributed software

• Many different computers

• Send CDs or Guide students through downloads

• Cloud based software

• Student work not on local PC

• All applications and data in the cloud

• Laptop lost, student work isn’t

• Result

• Shifted energy from technology to education

What can cloud computing do for libraries?

• Amplify the power of cooperation

• Build significant unified Web presence

• Save time and money on technology

• Create efficient workflows

Areas to improve

• Most library systems are pre-Web technology

• Costly and difficult to integrate

• Store and maintain same data thousands of times

• Scattered data weakens Web presence

• Collaboration is difficult and expensive

• Information seekers are in common Web

environments

• Distributed computing power is under utilized

Group to three areas of improvement

• Technology

• Data

• Community

Technology – Why it matters

• Cloud computing solutions

• Built on current technology

• Architected to allow technology shifts

• Library systems

• Developed pre-Internet / Web

• Proprietary and closed

• Costly to integrate to new technologies

• Hard to integrate to external systems

Technology – What can change

• Service Oriented Architecture (SOA)

• Published Application Program Interfaces (API)

• Any programmer can work integrate/extend software

• Libraries not dependent on vendor for everything

Andrew Pace – ― … demands fall short by merely

asking that local systems avail themselves of other

Web services rather than establishing themselves as

services in their own right.‖

Technology – What can change

• Shift focus from technology to:

• Collection building

• Patron services

• Innovation

• No longer maintain servers and software stack

• Use technical skills to extending services

Data efficiencies

• Creation, storage, maintenance and backup

• Easily shared

• Common data opened to everyone

• Some Private data shared by agreement

• Libraries can achieve Web scale

• Massively aggregate data +

• Useful services built on the data, can attract

• Massively aggregated users

• Data now considered more useful to search engines

Community power

• Create an online information network

• Internal collaboration in a single library system

• Internal collaboration between library systems

• External community of information seekers

• The ―Network‖ effect

• Creates scale of savings and efficiencies

• Wider recognition for libraries

• Cooperative intelligence for decision making

• Platform to innovate on

Libraries and discovery services in the cloud

• First aggregate the data

• Extend discovery beyond traditional library

collections

• Explosion of mass digitization and digital

aggregation

• Hathi Trust

• OAIster

• Europeana

Libraries and discovery services in the cloud

• Aggregated user opinion and use = Recommender

services

Beyond library discovery services in the cloud

Marshall Breeding – ―We can’t let the current focus

on front-end interfaces make us complacent

about the software systems that we use to

automate routine library functions‖

• Gain internal and community efficiencies

• Cataloging librarians

• Acquisitions librarians

• Serials librarians

• Electronic resource librarians

• Change in collections = blur in job roles

Beyond library discovery services in the cloud:

a Cooperative Platform

• Openness

• Extensibility

• Data richness

• Collaboration

Caveat emptor (or Buyer beware)

• First questions to ask:

• Will it make my library more efficient?

• Will it help my library offer better service?

Security and Privacy

• Two aspects:

• Technical

• Legal

• Not exclusive to cloud solutions

Scalable and reliable

• Why multi-tenancy architecture matters

• Redundancy

• Data

• Services

Data ownership and rights

• Who owns your data?

• What are your access rights to your data?

• What are provisions for business failure?

Architecture

• Open

• Service oriented

• Mashable and extensible

Conclusion

• Opportunity to improve services and relevance

• Cloud computing is One avenue

• Libraries can:

• Take advantage of rapidly emerging technologies

• Increase visibility and accessibility of collections

• Reduce duplication of effort

• Streamline workflows

• Create cooperative intelligence and improved service

• Make libraries greener

Conclusion

• Allow libraries and their users to participate in an

information network

• Create a powerful, unified presence on the Web

• Give our users a local, regional and global reach

Public purposes:

Further access to the world’s information

Reduce the rate of rise of library costs

72,035 libraries in 170 countries

The OCLC cooperative:

a nonprofit, membership organization

OCLC: 20 offices in 10 countries

Australia:

Footscray, Victoria

Perth, Western Australia

Canada:

Calgary, Alberta

Brossard, Quebec

Winnipeg, Manitoba

China:

Beijing

France:

Asnières sur Seine

Germany:

Berlin

Bonn

Mannheim

Oberhaching

United States:

Dublin, Ohio

Overland Park, Kansas

San Mateo, California

Seattle, Washington

Mexico:

Mexico City

Netherlands:

Leiden

Switzerland:

Basel

United Kingdom:

Birmingham

Sheffield

New OCLC governance structure

25,900+institutions

Members

3councils

Regional Councils

48members

Global Council

16trustees

Board of Trustees

OCLC’s Member leadership

Global Council

Asia Pacific Regional Council EMEA Regional Council Americas Regional Council

OCLC Global Council 2010-2011

Global Council: Leadership transition

President

2010-2011

2011-2012

Jennifer Younger

Vice-President/President-elect

Berndt Dugall ChewLeng Beh

Berndt Dugall

OCLC Trustees

OCLC strategic directions

Develop new

Web-scale services

with libraries

and

Maintain and enhance existing services

Enhancements to services in 2010/2011

Connexion client 2.30

Connexion browser

Direct Request

for Articles

Resource Sharing:

deflection by custom holdings group,

new reciprocity reportsILLiad 8.1

QuestionPoint 1.78

CONTENTdm 6

Digital Collection Gateway 2.0

Local Holdings Record Summary service

for WorldCat Local migration

WebDewey 2.0

WorldCat synchronization with HEBIS

CBS 6.0

Amlib 5.3

OLIB 8.1

SunRise V4.0

LBS4

TouchPoint 1.6

Fiscal 2011: Enhancements to library management

systems

BOND

Dewey Decimal Classification 23rd Edition

200,000 libraries in 138

countries

New translations

• French, German, Hebrew,

Italian, Spanish, Swedish

• Arabic under development

OCLC systems

Servers:

• 750 Intel / AMD

• Highly virtualized

• 1 IBM Mainframe

Storage:

• 600 Terabytes

• 10,000 slot Tape Library

Data Centers:

• Dublin, Westerville, Ohio (today)

• EMEA, Australia—2011

• Canada, EMEA 2—2012

Operations Center, Kilgour Building, Dublin, Ohio, USA

OCLC system performance

22 million transactions a day!

Transactions Per Second (TPS):

peak: 400 transactions per second

average: 225 transactions per second

Average Response time:

<.5 seconds

Maximum concurrent sessions:

15,000

Availability snapshot: Fiscal 2011

Service Monthly Quarterly Fiscal year to date

All Services 100.00% 99.97% 99.88%

Connexion 100.00% 100.00% 99.94%

Resource Sharing 100.00% 100.00% 99.93%

WMS 100.00% 99.52% 99.56%

WorldCat Local 100.00% 99.97% 99.83%

WorldCat today

239.2 millionrecords

1.75+ billionholdings

16 August 2011

WorldCat growth since 1998

Millions of records

39 41 44 47 50 52 5561

67

86

108

139

197

234

0

50

100

150

200

250

1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011

2010

57.5%

Percentage of records

for non-English materialsTotal Records

English

German

French

Spanish

Chinese

Italian

Dutch

Japanese

Latin

Russian

June 30,2010

197 m

83.7m

25.2m

18.1m

8.2 m

5.1 m

3.5 m

3.3 m

3.0 m

3.0 m

3.0 m

Multilingual WorldCat

26 August 1971

WorldCat: Happy 40th!

OCLC: Building Web scale with libraries

Represent the full range of

member collections and services

where the library user is

Advance the future of libraries through

research, advocacy and community making

Create

system-wide

efficiencies in

library management

Build global

infrastructure which

is responsive to local

conditions

OCLC Research: Agenda

1. Research information management

2. Mobilizing unique materials

3. Metadata support & management

4. Infrastructure & standards support

5. System-wide organization

6. User behavior studies & synthesis

ResearchExploration, innovation and community for libraries, archives and museums

Cloud-sourcing collection management

“We are looking at how research

libraries can operate more

efficiently by cloud-sourcing

legacy print collections.”

NYU

Libraries

ReCAP

HathiTrust

Virtual International Authority File

Principals•Library of Congress/NACO

•Deutsche Nationalbibliothek

•Bibliothèque nationale de France

•OCLC

ContributorsNational Library of Australia

National Library of the Czech Republic

Bibliotheca Alexandrina (Egypt)

Getty Research Institute

National Library of Israel

Istituto Centrale per il Catalogo Unico (Italy)

Biblioteca Nacional de Portugal

Biblioteca Nacional de España

National Library of Sweden

Swiss National Library

Vatican Library

NUKAT Center (Poland)

Library and Archives Canada

National Széchényi Library (Hungary)

RERO (Switzerland)

Russian State Library [Currently in test]

Applied research, community building and prototyping of services

in support of research and learning

OCLC Research Library Partnership

OCLC’s pro bono activities

CONSER (host database)

Virtual International Authority File

U.S. Newspaper Program (host database)

OCLC/ALISE Library & Information

Science Research Grant Program

Jay Jordan IFLA/OCLC Early Career

Development Fellowship Program

4th China-U.S. Library Conference

PREMIS Working Group

FBI stolen rare book project

Bosnian National Library

Virtual Collection

OCLC Research

FRBR algorithm

OAI-Harvester

OCLC/LC Fiction Project

Open source software

Membership/research reports

Advocacy programs

Symposia

Videoconferences

WorldCat Local: More than books!

193 million books and…

• 526+ million articles

• 11+ million e-books

• 1,424

databases/collections

Direct links to full-text

articles and open-access

objects

WorldCat Local: Mobile access included

WorldCat Local:

University of California, BUCLE, HathiTrust

2.1 billion items and growing!

239 million bib records

1.75+ billion holdings

526.6 million records 30 million items(Google, HathiTrust, OAIster)

Physical holdings

in WorldCat

Licensed digital

content/articles in

library collections

Local library content

being digitized

Representing the collective collection

Developer Network

170 registered members

Most popular

WorldCat Search API

xISBN, xISSN

WorldCat Registry

WorldCat Identities

QuestionPoint Knowledge Base

xOCLCNUM

70 applications built

20 million calls/month

WorldCat Identities

EasyBib Library Edition:

Bring your library to your students

WorldCat knowledge base:

shortest path to full text

End users get full text the library subscribes to

WorldCat

knowledge base

Collection data

Title data

Linking logic

Holdings

Library

Subscription

dataFull text

End users

EZproxy: Hosted service

EZproxyDatabase

and content

Remote

content

providerEZproxy verifies the

user’s credentials

Your library’s

authentication

method

Web

server

Your library

Change in the library: the impact of

OCLC Web-scale Management Services

UsersPrint

Vendors

LibraryOPAC

ILS

Circulation

Cataloging

Self

Service

Acquisitions

Cataloging

Utility

National/

Global

System

Consortial

System

Electronic

Vendor

A to Z

List

Resolver

ERM

Institutional

Repository

Meta-

search

Data

Library

Users Suppliers

Partners

WMS early adopters:

Norway, The Netherlands, Canada

What’s ahead?

What’s ahead:

OCLC License Manager

A step toward Web-scale management of:

•subscriptions

•licenses

•rights

•local and remote access to licensed

and e-resources

What’s ahead:

2012

New functionality for management systems:

•CBS

•Amlib

•BIBLIOTHECA

•LBS

•OLIB

•SunRise

What’s ahead:

Collection Builder

Unified and consistent

collection-level management

across print, licensed and

digital materials

What’s ahead:

Improve quality control

Global Library and Museum

Identifier (GLIMIR)

What’s ahead:

New platform

• Open

• Extensible

• Extended

Fred Kilgour’s Strategic Plan—1967

• Online union catalog

and shared cataloging

• Serials control

• Technical processing

(acquisitions)

• Interlibrary loan

• Retrieval by subject

(reference)

• Remote catalog access

and circulation control

OCLC data and formats

WorldCat

WorldCat Identities

Terminologies

Dewey Decimal Classification

Knowledge Base

WorldCat Registry

MARC

Dublin Core

XML

Current authority control copies in

well-formatted string of characters to the

destination record/entry.

Value proposition:

Well-formatted data

Rupert Murdoch

Photo: ronkayela.com

News Corporation

Geoff Pryor

Fox News

James Murdoch

Names

Outfoxed Rupert

Murdoch’s war on

journalism

Business the

Rupert Murdoch

Way…

The man who

owns the news…

Works

Australia Biography

Broadcasting

Fox News

Mass media

Newspaper Publishing

Concepts

(subset of links in WorldCat Identities)

With consistent & coherent data

infrastructure, headings have

explicit navigational links that can

be exposed and traversed.

Rupert Murdoch

Copy string of characters and create

navigation links to ―web of data‖.

Value Proposition:

Well-formatted data AND

Automatic data enrichment and navigation

News Corporation

Geoff Pryor

Fox News

James Murdoch

Outfoxed Rupert

Murdoch’s war on

journalism

Business the

Rupert Murdoch

Way…

The man who

owns the news…

Australia Biography

Broadcasting

Fox News

Mass media

Newspaper Publishing

Names

Works

Concepts

Rupert Murdoch

Thank You!