41
1 Building the NSDL William Y. Arms Cornell University Thinking aloud about the NSDL

1 Building the NSDL William Y. Arms Cornell University Thinking aloud about the NSDL

Embed Size (px)

Citation preview

Page 1: 1 Building the NSDL William Y. Arms Cornell University Thinking aloud about the NSDL

1

Building the NSDL

William Y. ArmsCornell University

Thinking aloud about the NSDL

Page 2: 1 Building the NSDL William Y. Arms Cornell University Thinking aloud about the NSDL

2

Acknowledgement and Disclaimer

The NSDL is a program of the National Science Foundation's Directorate for Education and Human Resources, Division of Undergraduate Education.

The ideas discussed in this talk do not represent the official views of the NSF (or of anybody except the author).

Page 3: 1 Building the NSDL William Y. Arms Cornell University Thinking aloud about the NSDL

3

What's in a name?

Page 4: 1 Building the NSDL William Y. Arms Cornell University Thinking aloud about the NSDL

4

SMETE

Science, Mathematics, Engineering and Technology Education

The NSDL

National Digital

Library

Page 5: 1 Building the NSDL William Y. Arms Cornell University Thinking aloud about the NSDL

5

Science?

The NSDL

National Digital

Library

Can we build a comprehensive digital library for science education, without building a National Science Digital Library?

Page 6: 1 Building the NSDL William Y. Arms Cornell University Thinking aloud about the NSDL

6

The National Science Digital Library

Page 7: 1 Building the NSDL William Y. Arms Cornell University Thinking aloud about the NSDL

7

The National Science Digital Library

It's BIG!

Page 8: 1 Building the NSDL William Y. Arms Cornell University Thinking aloud about the NSDL

8

To be comprehensive—all branches of science, all levels of education, very broadly defined:

Five year targets

1,000,000 different users

10,000,000 digital objects

100,000 independent sites

How big might the NSDL be?

Page 9: 1 Building the NSDL William Y. Arms Cornell University Thinking aloud about the NSDL

9

Scientific and technical information in digital form

Materials used in education

Digital collections for science

Materials tailored toeducation

Page 10: 1 Building the NSDL William Y. Arms Cornell University Thinking aloud about the NSDL

10

Page 11: 1 Building the NSDL William Y. Arms Cornell University Thinking aloud about the NSDL

11

Page 12: 1 Building the NSDL William Y. Arms Cornell University Thinking aloud about the NSDL

12

Page 13: 1 Building the NSDL William Y. Arms Cornell University Thinking aloud about the NSDL

13

Page 14: 1 Building the NSDL William Y. Arms Cornell University Thinking aloud about the NSDL

14

Page 15: 1 Building the NSDL William Y. Arms Cornell University Thinking aloud about the NSDL

15

Opportunities for the NSDL

• Categories of material that have been given lower priority by libraries and publishers, e.g., datasets, software, and other dynamic content, ...

• Materials that are accessible for automatic processing, e.g., scientific web sites and databases, image collections, ...

• Materials designed for education, e.g.,learning objects, curricula, problem sets, ...

Less opportunity for the NSDL

• Conventional scientific literature with restricted access

Page 16: 1 Building the NSDL William Y. Arms Cornell University Thinking aloud about the NSDL

16

Page 17: 1 Building the NSDL William Y. Arms Cornell University Thinking aloud about the NSDL

17

The NSF's strategy

Page 18: 1 Building the NSDL William Y. Arms Cornell University Thinking aloud about the NSDL

18

The NSF cannot fund all collections

Page 19: 1 Building the NSDL William Y. Arms Cornell University Thinking aloud about the NSDL

19

The NSF is funding selected collections ...

Page 20: 1 Building the NSDL William Y. Arms Cornell University Thinking aloud about the NSDL

20

The Core Integration task is to provide a coherent set of services for users

across great diversity.

... and a Core Integration team

Page 21: 1 Building the NSDL William Y. Arms Cornell University Thinking aloud about the NSDL

21

Resources

Core Integration

Budget $4 million

Staff 25 - 30

Management Diffuse How can a small team, without direct management control, create a very large-scale digital library?

Page 22: 1 Building the NSDL William Y. Arms Cornell University Thinking aloud about the NSDL

22

A spectrum of interoperability

Page 23: 1 Building the NSDL William Y. Arms Cornell University Thinking aloud about the NSDL

23

Approaches to interoperability

The conventional approach

Wise people develop standards: protocols, formats, etc.

Everybody implements the standards.

This creates an integrated, distributed system.

Unfortunately ...

Standards are expensive to adopt.

Concepts are continually changing.

Systems are continually changing.

Page 24: 1 Building the NSDL William Y. Arms Cornell University Thinking aloud about the NSDL

24

Interoperability is about agreements

Technical agreements cover formats, protocols, security systems so that messages can be exchanged, etc.  Content agreements cover the data and metadata, and include semantic agreements on the interpretation of the messages.  Organizational agreements cover the ground rules for access, for changing collections and services, payment, authentication, etc.

The challenge is to create incentives for independent digital libraries to adopt agreements

Page 25: 1 Building the NSDL William Y. Arms Cornell University Thinking aloud about the NSDL

25

Function versus cost of acceptance

Function

Cost of acceptance

Many adopters

Few adopters

Page 26: 1 Building the NSDL William Y. Arms Cornell University Thinking aloud about the NSDL

26

Example: Textual mark-up

Function

Cost of acceptance

SGML

ASCII

HTML

XML

Page 27: 1 Building the NSDL William Y. Arms Cornell University Thinking aloud about the NSDL

27

Federations

Collections follow strict standards for content, metadata, protocols, authentication, etc.

Harvested Collections

Each collection makes metadata about its collections available in a simple exchange format (Open Archives metadata harvesting protocol).

Gathered Collections

Material is gathered automatically by selective web crawling.

Levels of interoperability

Page 28: 1 Building the NSDL William Y. Arms Cornell University Thinking aloud about the NSDL

28

Levels of interoperability

Level Agreements Example

Federation Strict use of standards AACR, MARC(syntax, semantic, Z 39.50and business)

Harvesting Digital libraries expose Open Archivesmetadata; simple

protocol and registry

Gathering Digital libraries do not Web crawlerscooperate; services must and search enginesseek out information

Page 29: 1 Building the NSDL William Y. Arms Cornell University Thinking aloud about the NSDL

29

Metadata is expensive

The NSDL cannot afford to create it manually

Page 30: 1 Building the NSDL William Y. Arms Cornell University Thinking aloud about the NSDL

30

User portals

Distributed collections

Metadata repository

Page 31: 1 Building the NSDL William Y. Arms Cornell University Thinking aloud about the NSDL

31

Every collection is different

Page 32: 1 Building the NSDL William Y. Arms Cornell University Thinking aloud about the NSDL

32

From an NSF-funded collection: “We are pleased with the technical side…of the database and web access…but we are complete novices in terms of how to make our collection part of the digital library. I assume this hinges on appropriate metadata, but I am not sure exactly what kinds…”

Page 33: 1 Building the NSDL William Y. Arms Cornell University Thinking aloud about the NSDL

33

Metadata strategy

• Support eight standard formats

• Collect all existing metadata in these formats

• Provide crosswalks to Dublin Core

• Expose records in the metadata repository for others to harvest

• Concentrate on collection-level metadata

• Use automatic generation to augment item-level metadata

Most Core Integration services will be created automatically from collection-level metadata or directly from the content (e.g automatic indexing of text, automatic reference linking).

Page 34: 1 Building the NSDL William Y. Arms Cornell University Thinking aloud about the NSDL

34

Managing the NSDL

Responsibility without authority

Page 35: 1 Building the NSDL William Y. Arms Cornell University Thinking aloud about the NSDL

35

A personal observation

Despite all the evidence to the contrary, ...

we repeatedly over-estimate the benefits of collaboration ...

and under-estimate the obstacles.

Page 36: 1 Building the NSDL William Y. Arms Cornell University Thinking aloud about the NSDL

36

During the preliminary phases ...

• Each project worked independently (NSF grants have little control)

• Coordination was through a loose set of committees, with mailing lists, bulletin boards, etc.

The NSDL challenge

Page 37: 1 Building the NSDL William Y. Arms Cornell University Thinking aloud about the NSDL

37

During the preliminary phases ...

• Each project worked independently (NSF grants have little control)

• Coordination was through a loose set of committees, with mailing lists, bulletin boards, etc.

For the production phase ...

• We must develop a robust, reliable set of services

• We must make compromises, decide priorities, etc.

• Yet we must attract the energy of many independent individuals and organizations

The NSDL challenge

Page 38: 1 Building the NSDL William Y. Arms Cornell University Thinking aloud about the NSDL

38

What doesn't workDecision making by online forums

• Become dominated by a few people, not necessarily the most knowledgeable.

• Either usage dies away, or too many low-value messages drive away the busy people.

Decision making without responsibility

• Vision is easy. Implementation is hard.

Page 39: 1 Building the NSDL William Y. Arms Cornell University Thinking aloud about the NSDL

39

What does work?Money

• Thank you NSF!

Online discussions on specific topics

• Structured discussions as part of a decision-making process are often productive

Patience and persistence

Success builds on success

Page 40: 1 Building the NSDL William Y. Arms Cornell University Thinking aloud about the NSDL

40

The last word

From the Lisle, NY Volunteer Fire BrigadeSeptember 17,2001

United we stand.

God bless America.

Bingo, Tuesday 7:30 - 10:00.

Page 41: 1 Building the NSDL William Y. Arms Cornell University Thinking aloud about the NSDL

41

Building the National SDigital Library

William Y. ArmsCornell University