86
Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science Workshop Puebla, Mexico, June 10-12, 1998 Edward A. Fox Virginia Tech, Blacksburg, VA http://fox.cs.vt.edu [email protected]

Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Embed Size (px)

Citation preview

Page 1: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD)

and the Computer ScienceTeaching Center (CSTC)

3rd Computer Science WorkshopPuebla, Mexico, June 10-12, 1998

Edward A. FoxVirginia Tech, Blacksburg, VA

http://fox.cs.vt.edu [email protected]

Page 2: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

THANKS Hosts for this Workshop NSF

– Funding since 1984: AI, Comp. Linguistics, DL, Education, E-pub, HCI, IR, MM, Networking

US Dept. of Education (FIPSE) Contributors

– Adobe, CNI, CGS, CSGS, IBM, Microsoft, OCLC, SURA, ...

– Collaborating universities (you too, soon, I hope!)

Page 3: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

NDLTD PIs:Ed Fox - Computer Science ([email protected])

John Eaton - Graduate School ([email protected])Gail McMillan - Library ([email protected])

CSTC PIs:Deborah Knox - The College of NJ

Scott Grissom - U. Illinois Springfield(Rachelle Heller - GWU - CRIM)

Infrastructure: Net.Work.Virginia (vBNS, Internet 2, LMDS,

Blacksburg Electronic Village, FDI)

ACKNOWLEDGEMENTS / CONTEXT

Page 4: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

A Complete Approach to Advanced Network Access

Virginia Tech

Mid-Atlantic Crossroads (MAX)

Regional/National Leadership

NET.WORK.VIRGINIA Statewide Access

LMDS / Local NAPS Last Mile

Page 5: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

vBNSInternet2

ESnet

jmc 1/3/97

OC3 OC3

OC3

Sprint ROA

Net.Work.Virginia ArchitectureBackbone / Internet Gateway

DS3

Internet

SprintLinkRouter

OC3Sprint WTN

Sprint RIC

Page 6: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

NET.WORK.VIRGINIA

222 Sites by Type and BandwidthMay, 1998

Higher Education

Localities / Libraries

K-12 Education

DS-3 ( 45 Mbps)

OC-3 (155 Mbps)

DS-1 (1.544 Mbps)

State Government

Frame Relay - ATM (FRANI)

Page 7: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Faculty Development

InitiativeEnglish

Philosophy

SociologyPsychologyPolitical Science

HistoryBiology

Physics

Cyberschool I and II

Women's Studies

Religious Studies

ChemistryBiochemistry

Communications

Newman LibraryInformation Systems

Art & Art History

Computer Science

Music

Educational Technologies

Black Studies

Mathematics

Education

Center for Excellence in Undergraduate Teaching

University Honors Human Resources

Page 8: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

URLsCyberschool

http://www.cyber.vt.edu

Cyberschool White Papers http://www.cyber.vt.edu/wp

ACCESS Project http://www.edtech.vt.edu/access

aTECH Projecthttp://juno.atech.vt.edu

LIT Projecthttp://athena.english.vt.edu/LIT/LIT.html

Page 9: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

DigitalLibraries

Why of global interest?Why of interest in computing?DefinitionsNSF Digital Libraries Initiative

Page 10: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

DLs: Why of Global Interest? National projects can preserve antiquities and heritage:

cultural, historical, linguistic, scholarly Knowledge and information are essential to economic and

technological growth, education DL - a domain for international collaboration

– wherein all can contribute and benefit

– which leverages investment in networking

– which provides useful content on Internet & WWW

– which will tie nations and peoples together more strongly and through deeper understanding

Page 11: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Why of Interest in Computing? Next step in fields of DBMS, HT, IR, MM Efficiency requires advances in, e.g.,

– algorithms and data structures (ex., MPHF)– networking (ex., HTTP-NG)– OS (ex., support for streams)

Effectiveness requires advances in, e.g.,– AI (ex., multilingual texts, user adaptation)– HCI (ex., visualization, DLs embedded in activities)

CS Educ. can benefit; CS can aid Dist. Educ.

Page 12: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

DLs: Definitions Super information systems KDI with persistence, organization, usability Libraries extended to include collections of digital objects

and to provide expanded services to distributed user communities without prior limitations of space, time, physical instantiation

Latest implementation of visions of Bush, Licklider, Nelson, and previous scholars

Systems, services, institutions, enterprises, and projects of the digital library community

Page 13: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

DIGITAL LIBRARIES INITIATIVEFunded through a joint initiative of:

National Science FoundationDefense Advanced Research Projects Agency

National Aeronautics and Space Administration

Stephen M. GriffinDivision of Information and Intelligent Systems

National Science Foundation

[email protected]

http://dli.grainger.uiuc.edu/national.htmNational Synchronization Home Page

Page 14: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Computing (flops)Digital content

Com

mun

icat

ions

(ban

dwid

th, c

onne

ctiv

ity)

Locating Digital Libraries in Computing andCommunications Technology Space

Digital Libraries technologytrajectory: intellectualaccess to globally distributed information

less more

Page 15: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Core Sponsors: NSF, DARPA, NLM, LoC, NASA, NEH ~$8-10 million/yr for 4-5 years (beginning FY98) sponsor a full-spectrum of activities

– fundamental research, content & collections development, domain applications, testbeds, operational environments, new resources for education and preserving America’s cultural heritage

address topics over entire DL lifecycle– information creation, dissemination, access, use, preservation, impact, contexts

implement a modular, open program structure– add new sponsors, performers, projects at any time

Digital Libraries Initiative - Phase 2

Program Goals: new DL research, technologies and applications to advance

the use of distributed, networked information of all types around the nation and the world

Page 16: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Digital Libraries Initiative - Phase 2

Planning Underway to Secure Funding and Launch Full-Scale

Program of Support for International Collaborative

Activities in Digital Libraries Beginning in FY 1999

Page 17: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Goals for the Future Gather information and build collections

(to understand the incompleteness of our knowledge)

• Create new communities(to communicate and collaborate)

• Make technology disappear(from our awareness and experience)

Page 18: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

NDLTDNews, Background/HistoryVision, Benefits, Approach,

PossibilitiesConcerns, Problems, OppositionSolutions, Implementation, Results,

Plans

Page 19: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

NEWS,BACKGROUND/

HISTORY

Page 20: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Media

ETD Web Sitehttp://www.ndltd.org/

ETDs Got Your Interest?

Graduate Students

ChronicleNPRNY Times ...

U. Laval

Page 21: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

What led to today’s situation? 1987 mtg in Ann Arbor: UMI, VT, … 1992 mtg in Washington: CNI, CGS, UMI, VT and 10

universities with 3 reps each 1993 mtg in Atlanta to start Monticello Electronic

Library (MEL): SURA, SOLINET 1994 mtg in Blacksburg re ETD project: std of PDF +

SGML + multimedia objects 1996 funding by SURA and US Dept. of Education

(FIPSE) for regional, national projects (NDLTD)

Page 22: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

ETDInitiative

SGML (1985)

PDF(1992)

DL (1994)

LibraryCancellations

(1988)

UniversityScholarlyElectronic

Pub. (1988)

Info.Literacy(1995)

GraduateEducation

Internet(1984)

WWW(1994)

Multimedia(1986)

Page 23: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Aiding universities to enhance grad educ., publishing and IPR efforts: to help improve the availability and content of theses and dissertations

Educating ALL future scholars so they can publish electronically and effectively use digital libraries (i.e., are Information Literate and can be more expressive)

Demonstrating how, for other organizations

What are we doing?

Page 24: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

VISION,BENEFITS,

APPROACH,POSSIBILITIES

Page 25: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

A Digital Library Case Study

Electronic theses and dissertations (ETDs)

Submission: http://etd.vt.edu

Collection: http://www.theses.org

Networked Digital Library of Theses and Dissertations (NDLTD) http://www.ndltd.org (formerly “National” because of Fed. funds, before international members started joining)

Page 26: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Something for EveryoneStudents - contribute -> gain acclaimUniversities - join -> help your students, gain

increased DL experience + visibilityResearchers - use, encourage -> contentPublishers - liaise, support -> have more

knowledgeable authors + backup detailsDL enthusiasts - adapt resources / ideas -> have

exemplary pilot / model project

Page 27: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

What are the key ideas? People can switch to electronic documents

– Becoming more expressive with hypermedia Mandating ETDs will change all future

scholarship Scalability

– Empower authors to submit to DL, as a natural part of the educational process

– Study workflow & apply automation, so institutions streamline processing and build their part of the DL

– Federate along most suitable cultural/political lines

Page 28: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Key Ideas: Networked infrastructure

Scalability

Education is the rationale

University collaboration

Workflow, automation

Authors must submitMaximal access

PDF, SGML, MMStandards

Federated search

8th graders vs. grads

MARC, DC, URNs

Page 29: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

What are the benefits?

Save students money Save handling, shelf space in libraries Build the Networked Digital Library of Theses and

Dissertations: with faster, broader, and less expensive access

Demonstrate how universities can work together directly (vs. indirectly through publishers or associations)

Page 30: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

What are the long term goals? 400K US students / year getting grad degrees are exposed

/ involved 200K/yr rich hypermedia ETDs that may turn into

electronic portfolios Dramatic increase in knowledge sharing: lit. reviews,

bibliographies, … Services providing lifelong access for

students/researchers: browse, search, prior searches, citation links, ...

Page 31: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Record all work with NDLTD, return to prior situation, prepare bibliography

Powerful (multilingual, text, image) searching, browsing (with categories), following citation links

Support collaboration with others in same field: help with literature review, sharing tools and data sets, applying their methods

Undergraduate honors thesis: Todd Miller

Grad Student Assistant?

Page 32: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Increase local interchange among students, faculty, library, graduate school

Increase international understanding, building many more invisible colleges, with students more empowered

Connect graduate researchers with undergrads, who can access ETDs / them

Facilitate direct university collaboration, explicitly, in reshaping publishing world

Social Capital?

Page 33: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

How are ETDs being done at Virginia Tech?

Some produced w. SGML (XML) Most produced using standard word processing

packages as PDF files– LaTeX class, outline fonts, Distiller– Word template, export to PDF (XML)

Reviewed by the Graduate School Cataloged and archived by the library Ds downloaded by UMI from server

Page 34: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

NDLTD

Computer Resources

Research

Literature

Student Prepares Thesis or Dissertation

Page 35: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Student Defends and Finalizes ETD

My Thesis

ETD

Page 36: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Student Gets Committee Signatures and Submits ETD

Signed

Grad School

Page 37: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Graduate School Approves ETD Student is Graduated

Ph.D.

Page 38: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Library Catalogs ETD and New StudentsHave Access to the New Research

WWW

NDLTD

Page 39: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Status of the Local Project Approved by university governance Spring

1996; required starting 1/1/97 Submission & access software in place Submission workshops for students (and

faculty) occur often: beginner/adv. Faculty training as part of Faculty

Development Initiative Over 1000 ETDs in collection

Page 40: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Statistics 30-40K accesses/week to WWW site 300K accesses of ETD HTML pages

– 80K downloads of PDF version of ETDs– 5 most popular ETDs: 10K, 8K, 2800, 2500, ...

Multimedia content: about 65% have some– 45% have images, 5% have movies– 1 w. 85 VRML files, 1 w. 378M Director file

See details: http://scholar.lib.vt.edu/theses/ OCLC has about 3.5M TD MARC records UMI Dissertation Abstracts has 1.5M entries

Page 41: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Initial Stats

1996 1997

Total successful requests: 37,171 247,573Average successful requests per day: 102 685Requests for .PDF files ( full ETDs): 4,600 72, 854Requests for .HTML file 28,225 129,831Distinct hosts served 9,015 22,725Total data transferred: 3,229M 25,953MAverage data transferred per day: 9,038K 73,574K

Page 42: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Popular Works 1996458 Seevers, Gary L. Identification of Criteria for Delivery of Theological Education Through Distance Education: An International Delphi Study (Ph.D., Educational Research and Evaluation, April 1993; 1353Kb)

432 Hohauser, Robyn Lisa. The Social Construction of Technology: The Case of LSD (MS in Science and Technology Studies, Feb. 1995; 244Kb)

390 Childress, Vincent William. The Effects of Technology Education, Science, and Mathematics Integration Upon Eighth Grader's Technological Problem-Solving Ability (Ph.D. in Vocational and Technical Education, July 1994; 285Kb)

310 Kuhn, William B. Design of Integrated, Low Power, Radio Receivers in BiCMOS Technologies (Ph.D. in Electrical Engineering, Dec. 1995; 2Mb)

287 Sprague, Milo D. A High Performance DSP Based System Architecture for Motor Drive Control ( MS in Electrical Engineering, May 1993; 878Kb)

165 Wallace, Richard A. Regional Differences in the Treatment of Karl Marx by the Founders of American Academic Sociology (MS in Sociology, Nov. 1993; 479Kb)

150 McKeel, Scott Andrew. Numerical Simulation of the Transition Region in Hypersonic Flow (Ph.D. in Aerospace Engineering, Feb. 1996; 3Mb)

Page 43: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Popular Works 19979920 Liu, Xiangdong. Analysis and Reduction of Moire Patterns inScanned Halftone Pictures (Ph.D. in Computer Science, May 1996; 6.6Mb)

7656 Petrus, Paul. Novel Adaptive Array Algorithms and Their Impact on Cellular System Capacity (Ph.D. in Electrical Engineering, March 1997; 5Mb)

2781 Agnes, Gregory Stephen. Performance of Nonlinear Mechanical, Resonant-Shunted Piezoelectric, and Electronic Vibration Absorbers for Multi-Degree-of-Freedom Structures (Ph.D. in Engineering Mechanics, Sept. 1997; ? + 7926Kb)

2492 Gonzalez, Reinaldo J. Raman, Infrared, X-ray, and EELS Studies of Nanophase Titania (Ph.D. in Physics, July 1996; 4607Kb)

1877 Shih, Po-Jen. On-Line Consolidation of Thermoplastic Composites (Ph.D. in Engineering Mechanics, Feb. 1997; 3.3Mb)

1791 Saldanha, Kevin J. Performance Evaluation of DECT in Different Radio Environments (MS in Electrical Engineering, Aug. 1996; 3.2Mb)

1431 DeVaux, David. A Tutorial on Authorware (MS in CS, April 1996; 2.3Mb)

1394 Kuhn, William B. Design of Integrated, Low Power, Radio Receivers in BiCMOS Technologies (Ph.D. in Electrical Engineering, Dec. 1995; 2518Kb)

Page 44: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

International Use 1996 1997 850 2,922 United Kingdom 608 2,501 Australia 346 2,378 Germany 713 2,367 Canada 387 1,264 South Korea 463 1,161 France 183 1,130 Brazil 22 967 Thailand 83 958 Greece

Page 45: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Universities Visiting / Visited byVirginia Tech Staff

Auburn CMU Columbia Drexel U. Florida Int’l Univ. Georgia Tech Hong Kong James Madison U. MIT (Nat’l Lib. Canada) Nat’l Univ. Singapore New York U. Ohio State U. Penn State Rice Univ. Rutgers Univ. San Jose State U. Stanford Univ. Tech. Univ. Portugal U. Alabama

U. Alabama Birmingham U. Arizona U. CA Berkeley U. CA Santa Barbara U. Central Florida U. Delaware U. Denver (+CSU,CU Boulder) UDLA (Puebla, Mexico) U. Florida U. Ill. Urbana Champaign U. Massachusetts Amherst U. Michigan U. NC Charlotte U. North Florida U. Pennsylvania U. Utah U. Waterloo Virginia Commonwealth U. William & Mary

Page 46: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Universities OfficiallyPart of NDLTD

U. Laval (Canada) U. Maine U. of New South Wales (AU) U. of South Florida U. of Tennessee, Knoxville U. of Tennessee, Memphis U. of Virginia U. Waterloo (Canada) U. Wisconsin - Madison Vanderbilt U. Virginia Tech West Virginia U. Wilfrid Laurier U. (Can.)

Clemson UniversityConcordia University (IL)Darmstadt U. of Tech. (GE)Florida Institute of Tech.Michigan TechNaval Postgraduate SchoolNorth Carolina State U.Rhodes U. (South Africa)Rochester Institute of Tech.University of FloridaUniversity of GeorgiaUniversity of Guelph (Can.)U. of Hawaii, Manoa

Plus: 1 in S. Korea, HQ of CIC, ...

Page 47: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

User Search Support

NDLTD W orld Federated Search

Virg in ia Tech ...(un iv )

U M I ...(corporate)

C IC ...(un iv group)

Portugese N L ...(national lib)

N AT O ...(reg ional)

UserInterface

Note: Above are illustrative, in some cases potential.

Page 48: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Interoperability Tests Planned Locally developed federated search IBM DL: donated equipment Z39.50: OCLC SiteSearch / VT tailored s/w

– university libraries w. catalogs of freely shared MARC records pointing to archival copies

– via URNs: handles & PURLs Dienst / NCSTRL - www.ncstrl.org: CS depts.,

DARPA, NSF, CNRI, Cornell - UVA is working on extensions for ETDs - Portugal is studying use for Europe - VT is working on Dienst to Z39.50 gateway

Page 49: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Access Approaches

Goal: Maximize access and services, e.g., by encouraging:

UMI centralized services Distributed service: Dienst, Z39.50 Regional services (e.g., CIC, MEL) Local servers with browse, search

– From local catalogs to local archives WWW robot indexing and search services

Page 50: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Why might your university want to be involved?

To improve graduate education / better prepare your students

To unlock university information To save money for students and for the

university / improve workflow To build an important digital library

supported by SURA, FIPSE

Page 51: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

How can your university get involved? (www.ndltd.org/join) Select planning/implementation team

– Graduate School– Library– Computing / Information Technology– Institutional Research / Educ. Tech.

Send us letter, give us contact names Adapt Virginia Tech solution

– Build interest and consensus– Start trial / allow optional submission

Page 52: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Contact Our Project Team

[email protected]

Phone CallVisit

Video Tape

Page 53: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Convene Local Planning Group

ETD

Page 54: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Join NDLTD: Get a CD-ROM

WE JOIN

Signed Letter

ETD CD-ROM

Page 55: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Build Your ETD Site

Digital Library

Policies

Inspection/Approval

Workshop/Training

ETD

ETD

Page 56: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Type 1 Members University Requires ETDs

Adobe Acrobat and/or SGML tools Automated submission & processing Archive/access through UMI,

(OCLC,) Virginia Tech, ... (Local) WWW site, publicity (Local) Assistance provided as

requested: email, phone, listserv(s)

Page 57: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Type 2 Members University Agrees to Require ETDs

Like Type 1 but set date not reached Usually has an option or pilot May: wait for new AY; start with all who enter

after; … Build grass roots support

– Advisory committee: representative? expert?– Champions to spread by word of mouth– Approval: Senates, Commissions, Deans, Students– Publicity to reach community

Page 58: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Types 3-7

3. Part of university requires ETDs

4. University allows ETDs

5. University investigating, has pilot

6. University consortium joins

7. Non-university organization joins

Page 59: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

CONCERNS,PROBLEMS,OPPOSITION

Page 60: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

How does this relate to UMI?

1987 UMI workshop to explore ETDs Support letter for US Dept. of Ed. proposal Steering & technical comm. membership Difference in focus: on education, theses ProQuest Direct pilot of scanning works started

1/1/97 Collaborating on:

– accepting electronic author submissions– standards (e.g., representation), research

Page 61: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

ETD Initiative (and UMI)

StudentsLearn aboutDL, EPub

TDsbecome more

expressive

N. Amer. (T)Ds areaccessible, archived

Global TDsbecome more

accessible,archived

UMI

Universities

Page 62: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Some Barriers at Universities

Lethargy; Not invented here (esp. large univ’s) Anger with unfunded, added, required work Last straw: using more frustrating technology Lack of experience in working together: graduate school,

library, computing staff Lack of interest in (quality of) student work More loyalty to discipline than campus Unwillingness to accept responsibility for growing

financial problems with libraries

Page 63: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

SOLUTIONS, IMPLEMENTATION

Page 64: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Level 0 Involvement RISK FREE - allow students

Adobe Acrobat in bookstore Submission allowed (e.g., J. Daniels) Archive/access through UMI, Virginia

Tech, ... (Local) WWW site, publicity (Local) Assistance provided as

requested: email, phone, listserv(s)

Page 65: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Level 1 Involvement = Level 0 + LOW COST - help & encourage students

Install our software, change practices in graduate school and library

Train students Build grass roots support

– Advisory committee: representative? expert?– Champions to spread by word of mouth– Approval: Senates, Commissions, Deans, Students– Publicity to reach community

Page 66: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Level 2 Involvement = Level 1 + EVENTUAL FULL INVOLVEMENT

Require electronic submission Have firm arrangement with local library, OCLC,

VT and/or UMI re archival services Share MARC records, with URNs pointing to

archive copy (Stock laboratories) (Run servers: search, URN) (Launch evaluation program)

Page 67: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Support Services Developed

CD/WWW site with > 300M: student guidelines, listservs, FAQs, press info, multimedia training materials

Automated submission system SGML DTD for ETDs, SGML to HTML (web

generator) Donations: Adobe, Microsoft Evaluation: instruments, analysis

Page 68: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Relationship with publishers

Concern of faculty and students that still wish to publish books or journal articles, voiced: campus, Chronicle, NPR, Times

Solution: Approval Form gives students, faculty choices on access, when to change access condition; use IPR controls in DL

Solution: by case, work with publishers and publisher associations to increase access– AAP, AAUP– AAAS, ACM, ACS, Elsevier, ...

Page 69: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Some responses from publishers

ACM: need to acknowledge copyright Elsevier: need to acknowledge copyright IEEE-CS: endorse initiative AAAS: Science wants first publication Textbook publishers: different market,

manuscript significantly reworked General: restricting access to local campus will

not cause any problems

Page 70: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Plans

Nurture federation -> summer workshop (now, in Memphis)

Increase # members -> lower barriers by supporting pilot efforts directly

R&D in federated fashion– USF: writing– UVA / U.Mich.: SGML– Portugal: national library requirements– Singapore: multilingual support with UNICODE

Page 71: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Summary

Sustainable, Scalable: started in 1987, growing, coupled with education

Open: everyone welcome, mutually agreed-upon standards, building international collaborative community

Content: valuable, high demand, aiming toward completeness

Usability: applying latest and future DL research so can easily submit, utilize

Page 72: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

NDLTD Future Work Working with publishers to increase level of access as

much as possible Interoperability tests among universities and with UMI

to provide integrated services Study with testbed that emerges, to improve

information retrieval, browsing, interface, and other types of user support

Evaluation, improving learning experience, spread to worldwide initiative, sustainable support and coordination

Page 73: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

CSTCNCSTRL, VT GrantsVision, Benefits, ApproachConcerns, ProblemsSolutions, Plans

Page 74: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

NCSTRL: CS TECHNICAL REPORTS

CS TR project supported by ARPA (Berkeley, CMU, Cornell, MIT, Stanford)

WATERS project for other departments led by ODU, SUNY Buffalo, UVA, VPI&SU

Merger summer 1995 to www.NCSTRL.org (Networked CS Tech Report Library)

Most large departments now have joined “Central” server: UVA, “backup”: VPI&SU 1998 extension to preprint service, with LANL

Page 75: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Virginia Tech GRANTS 1991-1993 ENVISION project funded by NSF 1993-1998 “Interactive Learning with a Digital Library

in CS” by NSF: http://ei.cs.vt.edu (10M accesses to over 45 courses)

1998-2000 “Computer Science Teaching Center” by NSF and ACM Education Board: http://ei.cs.vt.edu/~cstc/

1998-2000 “Curriculum Resources in Interactive Multimedia” by NSF : http://ei.cs.vt.edu/~crim/

Page 76: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

ENVISION

A User-Centered Database from the Computer Science Literature (1991-93)

Collected bib. data, converted to SGML Converted typesetter data to SGML Scanned thousands of page images MARIAN search engine (also applied to the Virginia

Tech library catalog) used as part of a prototype object-based DL, with tailored visualization interface (L. Nowell dissertation)

Page 77: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

NSF Education Innovation (EI)

NSF “Interactive Learning with a Digital Library in Computer Science” (1993-98)

45 online courses, 100+K accesses/wk, plus: DL courseware, overall EI project pages

Tools: SWAN (visualization), QUIZIT Evaluation

– traditional– network logging and analysis– tools for visualization

Page 78: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

QUIZIT (Lucio Tinoco)

SGML generates HTML & answer files MSQL supports records database Automated password request supported Password allows review of taken quizes Password allows selection of next quiz 4 types of questions are supported Feedback provided if questions are missed

Page 79: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

PAPERLESS COURSES

CS1604: Introduction to the Internet

CS3604: Professionalism in Computing

CS4624: Multimedia, Hypertext and Information Access (MHIA)

CS5604: Information Storage and Retrieval

CS6604: Digital Libraries

Page 80: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

EVALUATION: Log Analysis

WWW traffic logging, analysis, modeling, simulation, prediction (G. Abdulla)

Students with same grades learn by knowing or by knowing how to search

Regression predicts course’s traffic Modeling of hourly, daily, seasonal trends Understand users, plan future networks

Page 81: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

MHIA Workshops

ACM: SIGIR’96, MM’96, SIGIR’97-DL’97; IEEE CS: ICMCS’97, … (MM’98, Sept. in UK)

Aim: Curriculum Guidelines for MHIA Area New Programs, ex. Euro EI Masters Reusable WWW Knowledge Modules New Courses, ex. Hypertext, IR, Multimedia Add-ons to Existing Courses (Comparative

Languages, DBMS, HCI, Networking, OS)

Page 82: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Vision, Benefits, Approach Instead of building large, expensive multimedia packages,

that become obsolete and are difficult to re-use, concentrate on small knowledge units.

Learners benefit from having well-crafted modules that have been reviewed and tested.

Use digital libraries to build a powerful base of support for learners, upon which a variety of courses, self-study tutorials & reference resources can be built. (See NSF SMETE-Lib Study at http://www.dlib.org/smete/public/smete-public.html)

Page 83: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Concerns, Problems Motivating educators to create modules that can be

used elsewhere is difficult without a suitable reward structure and an infrastructure of testing, packaging, discovery, reuse, and evaluation.

There is a disconnect between researchers preparing exciting demonstrations for conferences and instructors interesting in helping students grasp underlying concepts and innovations in their area.

Page 84: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

Solutions, Plans CSTC will have a variety of focused centers so that

different types of resources can be collected, tested, and suitably packaged:– laboratory exercises, activities, assignments

– visualizations and visualization tools

– interactive multimedia resources (CRIM)

ACM has been approached to launch a digital library “Transactions in Courseware and Education in Computing” to provide an ongoing infrastructure for CSTC.

Page 85: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

CONCLUSIONS Digital libraries may provide powerful support for learners

if properly developed and supported by suitable, scalable, sustainable infrastructure.

NDLTD will have a dramatic impact on graduate education if institutions participate, which is a “win-win situation”.

CSTC and CRIM will help us explore how learning about computing can be enhanced by a large number of well-crafted modules that illustrate key concepts and can be “glued” together in a variety of fashions to suit local needs.

Page 86: Digital Libraries: The Networked Digital Library of Theses and Dissertations (NDLTD) and the Computer Science Teaching Center (CSTC) 3rd Computer Science

COLLABORATION ! NDLTD, CSTC, CRIM, NCSTRL, …

– Join and help build the collections

– Use the collections and their resources

– Help enhance the technology through R&D

Help solve key DL problems– Become a center (recall USF/English role in KDI)

– Connect with library/publishing world in Mexico

– Connect with preservation/dissemination in Mexico

– Work on testbed, system, or HCI aspects (DLI2)