
Processing e-literature at CERN Corrado Pettenati Mick Draper 20 March 2000

Processing electronic literature: CERN case study

C. Pettenati (ETT-SI) M. Draper (ETT-DH)

CERN

Presentation plan (1)

The CERN Library
Definitions
Grey literature management
  Current services
  CERN grey literature collection
  Submission & Acquisition services
  Consultation & Dissemination services

Presentation plan (2)

Tools available to the readers
Future perspectives for grey literature at CERN
Architecture
  Hardware configuration
  Software architecture
  Re-usability

CERN - European Organization for Nuclear Research

European Laboratory for Particle Physics
Fundamental research
Founded in 1954 in Geneva, Switzerland
20 member states
540 universities and laboratories, 7000 researchers, 90 nationalities
5 accelerators, more than 1000 experiments and collaborations
Current year budget: 939 MCHF

The CERN Library

A central unit and four satellites
Few monographs, fewer than 40,000
500 open subscriptions to scientific journals
  400 titles available electronically in full text
A very important collection of grey literature: more than 350,000 documents (full text available electronically from February 1994 onwards)

Definitions (1)

The CERN grey literature collection is composed of:
  Documents prepared to be submitted to scientific journals
  Documents submitted to conferences
  Theses
  CERN internal notes (Committee papers, Proposals)
  External reports
  Pictures (photos & diagrams)
  Videotapes on academic training (partly “webcasted”)
  Administrative Documents (separate protected access)
  CERN internal publication (weekly bulletin)

Definitions (2)

Open Archive
  A submission mechanism
  A long term storage system
  A management policy for submission and preservation
  An open interface to let third parties collect data from the archive

The CERN Preprint Server was an Open Archive long before this definition was drawn up last year in Santa Fe (see http://www.openarchives.org)

Number of accesses to the CERN Library catalogue

Accesses per day to the catalogue          25,000
Accesses per day to the full-text server   16,000
Documents downloaded per day                2,000
Pictures downloaded per day                   450
Ratio of internal vs. external use          20:80

CERN Library collections

[Chart. Categories: Books, Journals, Archives, Preprints]

Grey literature acquisition procedures

Direct electronic submissions
  Official series
  Open series
  Theses
Downloading from other grey literature servers
  Los Alamos, DESY, SLAC, Fermilab, etc.
  Email based application: the Uploader
Digitization of paper documents
Exchange with other labs (Annual reports)
Harmonization of the record description

E-Submission

Web Submission options:
  Bibliographic Notice Input/Update
  Fulltext document Transfer or Link (TeX, Word, PDF, HTML)
  Revised version Transfer
  Alert an e-mail distribution list
  Forward to Printshop and Mail Office
  Ask for approval (internal & scientific notes)

Provenance

More than 40,000 documents processed per year
  Internal to CERN: 10%
  External: 90%

Documents prepared for publication: Preprints

They are sent to the CERN Library and at the same time submitted to the publisher of a scientific journal
They are distributed via the Library Web server the day after submission
In general they are published much later, after 8-24 months

Preprints processing procedure

[Diagram. Steps shown: Submission to the Library (record, text, figures); Submission to a journal; Visibility on Internet the day after; Weekly list preparation; Article publication; Input of the publication note; Record updating. Other labels: INSPEC, conference proceedings, SLAC db, authors, ... Delays shown: 1 day; 1 week; 8 - 18 months; "????".]

SYSNO 0251285
BA 11
LN eng
YR 1997
SW $$s n $$w 9725
ER hep-th/9706105
EM [email protected]
TI Noncommutative Yang-Mills and Noncommutative Relativity: A Bridge Over Trouble Water
AU2 Carminati, L
AU Iochum, B
AU Schucker, T
AF Marseille
IM $$d 16 Jun 1997 $$c 29 p
EXT $$x http://preprints.cern.ch/cgi-bin/setlink?base=preprint&categ=hep-th&id=9706105 $$n Access to fulltext document
RN CPT-96-P-3477
SU AD PARTICLE PHYSICS - THEORY
SU1 AD
AB Connes’ view at Yang-Mills theories is reviewed with special emphasis on the gauge invariant scalar product. This landscape is shown to contain Chamseddine and Connes’ noncommutative extension of
AB $$9 general relativity restricted to flat space-time, if the top mass is between 172 and 204 GeV. Then the Higgs mass is between 188 and 201 GeV.
YAU Carminati, Lionel
YAU Iochum, Bruno
YAU Schucker, Thomas
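A record in this tagged layout can be read mechanically. The sketch below is only an illustration: it assumes one field per line, a tag followed by a space and a value, and "$$x" markers introducing one-letter subfields, as the sample record suggests. The function names are hypothetical and this is not the Aleph API.

```python
import re
from collections import defaultdict


def parse_record(text):
    """Parse 'TAG value' lines into {tag: [values]}; repeated tags accumulate."""
    fields = defaultdict(list)
    for line in text.strip().splitlines():
        tag, _, value = line.strip().partition(" ")
        if tag:
            fields[tag].append(value.strip())
    return dict(fields)


def split_subfields(value):
    """Split '$$d ... $$c ...' into {code: content}; a plain value gives {}."""
    parts = re.split(r"\$\$(\w)\s+", value)
    # parts[0] is the text before the first marker; then code/content pairs.
    return {code: content.strip()
            for code, content in zip(parts[1::2], parts[2::2])}


# A few fields copied from the sample record (layout assumed: one per line).
SAMPLE = """\
SYSNO 0251285
YR 1997
ER hep-th/9706105
AU Iochum, B
AU Schucker, T
IM $$d 16 Jun 1997 $$c 29 p
"""

record = parse_record(SAMPLE)
imprint = split_subfields(record["IM"][0])
```

Keeping every tag as a list means repeated fields such as AU need no special case.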

Title : Noncommutative Yang-Mills and Noncommutative Relativity: A Bridge Over Trouble Water
Author : Carminati, L
Imprint : 16 Jun 1997. - 29 p.
Server : hep-th/9706105
Report no : CPT-96-P-3477
Abstract : Connes’ view at Yang-Mills theories is reviewed with special emphasis on the gauge invariant scalar product. This landscape is shown to contain Chamseddine and Connes’ noncommutative extension of
URL address : http://preprints.cern.ch/cgi-bin/setlink?base=preprint&categ=hep-th&id=9706105
Add.author : Iochum, B; Schucker, T

Lib Coll Shelf number Vol. text Category
C SVR hep-th/9706105 000 08

Preprint CPT-96-P-3477 (hep-th/9706105)

Noncommutative Yang-Mills and Noncommutative Relativity: A Bridge Over Trouble Water

by Carminati, L ; Iochum, B ; Schucker, T ;

Email: [email protected]
Affiliations: Marseille
Imprint: 16 Jun 1997 29 p
Subjects: PARTICLE PHYSICS - THEORY

Abstract: Connes’ view at Yang-Mills theories is reviewed with special emphasis on the gauge invariant scalar product. This landscape is shown to contain Chamseddine and Connes’ noncommutative extension of general relativity restricted to flat space-time, if the top mass is between 172 and 204 GeV. Then the Higgs mass is between 188 and 201 GeV.

full text

[Diagram. Labels: Record no: xyz; EXT: URL ...; Preprint full-text server]

Reprint (published preprint) (cond-mat/9711001)

Anomalous Behavior at a Superconducting Quantum Critical Point

by Ramazashvili, R ;

Email: Revaz Ramazashvili [email protected]
Publ. Ref.: Phys. Rev., B : 56 (1997) 5518 - Published version -
Imprint: 4 Nov 1997 4 p
Subjects: CONDENSED MATTER

Abstract: Motivated by pressure experiments on UBe_{13} and Tl_{2}Ba_{2}CaCu_{2}O_8, we discuss low-temperature effects of the pairing interaction at a superconducting quantum critical point in a clean system. We point out that measurements at this quantum critical point can provide a diagnostic tool to mark out non-BCS mechanisms of superconductivity.

Access to fulltext document - Show references - Quoted by (experimental) - Mark document -

Home | Comments | Help | Show Shelf | Format Output | Check Loans | View Searches

[Callouts on the screenshot: Access to the preprint full text; Access to the published text]

[Diagram. Labels: Record # 123; EXT: URL ...; CERN Preprint full-text server; Pub. note Tit. AA, vol. pp ...; CERN algorithm; Electronic journal, Publisher server; URL: .....; Author: .......; Title ....]

Document formats

Accepted: TeX/LaTeX, Word, TIFF, HTML, ...
Distributed: PDF, PS, HTML, TIFF, GIF, ...

Formats elaborated by the electronic submission

Conversion from TeX/LaTeX to PS
Conversion from Word to PS
Conversion from PS to PDF
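The three conversions form a small graph, so the chain needed for a given submission can be derived instead of hard-coded. The sketch below is a minimal illustration under that assumption: the lowercase format keys and the function name are chosen here, and the actual converter programs, which the slides do not name, are left out.

```python
from collections import deque

# Direct conversions listed on the slide: source format -> target formats.
CONVERSIONS = {
    "tex": ["ps"],
    "word": ["ps"],
    "ps": ["pdf"],
}


def conversion_chain(source, target):
    """Breadth-first search for the shortest format chain from source to target."""
    queue = deque([[source]])
    seen = {source}
    while queue:
        path = queue.popleft()
        if path[-1] == target:
            return path
        for nxt in CONVERSIONS.get(path[-1], []):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(path + [nxt])
    return None  # no chain of known conversions reaches the target


tex_to_pdf = conversion_chain("tex", "pdf")
```

Adding a new accepted format then only requires a new entry in the table.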

Text transmission

FTP by the author him/herself
FTP requested by the CERN Document Server
Automatic transfer from a Web server

Citations management

The document PS format is analysed and citations are automatically extracted
If the cited document is also in the CERN database a link is inserted next to the citation
The citations cannot always be safely processed automatically
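The linking step can be sketched as follows. This is an illustration only, not the CERN implementation: it assumes the citations have already been extracted as plain strings, recognises only e-print numbers of the form hep-th/9706105, and uses a dictionary as a stand-in for the database lookup. Citations that cannot be matched keep no link, which reflects the point that automatic processing is not always safe.

```python
import re

# E-print numbers such as hep-th/9706105 or cond-mat/9711001.
EPRINT_ID = re.compile(r"\b([a-z]+(?:-[a-z]+)?/\d{7})\b")


def link_citations(citations, catalogue):
    """Pair each citation with a full-text URL when its e-print number is known.

    `catalogue` maps e-print numbers to URLs (a stand-in for the database).
    """
    linked = []
    for text in citations:
        match = EPRINT_ID.search(text)
        url = catalogue.get(match.group(1)) if match else None
        linked.append((text, url))
    return linked


CATALOGUE = {
    "hep-th/9706105":
        "http://preprints.cern.ch/cgi-bin/setlink?base=preprint&categ=hep-th&id=9706105",
}

CITATIONS = [
    "Carminati, L ; Iochum, B ; Schucker, T, hep-th/9706105",
    "Ramazashvili, R, cond-mat/9711001",   # valid number, but not in CATALOGUE
    "private communication",               # nothing to match
]

linked = link_citations(CITATIONS, CATALOGUE)
```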

Documents submitted to conferences

In general they are prepared at the last minute …
Often the submission to the Library is forgotten
These documents are published later
  On the conference server or
  As printed conference proceedings
    As independent monograph or
    Included in a specialized journal
Hard and intensive work to discover them

Annual reports

In general received as exchange
More and more often available electronically
Now processed as periodicals
  One record, several issues
  Automatic claiming
  Link to a new title if required

Theses

Degree and post-graduate
Prepared
  On CERN equipment and/or
  Under CERN staff supervision
In general defended 12-18 months later
Difficult to retrace

Preprints electronic submission

[Diagram. Labels: PC, MAC, X; CERN LAN; FTP; TEX, LATEX, WORD, HTML, ...; TIFF; ILAS; Aleph; Full Text server]

Preprints distribution

[Diagram. Labels: CERN grey literature; ILAS; Aleph; Full Text server; GIF, TIFF, PS, PDF, HTML; HTTP; CERN LAN; Internet; PC, MAC, X; MAC]

Architecture: software/hardware

[Diagram.
Software labels: CDS (CERN Document Server); EDS (Electronic document submission); Submission + Services; WEBLIB interface; WWW; QUERY ACCESS; DOCUMENT; Link Manager; Aleph Metadata database; Configuration DB (EDS); Configuration DB (WebLib).
Implementation labels: MySQL Database, PHP/Perl scripting (shown twice); Aleph APIs, C programming; (CGI) Java interface; C programming.
Hardware labels: SUN SPARC 450, 4 CPUs 250 MHz, 80 GigaBytes; SUN SPARC 450, 3 CPUs 300 MHz (ORACLE DB).]

Re-usability

Complete system
Modular: parts can be re-used
Software: all sources are freely distributed
Databases
  Aleph integrated system: commercial (Oracle based)
  MySQL databases: freeware
Existing configuration tools
New functions easy to attach

Tools available to the end users

Need to involve the readers directly in the search
Four groups of tools to:
  Search
  Access
  Transfer
  Manage

Consultation & Dissemination (1)

Graphical User Interface: WebLib
  All catalogues with “Find” and “Browse”
  Available indexes on authors, titles, subjects, report numbers, etc.
  Words searchable on all fields (including abstracts)
  Output sort options
  Record metadata available in HTML, LaTeX or PDF
  Navigation & Search can be set up by institute, year, subject, etc.
  Search history available
  Downloading mechanism for many formats (PS, PDF, GIF, etc.)
  Linking capabilities for book records to booksellers' records

Consultation & Dissemination (2)

Personal Virtual Library
  Results displayed in various formats (brief, detailed or personal)
  Individual Alert mechanism (SDI) to e-mail new records
  Personal shelf (basket) to keep searches, items, formats & profiles
E-prints
  Record description is updated with the publication notes (Journal title + vol/year + starting page number)
  Dynamic linking from the notice to the published article
  Dynamic linking from the citations of the document to the article
  Availability of the link to users with a subscription to the e-journal
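Because the publication note carries journal title, volume, year and starting page, a link to the published article can in principle be computed from it. The sketch below assumes notes shaped like the "Phys. Rev., B : 56 (1997) 5518" example shown earlier. The URL template and the host publisher.example are hypothetical placeholders, not a real publisher address scheme and not the CERN algorithm.

```python
import re

# Matches notes such as "Phys. Rev., B : 56 (1997) 5518".
PUB_NOTE = re.compile(
    r"^(?P<journal>.+?)\s*:\s*(?P<volume>\d+)\s*"
    r"\((?P<year>\d{4})\)\s*(?P<page>\d+)"
)

# Hypothetical per-journal URL templates (assumption for illustration only).
LINK_TEMPLATES = {
    "Phys. Rev., B": "https://publisher.example/prb/{volume}/{page}",
}


def parse_pub_note(note):
    """Return journal, volume, year and starting page, or None if unparseable."""
    match = PUB_NOTE.match(note.strip())
    return match.groupdict() if match else None


def published_link(note):
    """Build a link to the published article when a template is known."""
    parts = parse_pub_note(note)
    if parts is None:
        return None
    template = LINK_TEMPLATES.get(parts["journal"])
    return template.format(**parts) if template else None


link = published_link("Phys. Rev., B : 56 (1997) 5518")
```

Computing the link at display time, instead of storing it, means a publisher changing its address scheme only requires a template update.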

Document access tools

Web
Z39.50 client/server
Different formats (PDF, PS, TIFF, GIF, HTML, …)
Document size continuously increasing
Strong need for increased bandwidth

Usage measurement

Statistics collection
  By country
  By IP domain
  By IP number
  By type of format
  By slice of time
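The five breakdowns can be sketched as counters over a download log. Everything about the log is an assumption made for illustration: the entry fields (host, ip, file, time), the sample hosts and addresses, the use of the top-level domain as a rough proxy for country, and the hour as the time slice.

```python
import os
from collections import Counter
from datetime import datetime


def usage_statistics(entries):
    """Count downloads by country, IP domain, IP number, format and hour."""
    stats = {key: Counter()
             for key in ("country", "domain", "ip", "format", "hour")}
    for entry in entries:
        labels = entry["host"].lower().split(".")
        stats["country"][labels[-1]] += 1            # top-level domain as proxy
        stats["domain"][".".join(labels[-2:])] += 1
        stats["ip"][entry["ip"]] += 1
        extension = os.path.splitext(entry["file"])[1].lstrip(".").lower()
        stats["format"][extension] += 1
        stats["hour"][entry["time"].hour] += 1
    return stats


# Hypothetical log entries (addresses taken from documentation ranges).
LOG = [
    {"host": "pc1.cern.ch", "ip": "192.0.2.10", "file": "9706105.pdf",
     "time": datetime(2000, 3, 20, 9, 15)},
    {"host": "pc2.cern.ch", "ip": "192.0.2.11", "file": "9706105.ps",
     "time": datetime(2000, 3, 20, 9, 40)},
    {"host": "lib.example.fr", "ip": "198.51.100.7", "file": "9711001.pdf",
     "time": datetime(2000, 3, 20, 14, 5)},
]

stats = usage_statistics(LOG)
```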

How to prepare a virtual library

The final goal is to provide the end reader with a complete toolbox to search, find, reach, download, use and manage the documents
There are no universal recipes
The CERN Library tries to find its own balance between traditional and electronic literature

Basic components of the CERN virtual (digital) library

An integrated library automation system
A graphical user interface
A network with enough bandwidth
A CD-ROM LAN
An electronic document delivery tool
A collection of external electronic resources
  Electronic journals
  Grey literature servers
Use of the protocols HTTP, Z39.50 (SR-U)

Future of grey literature at CERN

Usage of the XML format
More intensive distribution before publication
Preparation of metadata done directly by the author
Use of specialized network search engine

[Diagram. Labels: Author; Article and DC (Dublin Core) metadata; webmaster; website; convert DC metadata; Network Search Service]
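One way to picture author-prepared Dublin Core metadata is as HTML meta tags generated from a bibliographic record. The sketch below is an assumption-laden illustration: the mapping from the record tags seen earlier (TI, AU, YR, LN, ER) to Dublin Core elements is chosen here and does not come from the slides, and the function name is hypothetical. The element names title, creator, date, language and identifier are standard Dublin Core elements.

```python
from html import escape

# Assumed mapping from catalogue tags to Dublin Core elements.
DC_MAP = {
    "TI": "DC.title",
    "AU": "DC.creator",
    "YR": "DC.date",
    "LN": "DC.language",
    "ER": "DC.identifier",
}


def dublin_core_meta(record):
    """Render {tag: [values]} as a list of HTML <meta> lines, one per value."""
    lines = []
    for tag, element in DC_MAP.items():
        for value in record.get(tag, []):
            lines.append('<meta name="{}" content="{}">'.format(
                element, escape(value, quote=True)))
    return lines


RECORD = {
    "TI": ["Noncommutative Yang-Mills and Noncommutative Relativity: "
           "A Bridge Over Trouble Water"],
    "AU": ["Iochum, B", "Schucker, T"],
    "YR": ["1997"],
    "ER": ["hep-th/9706105"],
}

meta_lines = dublin_core_meta(RECORD)
```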

Conclusion

Increasingly important role for the grey literature
Contraction of the number of traditional scientific publications
Exponential growth of spontaneous electronic journals

QUESTIONS?