Upload
cynthia-lane
View
216
Download
0
Embed Size (px)
Citation preview
DL-Application for the DL-Application for the University Archive JenaUniversity Archive Jena
Ulrike Krönert, Mathias Hegner
FSU Jena
OverviewOverview
Project GoalsCreation and Use of the ArchiveArchive Data ModelArchive Class LibraryCurrent State of the ProjectOutlook
Project GoalsProject Goals
Archive is part of UrMELPresentation of all FSU archive files in the
internet ?Digitization of files for the Weimar classics
foundationWorkflow test in 2000, September-
NovemberWrite software for creating, exploiting, and
searching the archive
Creation and Use of the ArchiveCreation and Use of the Archive
Digitization of (selected) files Description of the files by meta-data, files
become searchable Scientific exploitation (1): Creating
documents Scientific exploitation (2): Creating
dossiers on themes
Recording the ArchiveRecording the Archive
36,000,000 pages, 600 dpi, >100 years– 360 Tbyte of lossless compressed data– 12 Tbytes of highly compressed data
Digitization and microfilming in one step Description of the file by hand Automated page loader– <holding name><file number><page
number>.<extension>
Exploiting the Archive in Two Exploiting the Archive in Two StepsSteps Step 1: Describing documents, select
pages into documents Results: File as a heap of pages becomes
a folder of documents Step 2: Summarizing documents, single
pages or even whole files Results: Dossiers on selected themes
Data ModelData Model
Files containing documentsDocuments containing pagesPages containing the images (parts)Dossiers containing files, documents,
pagesGeneralized text search index for files,
documents, dossiersNote part for each object
File Data ModelFile Data Model
Necessary attributes: archive name, holding name, file number
Additional attributes: origin, period, size
Parts: file title, keywords, comprise note, remark
Text index for all attributes and partsFolder: documents in file
Document Data ModelDocument Data Model
Attributes: page numbers, date, document type, author
Parts: reference, remark, co-authorsText index for all attributes and partsFolder: pages in documentHelp text index containing file pid
(for faster search)
Page and Dossier Data ModelsPage and Dossier Data Models
Page– Page number as an attribute– JPEG-, TIFF- parts– Help text index: document pid
Dossier– Dossier title as an attribute– Text index containing dossier title– Folder: objects in dossier
Class Library for ArchiveClass Library for Archive
Archive objectObject collection (*)Browser session (*)Servlet (FsuArchiv)Help classes: access rights, archive
user
Archive ObjectsArchive Objects
Object type (file, document, page, dossier)
Constructors for persistent and new objects, Destructor
open/ close, add/ deleteget/ set attributes/ partsisFolder, hasParent, getIndexClassget parent/ items/ dossiers
Object CollectionsObject Collections
add object(s)delete object(s)sort objects (e.g. by page numbers)manage collection pointerget archive object(s)
Browser SessionsBrowser Sessions
User managing by DL (*)login/ auto logout (after a deadtime)manage access rightsget/ set session properties, e.g.
session numbermanage object collections in a
collections stack
Servlet Class (FsuArchiv)Servlet Class (FsuArchiv)
query parametric/ text/ combinedshow object attributes and meta-
partsupdate object/ create new partsselect digitized page into document add to/ remove from dossierdelete object
OutlookOutlook
using SSL, encrypting password etc.fine-grained access rights for files
and dossiersmanage payment for archive usemulti-medial layout with animations,
images etc. Ergonomics of the HTML pages