Upload
stone-holden
View
23
Download
1
Tags:
Embed Size (px)
DESCRIPTION
An emergent system for the creation and dissemination of manuscript transcriptions. Fabio Carrera Stanley Selkow. Research Paper. Archives. Scholars. Transcriptions. Typical Transcription Process. Publishes. Requests. Receives. Uses. Creates. Transcription Assistant Project. - PowerPoint PPT Presentation
Citation preview
An emergent system for the An emergent system for the creation and dissemination ofcreation and dissemination of
manuscript transcriptions manuscript transcriptions
Fabio CarreraFabio Carrera
Stanley SelkowStanley Selkow
Typical Transcription ProcessTypical Transcription Process
Receives
Requests
Scholars
Research Paper
Publishes
Creates
UsesTranscriptions
Archives
Transcription Assistant ProjectTranscription Assistant Project
Makes manuscript images accessible on Makes manuscript images accessible on the webthe web
Facilitates transcription using these Facilitates transcription using these imagesimages
Allows sharing of transcriptionsAllows sharing of transcriptions
Makes searching for manuscripts and Makes searching for manuscripts and transcriptions possibletranscriptions possible
NormalNormal
JHS Maria JosephEl diuino caçador
Auto sacramental
ModernModern
JHS María JosepEl Divino CazadorAuto sacramental
MadisonMadison
Miguel de Cervantes Miguel de Cervantes
Digital LibraryDigital Library
{HD1. JHS Maria Joseph 1}{HD2. El diuinoCaçador}(^La fiera de losmontes){RUB. Auto sacramental}
XMLXML<hd1>
JHS Maria Joseph
</hd1>
<hd2>
El diuino Caçador
</hd2>
<hd2>
Auto sacramental
</hd2>
National Art Library LondonNational Art Library London
Text Encoding Initiative Text Encoding Initiative
(TEI)(TEI)<head>
JHS <name type="person">
<corr sic = “María”>Maria
</corr></name>
</head><l>
El <corr sic = “divino”>diuino
</corr> <corr sic = “cazador”> caçador</corr>
</l><l>
<del type=“overstrike”>la fiera de los montes></del>
</l><hd2>auto sacramental</hd2>
Library of the Royal Palace in SpainLibrary of the Royal Palace in Spain
Types of TranscriptionTypes of Transcription
Recent Digital Transcription Recent Digital Transcription ProjectsProjects
Improving on Digital Improving on Digital Transcriptions Transcriptions
Receives
Requests
Scholars
Research Paper
Publishes
Creates
Uses
TranscriptionsArchives Paid Transcribers
Hires Creates
TEI Expert
TEI€€Transparent
Digital
Digital
• Is not cost-efficient• Only projects of special importance done
Marie
Manuscript Markup Language Manuscript Markup Language (MML)(MML)TEI CompatibleTEI Compatible
Intuitive InterfaceIntuitive Interface
WYSIWYG OutputWYSIWYG Output
<box type="Image" corner1x="331" corner1y="132" corner2x="379" corner2y="173"> <del type=“overstrike”> la fiera de los montes </del> Fri-Dec-06-14_25_59-EST-2002_9.jpg</box>
<box type="Text" corner1x="134" corner1y="33" corner2x="196" corner2y="66" fontStyle="BOLD“ fontSize="24“ fontType="Monotype Corsiva">
JHS </box>
<del type=“overstrike”> la fiera de los montes </del>
Manuscript AccessibilityManuscript Accessibility
Metadata:Metadata:– Information used to catalogue and identify Information used to catalogue and identify
manuscripts manuscripts – Used as a means of sharing and searchingUsed as a means of sharing and searching
Currently supported manuscript metadata Currently supported manuscript metadata standards: standards: – ISAD(G) General International Standard Archival ISAD(G) General International Standard Archival
DescriptionDescription– Dublin Core Metadata InitiativeDublin Core Metadata Initiative
Goal: Eliminate barriers to resource sharingGoal: Eliminate barriers to resource sharing
Length
Physical Characteristics
DescriptionScope & Content
IdentifierReference Code
CreatorName of Creator
ISADG Standard Metadata
Transcription Assistant Standard Transcription Assistant Standard MetadataMetadata
Author
Catalogue Number
Content
Physical Characteristics
Transcription Assistant
Standard Metadata Dublin Core Standard Metadata
Width
Media Type Media Type
Width
Length
Format
Manuscript Image Format: XPGManuscript Image Format: XPG
Author
Title
TypeArchivistArchives
Electronic Image
XPG
Manuscript Metadata
Transcription Assistant
Server
XPG ContentsXPG Contents
… (Total of 65 elements)
Transcription Assistant Standard Metadata
Manuscript Image Format: XPGManuscript Image Format: XPG
Scholar
RequestsArchivistArchives
Image
Transcription Assistant
XPG
Manuscript Metadata
XPG
Manuscript Metadata
Archive Server
Creates Transcription
Transcription MetadataTranscription Metadata
Sends Appropriate XPG Image
Archive
Transcription Metadata
Manuscript Metadata
Scholar
Loads XPG ImageEnter Transcription MetadataResulting Output
MML File ContentsMML File Contents
Manuscript (i.e. image) Manuscript (i.e. image) metadata (inherited from metadata (inherited from XPG)XPG)
Manuscript image Manuscript image (inherited from XPG)(inherited from XPG)
Transcription metadata Transcription metadata (user defined)(user defined)
Transcription text Transcription text encodingencoding
MML File
<box type="Image" corner1x="331" corner1y="132" corner2x="379" corner2y="173"> <del type=“overstrike”> la fiera de los montes </del> Fri-Dec-06-14_25_59-EST-2002_9.jpg</box>
Author: John Doe
Organization: WPI
Transcription Name: 08121847TaxRecords
Transcription Description: Tax records
Author: Jane Clerk
Scope & Content: Tax records up to Aug 12, 1847
Physical Characteristics: Rip on bottom left portion.
Format: JPEG
Reference Code: WPIVen081214C36
MML
MML
MML
MML
MML
MML
Emergent SystemEmergent SystemArchive
Archive
MML
MML
MML
MML
MML
MML
MML
MML
MML
MML
MML
MML
MML
MML
MML
MML
MML
MML
MML
MML
MML
MML
MML
MML
MML
MML
MML
MML
MML
MML
MML
MML
MML
MML
MML
MML
MML
MML
MML
MML
MML
MML
MML
MML
MML
MML
MML
MML
Future GoalsFuture Goals
Implement automatic box drawing.Implement automatic box drawing.Implement handwriting recognition for word Implement handwriting recognition for word suggestion.suggestion.Client-Server system.Client-Server system.Create rating system for submitted Create rating system for submitted transcriptions.transcriptions.
ConclusionConclusion
The system should: The system should:
Accelerate the availability of transcriptionsAccelerate the availability of transcriptions
Improve the ability of historians to produce Improve the ability of historians to produce research materialresearch material
Provide global non-destructive access to Provide global non-destructive access to manuscripts otherwise available only in manuscripts otherwise available only in archives open to select scholarsarchives open to select scholars