Encoding the Document,Encoding the Process
Elena Pierazzo
Centre for Computing in the Humanities, King’s College London
Are Modern Manuscripts the same of Medieval Manuscripts?
They have different functions: private/note taking vs. “publication”, public circulation (most of the time)
They look different: unplanned vs. planned writing
Modern Ms. contain a mixture of text and other stuff which phenomenology is not always easy to re-conduct to medieval manuscripts (or so we thought)
They can be organised in sequences of drafts Does TEI provide enough support for
Modern manuscripts?
The beginning The idea was firstly raised on the TEI MS
SIG first meeting: Nancy 2003 The issue was arisen again with high
priority in the MS SIG Agenda in 2007P5 official release
Working group within MS SIG in November 2008 (London, TEI MM)
The Working Group
Fotis Jannidis (chair) Elena Pierazzo Malte Rehbein + Lou Burnard (Paolo D’Iorio)
& Gregor Middell Moritz Wissenbach
The Workshop
Sponsored by ALLC, ACH, TEI, Moore Institute
Paris, ENS – ITEM, 14-15 May 2009 Experts on Genetic Editions, not necessarily
on Digital Editions Invited experts: Anne Bohnenkamp, John
Bryant, Aurèle Crasson, Jean-Daniel Fekete, Daniel Ferrer, Hans Walter Gabler, Axel Gellhaus, Almuth Grésillon, Claus Huitfeldt, Dirk van Hulle, Jean-Louis Lebrave, Wolfgang Lukas, Kenneth M. Price, Kathryn Sutherland
Road map Officially presented at the TEI Members’
Meeting in Ann Arbor (Michigan) 2009 Beta test open, up to end of January 2010 February/March 2010: revision April 2010: submission to the TEI council as a
formal proposal Accepted in principle November 2010: Workshop and testing at TEI
Members’ Meeting in Zadar
Documents or Texts?
Documents and TextsYing and Yang, Body and Soul
In the editorial process it is impossible not to consider them both…
Genesis of what? Documents? (tears, cuts, patches, stitches…) Texts? (corrections, revisions, additions,
deletions, transpositions?)
Both, of course!
Three parts Documents Encoding Transcription Enhancement Genetic encoding
TEI provides a way to encode TEXTS, but sometimes
The process to make the document is at least as important as the text that it contains
The text can be determined by the document The text is graphically presented There is no text: non-linear texts are non-texts
(Ferrer)
Where is the text?
Are Modern Manuscript the same of Medieval Manuscript?
Not really…
<ge:document> a sibling of <text> and <facsimile> represents the physical structure of a
document, in terms of written <surface>s, and <zone>s of writing.
a new <ge:line> element is needed to represent topographic lines
a new <ge:patch> element is needed to represent fragmentary surfaces
<text> <body> <div type="diary-entry"> <dateline> <date value="2009-04-01"> 1 April 2009 </date> </dateline> <p><lb/>Fed Birds in the park today.<lb/> Might write an article about <lb/> the Thick-billed Warbler. </p> </div> <div type="note" rend="rotated"> <p><lb/>Samaria is a Greek <lb/> brand of water that <lb/> comes from thenatural <lb/> springs of Stilos, in <lb/> Crete</p> </div> </body></text>
<ge:document> <surface ulx="0" uly="0" lrx="200" lry="300"> <zone ulx="10" uly="43" lrx="185" lry="84" rotate="0"> <zone> <ge:line rend="right"> 1 April 2009 </ge:line> </zone> <ge:line>Fed Birds in the park today.</ge:line> <ge:line>Might write an article about </ge:line> <ge:line>the Thick-billed Warbler. </ge:line> </zone> <zone ulx="9" uly="20" lrx="70" lry="60" rotate="90"> <ge:line>Samaria is a Greek </ge:line> <ge:line>brand of water that</ge:line> <ge:line>comes from the natural</ge:line> <ge:line>springs of Stilos, in </ge:line> <ge:line>Crete </ge:line> </zone> </surface></ge:document>
Can we combine the two views? Sort of…
<ge:document> <surface ulx="0" uly="0" lrx="200" lry="300"> <zone stage="#stage1" seq="0" ulx="10" uly="43" lrx="185" lry="84"> <zone> <milestone unit="date" spanTo="#endDate"/>1 April 2009 <anchor xml:id="endDate"/> </zone> <milestone unit="p" spanTo="#p2"/> <ge:line>Fed Birds in the park today.</ge:line> <ge:line> Might write an article about </ge:line> <ge:line>the Thick-billed Warbler.</ge:line> </zone> <zone stage="#stage2" ulx="9" uly="20" lrx="70" lry="60" rotate="90"> <milestone unit="p" xml:id="p2" spanTo="#end"/> <ge:line>Samaria is a Greek</ge:line> <ge:line>brand of water that</ge:line> <ge:line>comes from the natural</ge:line> <ge:line>springs of Stilos, in</ge:line> <ge:line>Crete</ge:line> <anchor xml:id="end"/> </zone> </surface></ge:document>
The content of <zone> <ge:line> <table> Text!
Patches
<surface> <zone> <ge:line>Poem</ge:line> <ge:line>As in Visions of — at</ge:line> <ge:line>night —</ge:line> <ge:line>All sorts of fancies running through</ge:line> <ge:line>the head</ge:line> </zone> <ge:patch type="newsprint" binder="glue" height="40" width="90"> Spring has just set in here, and the weather.... a steamer <zone> <ge:metaMark function="sequence">2</ge:metaMark> </zone></ge:patch> <ge:patch type="newsprint" binder="glue" height="35" width="90"> "The shores on either side of the Sound are... The In- <zone> <ge:metaMark function="sequence">3</ge:metaMark> </zone></ge:patch></surface>
Exercise 1: Jane Austen Go to www.janeausten.ac.uk Select Manuscripts Select Volume the First – Text From the dropdown menu select page 1
New Transcriptional Features <ge:metamark> : markup already present in
the document <ge:rewrite> : clarification or fixation <ge:used>: marked as used elsewhere <ge:undo>: a cancelled revision <ge:redo> <ge:transpose> & <ge:transposeGrp> <ge:mod>: generic element for modifications
Metamark
<del><ge:metaMark function="flag" targets="#s1">lege</ge:metaMark><s xml:id="s1" >Ock en schullen de bruwere des hilgen dages nicht over <lb/>setten noch uppe den stillen fridach bruwen.</s> <add><s>Noch nymande<lb/>over setten, se en sehin denne erst, dat uppe den bonen<lb/>neyn stro noch, huw noch flaß ligghe, by pine eyner halven<lb/>roden, deme bruwere so wol alse dem bruwheren to murende.</s></add></del>
Clarification
<ge:line>...Sku<ge:rewrite cause="unclear">l<ge:rewrite>dren</ge:line>
Fixation
<ge:rewrite cause="fix" hand=”#ja2" stage="#s1">Now, if we could get <del stage=“1">but</del> a young Heiress</ge:rewrite>
Used<surface>
<ge:used rend="cross" spanTo="#X2"/> <zone> <ge:line rend="underline">The Poet</ge:line> <ge:line><del rend="strikethrough">I think</del> His sight is the</ge:line> <ge:line> sight of the ? and</ge:line> <ge:line>has sent the instinct of the</ge:line> <ge:line>? dog</ge:line> </zone>
<zone> <ge:line>I think <ge:rewrite>ten</ge:rewrite> million</ge:line> <!-- ... --> <ge:line>well; those <subst> <del rend="strikethrough">supple-fingered gods</del> <add>journeymen divine.</add> </subst></ge:line> <anchor xml:id="X2"/> </zone></surface>
Undo (and instant correction)
<ge:line> che e’ <hi rend="underline"><ge:undo spanTo="#x2"/>si <anchor xml:id="x2"/> rechi a’</hi> <del rend="overstrike” instant="true">dotti</del> <hi rend="underline">denti</hi> l’un d’essi cibi</ge:line>
Transposition
<ge:line><seg xml:id="ib01">bör</seg><ge:metaMark rend="underline" function="transposition” targets="#ib1" place="above">2.</ge:metaMark> og <seg xml:id="ib02" >hör</seg><ge:metaMark rend="underline" function="transposition" targets="#ib02" place="above">1.</ge:metaMark></ge:line> <ge:transposeGrp> <ge:transpose> <ptr target="#ib02"/> <ptr target="#ib01"/> </ge:transpose> </ge:transposeGrp>
A generic modification
Semantic Documentary
<ge:line>I should <subst> <del>soon</del> <add>shortly</add></subst>do</ge:line>
<ge:line>I should <ge:mod rend="strikethrough">do</ge:mod><ge:mod rend="interlinear” >soon</ge:mod>shortly do</ge:line>
Support on existing markup
Alternative readings
<zone> <ge:line>Alone <seg type="alternative" xml:id="alt1">before</seg> <add place="above" type="alternative" xml:id="alt2">beside</add> his native river —</ge:line> <alt targets="#alt1 #alt2" mode="excl" weights="0 1"/></zone>
Exercise 2: The Blake Archive Go to www.blakearchive.org Enter the archive Go to Text Search and search for Devil Choose first item Songs of Innocence and
of Experience Then first item again Then object 39 Open the image as well
Genesis Genesis within the same document Genesis across documents The passage from correction within a
document to another document can happen at anytime and it is not a one way process
Within the same Doc: Stages and Revision Campaigns
<profileDesc> <creation> <ge:stageNotes ordered="true"> <ge:stageNote xml:id="ST-1">First stage, written in ink by a scribe</ge:stageNote> <ge:stageNote xml:id="ST-2">Second stage, written in Goethe's hand using pencil</ge:stageNote> <ge:stageNote xml:id="ST-3">Fixation of the revised passages and further revisions by Goethe using ink</ge:stageNote> <ge:stageNote xml:id="ST-4">Addition of another stanza in a different hand, probably at a later stage</ge:stageNote> </ge:stageNotes> </creation></profileDesc>
Transposition (with stages)
<ge:line><seg xml:id="ib01">bör</seg><ge:metaMark rend="underline" function="transposition” targets="#ib1" place="above” stage=“#ST2”>2.</ge:metaMark> og <seg xml:id="ib02" >hör</seg><ge:metaMark rend="underline" function="transposition" targets="#ib02" place="above” stage=“#ST2”>1.</ge:metaMark></ge:line> <ge:transposeGrp> <ge:transpose> <ptr target="#ib02"/> <ptr target="#ib01"/> </ge:transpose> </ge:transposeGrp>
Genesis across docs<profileDesc> <creation> <date notAfter="1816-07-18"/> <ge:stageNotes ordered="true"> <ge:stageNote xml:id="mod1" when="1816-07-16">The first draft of <title>Persuasion</title> is completed by the <date>July 16 1816</date> written after the word <q>Finis</q> at <ref target="#pers-30">page 30</ref>.</ge:stageNote> <ge:stageNote xml:id="mod2" notBefore="1816-07-16">After the <date>16th of July</date> Austen starts revision of the two final chapters, by rewriting the end and adding a new block (<ref target="#transp-1">pages 32-35</ref>) to be inserted at <ref target="#insertion-p1">page 19</ref>. This stage is documented by the deletion of the date (<date>July 16 1816</date>) at <ref target="#pers-30">page 30</ref>, and the addition of more text and of a new date (<date>July 18. 1816</date>) at <ref target="#pers-31">page 31</ref></ge:stageNote> <ge:stageNote notBefore="1816-07-18” xml:id=“mod3”>Before publication, after <date>July 18th, 1816</date> chapters 10-11 were broken into three chapters, 10, 11, 12, as witnessed by the print.</ge:stageNote> </ge:stageNotes> </creation></profileDesc>
Let’s start by grouping!
<teiCorpus> <teiHeader> <!-- information about the dossier --> </teiHeader> <xi:include href="document1.xml"/> <xi:include href="document2.xml"/> <xi:include href="document3.xml"/></teiCorpus>
Then let’s organise our group
<graph type="directed"> <node xml:id="A" value="http://edition.net/witness/A"> <label>A</label> </node> <node xml:id="B" value="http://edition.net/witness/B"> <label>B</label> </node> <!-- more nodes --> <arc xml:id="AC" from="#A" to="#C" value="http://edition.net/genetic/analysis#ac"/> <arc xml:id="BC" from="#B" to="#C" value="http://edition.net/genetic/analysis#bc"/> <!-- more arcs --></graph>
Genesis of documents
<ge:document> <surface ulx="0" uly="0" lrx="200" lry="300"> <zone>...</zone> </surface> <damageSpan spanTo="#p3"/> <gap extent="1" unit="folio"> <desc>Stub of a missing folio</desc> </gap> <surface ulx="0" uly="0" lrx="200" lry="300" xml:id="p3"> <zone>...</zone> </surface></ge:document>
http://www.janeausten.ac.uk/facsimile/blvolsecond/65-66.html
Exercise 3: Whitman archive Go to www.whitmanarchive.org Select Manuscripts > Poetry Manuscripts Select ninth item: America Old-World Bards
For more Info
http://www.tei-c.org/SIG/Manuscripts/genetic.html