26
MLW Pisa 2011 G.Brelstaff & F.Chessa 1 Interactive alignment of Parallel Texts – a cross browser experience Gavin Brelstaff (gjb@ crs4.it) CRS4 09010 Pula (CA) – Sardinia, Italy Francesca Chessa University of Sassari, Italy Multilingual Web Workshop Pisa April 2011 (standards in practice)

Interactive alignment of Parallel Texts – a cross browser

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Interactive alignment of Parallel Texts – a cross browser

MLW Pisa 2011 G.Brelstaff & F.Chessa 1

Interactive alignment of Parallel Texts – a cross browser experience

Gavin Brelstaff (gjb@ crs4.it)CRS4 09010 Pula (CA) – Sardinia, Italy

Francesca ChessaUniversity of Sassari, Italy

Multilingual Web Workshop PisaApril 2011

(standards in practice)

Page 2: Interactive alignment of Parallel Texts – a cross browser

MLW Pisa 2011 G.Brelstaff & F.Chessa 2

Introduction

Alignment of

parallel texts;

multi-lingual;

minority languages;

poetry

But why?

Dante’s w

as a minority language.

Page 3: Interactive alignment of Parallel Texts – a cross browser

MLW Pisa 2011 G.Brelstaff & F.Chessa 3

Genius locithecreative spiritsof place – geolocated.

Minority languageaseed-bedfor poetic expression,beyond mere communication.

He was the cat that walked by himselfand all places were alike to him.

Kipling

“Think global, act local ” “Think local, act global ”

Whenever we lose a language the“genetic basis” for such expression diminishes, globally

Page 4: Interactive alignment of Parallel Texts – a cross browser

MLW Pisa 2011 G.Brelstaff & F.Chessa 4

Echo Chamber

Minority language

Island language

(song,verse,prose)

Page 5: Interactive alignment of Parallel Texts – a cross browser

MLW Pisa 2011 G.Brelstaff & F.Chessa 5

Echo Chamber

in poet’s head

Page 6: Interactive alignment of Parallel Texts – a cross browser

MLW Pisa 2011 G.Brelstaff & F.Chessa 6

Echo Chamber

inside the head

(ear,tongue,thought)

Page 7: Interactive alignment of Parallel Texts – a cross browser

MLW Pisa 2011 G.Brelstaff & F.Chessa 7

Nel

mez

zo del

cammin di n

ostra

vita m

i ritr

ovai

per

una s

elva o

scur

a

ché la

dirit

ta via

era s

marrit

a

Echo Chamber

inside the head

(ear,tongue,thought,eye)

Page 8: Interactive alignment of Parallel Texts – a cross browser

MLW Pisa 2011 G.Brelstaff & F.Chessa 8

Nel mezzo del

cammin di nostra

vita mi ritrovai per

una selva oscura

ché la diritta via

era smarrita

Language Barrier

Page 9: Interactive alignment of Parallel Texts – a cross browser

MLW Pisa 2011 G.Brelstaff & F.Chessa 9

Language Barrier

Cultural context BCultural context A

cf R.Jakobson

diffusion

diffusion

diffusion

diffusion

Page 10: Interactive alignment of Parallel Texts – a cross browser

MLW Pisa 2011 G.Brelstaff & F.Chessa 10

Minority language Global language

osmosis

diffusion

diffusion

diffu

sion

“cellular membrane”

diffusion

Language Barrier

Assist avoiding dilution,

shrivelling, bursting.

diffusion

Page 11: Interactive alignment of Parallel Texts – a cross browser

MLW Pisa 2011 G.Brelstaff & F.Chessa 11

Nel mezzo del

cammin di nostra

vita mi ritrovai per

una selva oscura

ché la diritta via

era smarrita

Language Barrier

Page 12: Interactive alignment of Parallel Texts – a cross browser

MLW Pisa 2011 G.Brelstaff & F.Chessa 12

Nel mezzo del

cammin di nostra

vita mi ritrovai per

una selva oscura

ché la diritta via

era smarrita

Parallel text alignment ↔ to communicate semantics• standards-based markup• web delivery, cross-browser• non-verbal interactvity• beyond GoogleTranslate →

Midway this way

of life, we’re

bound upon I

woke up and

found myself in a

dark wood.

Midway this way

of life, we’re

bound upon I

woke up and

found myself in a

dark wood.

Nel mezzo del

cammin di nostra

vita mi ritrovai per

una selva oscura

ché la diritta via

era smarrita

Midway Nel mezzo

Translator

Page 13: Interactive alignment of Parallel Texts – a cross browser

MLW Pisa 2011 G.Brelstaff & F.Chessa 13

Beyond GoogleTranslate:

• SMT not going to translate poetry well any time soon.• We allow the translator to clarify by alignment• Point-&-click interface to modify standard markup• Colour-code: formal & dynamic equivalence [Nida-Taber]

• Demo

Parallel text alignment web interface

Page 14: Interactive alignment of Parallel Texts – a cross browser

MLW Pisa 2011 G.Brelstaff & F.Chessa 14

Demo (a desktop browser: IE8-9,FF3-4,Opera11,Chrome,Safari)

Page 15: Interactive alignment of Parallel Texts – a cross browser

MLW Pisa 2011 G.Brelstaff & F.Chessa 15

Demo:selection by click

Page 16: Interactive alignment of Parallel Texts – a cross browser

MLW Pisa 2011 G.Brelstaff & F.Chessa 16

Demo:selection & alignment

Page 17: Interactive alignment of Parallel Texts – a cross browser

MLW Pisa 2011 G.Brelstaff & F.Chessa 17

Presentation Content Structure Semantics

eXist XMLdb

XMLCSSXHTML not RDF

Unicode

http put

REST/ajax

DOMJavascript

jQuery

I n t e r a c t

TEI-p5

XMLSchema

XSL

XQL

notw3cRange

Standards in practice

Pros & Cons

Page 18: Interactive alignment of Parallel Texts – a cross browser

MLW Pisa 2011 G.Brelstaff & F.Chessa 18

Presentation Content Structure Semantics

eXist XMLdb

XMLCSSXHTML not RDF

Unicode

http put

REST/ajax

DOMJavascript

jQuery

I n t e r a c t

TEI-p5

XMLSchema

XSL

XQL

Cons: #1

We can’t interact directly with Semantics

Browsers only bind events to XHTML (why not XML?) elements

Incurs two degrees of messy indirection.

notw3cRange

Page 19: Interactive alignment of Parallel Texts – a cross browser

MLW Pisa 2011 G.Brelstaff & F.Chessa 19

Presentation Content Structure Semantics

eXist XMLdb

XMLCSSXHTML not RDF

Unicode

http put

REST/ajax

DOMJavascript

jQuery

I n t e r a c t

TEI-p5

XMLSchema

XSL

XQL

Cons: #2

w3cRange is not “road worthy”.

We resort to Click to Select

Selection within words still lacking.

notw3cRange

Page 20: Interactive alignment of Parallel Texts – a cross browser

MLW Pisa 2011 G.Brelstaff & F.Chessa 20

Presentation Content Structure Semantics

eXist XMLdb

XMLCSSXHTML not RDF

Unicode

http put

REST/ajax

DOMJavascript

jQuery

I n t e r a c t

TEI-p5

XMLSchema

XSL

XQL

Cons: #3

TEI-p5 must be subsetted to avoid overlapping markup

We prioritise alignment tags over{verse-line,paragraph} hierarchy.

notw3cRange

Page 21: Interactive alignment of Parallel Texts – a cross browser

MLW Pisa 2011 G.Brelstaff & F.Chessa 21

Presentation Content Structure Semantics

eXist XMLdb

XMLCSSXHTML not RDF

Unicode

http put

REST/ajax

DOMJavascript

jQuery

I n t e r a c t

TEI-p5

XMLSchema

XSL

XQL

Pros: #1

Unicode in XML attributes permits our novel alignment scheme:

The verbatim source text is simply assigned as an attributed of an enclosing tag in the translated text

notw3cRange

Page 22: Interactive alignment of Parallel Texts – a cross browser

MLW Pisa 2011 G.Brelstaff & F.Chessa 22

Presentation Content Structure Semantics

eXist XMLdb

XMLCSSXHTML not RDF

Unicode

http put

REST/ajax

DOMJavascript

jQuery

I n t e r a c t

TEI-p5

XMLSchema

XSL

XQL

Pros: #2

CSS selection mechanism as embraced in jQuery helps tame the complexity of cross-browser DOM programming.

notw3cRange

Page 23: Interactive alignment of Parallel Texts – a cross browser

MLW Pisa 2011 G.Brelstaff & F.Chessa 23

Presentation Content Structure Semantics

eXist XMLdb

XMLCSSXHTML not RDF

Unicode

http put

REST/ajax

DOMJavascript

jQuery

I n t e r a c t

TEI-p5

XMLSchema

XSL

XQL

Pros: #3

RESTful archiving is a realitydue to:

• Ajax in the browser• Http PUT on the wire, &• eXist XML db on the server

notw3cRange

Page 24: Interactive alignment of Parallel Texts – a cross browser

MLW Pisa 2011 G.Brelstaff & F.Chessa 24

Conclusion

Cons

• Can’t bind to XML

• W3cRange not ready

• Must subset TEI-p5

Pros

• Unicode in attributes

• CSS&jQuery v. DOM

• RESTful reality

Standards in practice

Page 25: Interactive alignment of Parallel Texts – a cross browser

MLW Pisa 2011 G.Brelstaff & F.Chessa 25

Browser issues

• Opera: no transparent cursor in text

• Firefox: synchronous scoll down bug

• IE: onselectstart issue

• Google Chrome: run from disk fix

• Safari/Chrome/IE Form Enctype: validation

Page 26: Interactive alignment of Parallel Texts – a cross browser

MLW Pisa 2011 G.Brelstaff & F.Chessa 26

That’s all folks:

Gavin Brelstaff (gjb@ crs4.it )CRS4 09010 Pula (CA) – Sardinia, Italy

Francesca ChessaUniversity of Sassari, Italy

L'A

mor

che

mov

e il

sole

e l'

altr

e st

elle