25
A corpus-based study of PP placement in translated and non-translated journalistic Dutch. Annelore Willems, Gert De Sutter Faculty of Translation Studies University College Ghent – Ghent University {annelore.willems,gert.desutter}@hogent.be Grammar and Genre 2012

A corpus-based study of PP placement in translated and non-translated journalistic Dutch

  • Upload
    cybele

  • View
    31

  • Download
    0

Embed Size (px)

DESCRIPTION

A corpus-based study of PP placement in translated and non-translated journalistic Dutch . Annelore Willems, Gert De Sutter Faculty of Translation Studies University College Ghent – Ghent University { annelore.willems,gert.desutter }@hogent.be. Grammar and Genre 2012. - PowerPoint PPT Presentation

Citation preview

Page 1: A corpus-based study of PP placement  in  translated and non-translated journalistic Dutch

A corpus-based study of PP placement

in translated and non-translated journalistic Dutch.

Annelore Willems, Gert De Sutter Faculty of Translation Studies

University College Ghent – Ghent University

{annelore.willems,gert.desutter}@hogent.be

Grammar and Genre 2012

Page 2: A corpus-based study of PP placement  in  translated and non-translated journalistic Dutch

Stylistic rule in Dutch: a prepositional phrase (PP) should be extraposed in subordinate clauses whenever the distance between the subject and V-final becomes too large.

dat ikSU contacten hebV-final binnen een vijftiental ondernemingen van de Bel20. that ISU contacts haveV-final within about five enterprises of the Bel20.

Post field position or extraposition

dat ikSU binnen een vijftiental ondernemingen van de Bel20 contacten hebV-final. that ISU within about five enterprises of the Bel20 contacts haveV-final.

Middle field position or non-extraposition

Background

Page 3: A corpus-based study of PP placement  in  translated and non-translated journalistic Dutch

1. Try to investigate whether this stylistic rule is reflected in actual language use.

2. Try to investigate whether this rule has an identical effect in different genres of journalistic texts and in translated and non-translated Dutch.

Purpose

Page 4: A corpus-based study of PP placement  in  translated and non-translated journalistic Dutch

Dutch Parallel Corpus (DPC)• a 10-million-word, parallel corpus of Dutch, English and French• sentence-aligned with basic linguistic annotations• 5 different text genres but for this presentation only journalistic texts

Data selection:• dependent clauses starting with the grammatical conjunction dat (=

that)• PP phrases where variation between extraposition and non-

extraposition is possible

Method: Which data will be used?

Page 5: A corpus-based study of PP placement  in  translated and non-translated journalistic Dutch

Length between SU en V-final in words Length of the PP in words

dat ikSU contacten binnen een vijftiental ondernemingen van de Bel20 hebV-final.

that ISU contacts within about five enterprises of the Bel20 haveV-final.

Operationalisation

Page 6: A corpus-based study of PP placement  in  translated and non-translated journalistic Dutch

The length of the PP will be positively correlated with extraposition. If the distance between SU and V-final becomes too large, the PP

will be extraposed. There will be a difference between news items and comment

articles. Biber & Conrad 2009

There will be a difference between translated and non-translated journalistic texts. Baker 1993, Puurtinen 1998

Hypotheses

Page 7: A corpus-based study of PP placement  in  translated and non-translated journalistic Dutch

Results: general trend

Page 8: A corpus-based study of PP placement  in  translated and non-translated journalistic Dutch

The length of the PP will be positively correlated with extraposition.

1 = 2 or 3 words2 = 4 to 6 words3 = 7 to 11 words4 = 12 or more words

AV = extrapositionMV = middle field

Page 9: A corpus-based study of PP placement  in  translated and non-translated journalistic Dutch

Logistic regression analysis:

The length of the PP will be positively correlated with extraposition.

Factor O.R. p-value

Length PP 16.10 9.67e-16 ***

Page 10: A corpus-based study of PP placement  in  translated and non-translated journalistic Dutch

If the distance between SU and V-final becomes too large, the PP will be extraposed.

1 = 0 words2 = 1 or 2 words3 = 3 or more words

AV = extraposition

Page 11: A corpus-based study of PP placement  in  translated and non-translated journalistic Dutch

Logistic regression analysis:

If the distance between SU and V-final becomes too large, the PP will be extraposed.

Factor O.R. p-value

Length between SU and V 1.47 8.27e-05 ***

Page 12: A corpus-based study of PP placement  in  translated and non-translated journalistic Dutch

Stylistic rule in Dutch…a PP should be extraposed in subordinate clauses whenever the

distance between the subject and V-final becomes too large.• length of the PP itself• length between SU and V

… can be confirmed.

Summary

Page 13: A corpus-based study of PP placement  in  translated and non-translated journalistic Dutch

Results: genre specific trends

Page 14: A corpus-based study of PP placement  in  translated and non-translated journalistic Dutch

Overall distribution extraposition/non-extraposition in news articles and comment articles:

There will be a difference between news items and comment articles.

Factor O.R. p-value

Subtype 0.81 0.003**

Page 15: A corpus-based study of PP placement  in  translated and non-translated journalistic Dutch

The length of the PP: no difference between news articles and comment articles

There will be a difference between news items and comment articles.

45%

63%

88%

41%53%

78%

95%

100%

Page 16: A corpus-based study of PP placement  in  translated and non-translated journalistic Dutch

The length between SU and V: no difference between news articles and comment articles

There will be a difference between news items and comment articles.

37%

62%52%

43%

68%57%

Page 17: A corpus-based study of PP placement  in  translated and non-translated journalistic Dutch

Overall distribution extraposition/non-extraposition in translated and non-translated texts:

There will be a difference between translated and non-translated journalistic texts.

Factor O.R. p-value

(non-)translated 0.45

Page 18: A corpus-based study of PP placement  in  translated and non-translated journalistic Dutch

The length of the PP: no difference between translated and non-translated journalistic texts

There will be a difference between translated and non-translated journalistic texts.

44%57%

86%

43%60%

82%

99%93%

Page 19: A corpus-based study of PP placement  in  translated and non-translated journalistic Dutch

The length between SU and V: no difference between translated and non-translated journalistic texts

There will be a difference between translated and non-translated journalistic texts.

41%53%

78%

95%

64%58%

40% 42%

66%

50%

Page 20: A corpus-based study of PP placement  in  translated and non-translated journalistic Dutch

Difference between news articles and comment articles for the overall distribution of PP’s.

Stylistic rule has no different effect in news articles and in comment articles.

Stylistic rule has no different effect in translated and in non-translated texts.

Summary

Page 21: A corpus-based study of PP placement  in  translated and non-translated journalistic Dutch

Thank you!

For further [email protected]

Page 22: A corpus-based study of PP placement  in  translated and non-translated journalistic Dutch

Interference?

Structural interference: the word order in the source text influences the word order in the target text.

One last thing…

Position source texts Position PP target text

Extraposition Non-extraposition P-value

EN After V-final 62% 38%8.71e-10

Before V-final 27% 73%FR After V-final 64% 36%

1.597e-08Before V-final 15% 85%

Page 23: A corpus-based study of PP placement  in  translated and non-translated journalistic Dutch

Corpora: DPC

Text Type SRC → TGT DU EN FR Total

Journalistic texts

EN → DU 262 768 264 900 0 527 668

FR → DU 240 785 0 265 530 506 315

DU → EN 250 580 259 764 0 510 344

DU → FR 314 989 0 340 319 655 308

Total 1 069 122 524 664 605 849 2 199 635

Overview DPC

Page 24: A corpus-based study of PP placement  in  translated and non-translated journalistic Dutch

57% 43%

translatednon-translated

Overview translated/non-translated

Page 25: A corpus-based study of PP placement  in  translated and non-translated journalistic Dutch

58% 42%

extrapositionnon-extraposition

Overview extraposition/non-extraposition