51
Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg (UC Davis)

Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

Embed Size (px)

Citation preview

Page 1: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

Orthographic regularization of morphology in English, and the advantages of N-gram research

Maxwell J. SowellMarissa C. Huston-Carico

Eric D. Warburg(UC Davis)

Page 2: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

Researching morphological change

Page 3: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

Researching morphological change

• N-grams

Page 4: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

Researching morphological change

• What is an N-gram?

Page 5: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

Researching morphological change

• What is an N-gram?– 1-gram: “morphology”– 2-gram: “morphological process”– 3-gram: “morphological process research”– etc.

Page 6: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

Researching morphological change

• How does this help with research?

Page 7: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

Researching morphological change

• How does this help with research?

Page 8: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

Researching morphological change

• How does this help with research?• When did “yuppie” come into use?

Page 9: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

Researching morphological change

• How does this help with research?• What about “yuppiedom”?

Page 10: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

Researching morphological change

• How does this help with research?• “yuppie” “yuppiedom”

Page 11: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

Researching morphological change

• How does this help with research?• Advantages over theoretical process research:

Page 12: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

Researching morphological change

• How does this help with research?• Advantages over theoretical process research:– “Theoretical”

Page 13: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

Researching morphological change

• How does this help with research?• Advantages over theoretical process research:– “Theoretical”– Hard evidence that one word was used first

Page 14: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

Researching morphological change

• How does this help with research?• Advantages over theoretical process research:– “Theoretical”– Hard evidence that one word was used first– Specific dates can be implemented in

sociolinguistic research

Page 15: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

Morphological regularization

Page 16: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

Morphological regularization

• General morphological rules and processes make parsing more efficient

Page 17: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

Morphological regularization

• General morphological rules and processes make parsing more efficient– /write + -groblaxt/ - past tense– /read + -pfeffets/ - past tense

Page 18: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

Morphological regularization

• General morphological rules and processes make parsing more efficient

• Still some irregular morphology:– ox oxen– eat ate

Page 19: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

Morphological regularization

• General morphological rules and processes make parsing more efficient– What would the plural of /boug/ be?

Page 20: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

Morphological regularization

• General morphological rules and processes make parsing more efficient– What would the plural of /boug/ be?– The past tense of /teev/?

Page 21: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

Morphological regularization

• General morphological rules and processes make parsing more efficient– What would the plural of /boug/ be?– The past tense of /teev/?

• Speakers tend to use regularized forms when irregular forms are unknown or not drawn from the lexicon quickly enough

Page 22: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

Morphological regularization

• General morphological rules and processes make parsing more efficient

• Speakers tend to use regularized forms when irregular forms are unknown or not drawn from the lexicon quickly enough

• If morphology does not reflect close relations between words, regularization is a risk

Page 23: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

“pedlar” “peddler”

Page 24: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

“pedlar” “peddler”

• Why is the change significant?

Page 25: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

“pedlar” “peddler”

• Why is the change significant?• More generally, why is spelling significant?

Page 26: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

“pedlar” “peddler”

• Why is the change significant?• More generally, why is spelling significant?– speakers retain visual spellings by symbolizing

sounds (c.f. Ehri & Wilce 1980)

Page 27: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

“pedlar” “peddler”

• Why is the change significant?• More generally, why is spelling significant?– speakers retain visual spellings by symbolizing

sounds (c.f. Ehri & Wilce 1980)– visualized representations of words, rather than

their sequences of sounds, are used to cognitively parse them into meaningful parts (c.f. Olson 1996)

Page 28: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

“pedlar” “peddler”

• So what happened?

Page 29: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

“pedlar” “peddler”

• So what happened?• No new semantic niche for “peddler” to fill

“seller” not shown – also in relatively constant use

Page 30: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

“pedlar” “peddler”

• “peddle” comes in near when “peddler” did

Page 31: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

“pedlar” “peddler”

• closely relate the two lexemes’ roots with morphological reanalysis after all in use

Page 32: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

“pedlar” “peddler”

• closely relate the two lexemes’ roots with morphological reanalysis after all in use

• /peddle + -er/ cognitively closer to /peddle/

Page 33: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

“pedlar” “peddler”

• What about “peddler” being reanalyzed in order to create “peddle” via back-formation?

Page 34: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

“pedlar” “peddler”

• What about “peddler” being reanalyzed in order to create “peddle” via back-formation?– “peddling” already existed; more feasible to

remove an inflectional affix than a derivational one to yield “peddle”

Page 35: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

“pedlar” “peddler”

• What about “peddler” being reanalyzed in order to create “peddle” via back-formation?– “peddling” already existed; more feasible to

remove an inflectional affix than a derivational one to yield “peddle”

– seemingly no reason for a spelling change if ‘peddle’ is not causing the change

Page 36: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

“burglar” and “*burgler”

• Why is “*burgler” not attested?

“*burgler” insignificant and so not shown

Page 37: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

“burglar” and “*burgler”

• Why is “*burgler” not attested?– differences in relative usage of related lexemes

Page 38: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

“burglar” and “*burgler”

• Why is “*burgler” not attested?– differences in relative usage of related lexemes– “burgle” is a humorous back-formation

Page 39: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

“burglar” and “*burgler”

• Speakers are equally likely to use “burglarize”

Page 40: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

“pedlar” vs. “burglar”

• Spelling not changed in order to back-form

Page 41: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

“pedlar” vs. “burglar”

• Spelling not changed in order to back-form• “Peddle” (and “burgle”, too) semantically

unnecessary; in use for other [social] reasons

Page 42: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

“pedlar” vs. “burglar”

• Spelling not changed in order to back-form• “Peddle” (and “burgle”, too) semantically

unnecessary; in use for other [social] reasons• Rising usage caused speakers to relate

“pedlar” and “peddle”

Page 43: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

“pedlar” vs. “burglar”

• Spelling not changed in order to back-form• “Peddle” (and “burgle”, too) semantically

unnecessary; in use for other [social] reasons• Rising usage caused speakers to relate

“pedlar” and “peddle”• Regularized with /-er/ ending instead of

recalling separately memorized form “pedlar”

Page 44: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

“pedlar” vs. “burglar”

• Spelling not changed in order to back-form• “Peddle” (and “burgle”, too) semantically

unnecessary; in use for other [social] reasons• Rising usage caused speakers to relate

“pedlar” and “peddle”• Regularized with /-er/ ending instead of

recalling separately memorized form “pedlar”• “pedlar” fell out of use in American English;

efficiency taking priority is a noted trend

Page 45: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

Further study

• Using raw chronology for derivational research

Page 46: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

Further study

• Using raw chronology for derivational research• For English /-er/ regularization– Without many English words ending in [-ar] that

semantically correlate with the deverbal nominalizer /-er/, research is limited

Page 47: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

Further study

• Using raw chronology for derivational research• For English /-er/ regularization– Without many English words ending in [-ar] that

semantically correlate with the deverbal nominalizer /-er/, research is limited

– Etymological research

Page 48: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

Further study

• Using raw chronology for derivational research• For English /-er/ regularization– Without many English words ending in [-ar] that

semantically correlate with the deverbal nominalizer /-er/, research is limited

– Etymological research• e.g. “pedlar” is derived from Latin, while “burglar” is

derived from French

Page 49: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

Further study

• Using raw chronology for derivational research• For English /-er/ regularization– Without many English words ending in [-ar] that

semantically correlate with the deverbal nominalizer /-er/, research is limited

– Etymological research• e.g. “pedlar” is derived from Latin, while “burglar” is

derived from French

– Other sociolinguistic applications

Page 50: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

Further study

• Using raw chronology for derivational research• For English /-er/ regularization– Without many English words ending in [-ar] that

semantically correlate with the deverbal nominalizer /-er/, research is limited

– Etymological research• e.g. “pedlar” is derived from Latin, while “burglar” is

derived from French

– Other sociolinguistic applications• Why words like “peddle” come into use/peak in use

Page 51: Orthographic regularization of morphology in English, and the advantages of N-gram research Maxwell J. Sowell Marissa C. Huston-Carico Eric D. Warburg

Selected references• Bourassa, D. C., & Treiman, R. (2008). Morphological constancy in

spelling: A comparison of children with dyslexia and typically developing children. Dyslexia, 14(3), 155-169. doi: 10.1002/dys.368

• Ehri, L. C., & Wilce, L. S. (1980). The influence of orthography on readers' conceptualization of the phonemic structure of words. Applied Psycholinguistics, 1, 371-385. doi: 10.1017/S0142716400009802

• Hoad, T. F. (1993). The concise oxford dictionary of english etymology. New York, NY: Oxford University Press USA.

• Hudson, G. (2000). Essential introductory linguistics. Malden, MA: Blackwell.

• Olson, D. R. (1996). The world on paper: The conceptual and cognitive implications of writing and reading. Cambridge, England: Cambridge University Press.