How random is a corpus? The library metaphor

Autor(en): Evert, Stefan
Stichwörter: Literature
Erscheinungsdatum: 2006
Herausgeber: WALTER DE GRUYTER GMBH
Journal: ZEITSCHRIFT FUR ANGLISTIK UND AMERIKANISTIK
Volumen: 54
Ausgabe: 2
Startseite: 177
Seitenende: 190
Zusammenfassung: 
There is a stark contrast between the random sample model underlying the statistical analysis of corpus frequency data and our intuitive knowledge that sentences are more than random bags of words. The `library metaphor' illustrates how randomness results from the selection of a corpus as the basis for a linguistic study. At the same time it reveals two reasons why corpus data do not fully meet the assumptions of the random sample model. Finally, practicable methods for identifying and quantifying non-randomness are introduced and demonstrated on the example of passive verb forms.
Beschreibung: 
Workshop on Corpus Linguistics, Free Univ Berlin, Berlin, GERMANY, JUL 11, 2005
ISSN: 00442305

Show full item record

Page view(s)

1
Last Week
0
Last month
0
checked on Mar 5, 2024

Google ScholarTM

Check