Wellington Corpus of Spoken New Zealand English
The Wellington Corpus of Spoken New Zealand English is a one-million-word corpus of transcribed spoken English compiled from materials collected between 1988 and 1994. It is made up of excerpts from a range of speakers, all of whom have lived in New Zealand since before the age of 10. The corpus was collected under the direction of linguist Janet Holmes and includes broadcast transcripts as well as informal conversations, telephone conversations, lectures, and oral history interviews.[1]
The corpus, which was distributed as part of the 1999 ICAME CD-ROM, has been used in a number of academic studies, including work on morphology,[2] pronoun use,[3] and language contact, such as the influence of Māori on New Zealand English.[4][5]
References
- Holmes, Janet; Vine, Bernadette; Johnson, Gary (1998). "Wellington Corpus". Retrieved May 28, 2015.
- Hundt, Marianne (1998). New Zealand English Grammar: Fact or Fiction. John Benjamins.
- Holmes, Janet (1998). "Generic pronouns in the Wellington Corpus of Spoken New Zealand English". Kōtare: New Zealand Notes & Queries.
- Macalister, John (2006). "The Maori presence in the New Zealand English lexicon, 1850–2000: Evidence from a corpus-based study". English World-Wide.
- Macalister, John (1999). "Trends in New Zealand English: Some Observations on the Presence of Maori Words in the Lexicon". New Zealand English Journal.
External links
Corpus main website: http://www.victoria.ac.nz/lals/resources/corpora-default/corpora-wsc