Cardiff University | Prifysgol Caerdydd ORCA
Online Research @ Cardiff 
WelshClear Cookie - decide language by browser settings

CANELC – constructing an e-language corpus

Knight, Dawn, Adolphs, Svenja and Ronald, Carter 2014. CANELC – constructing an e-language corpus. Corpora 9 (1) , pp. 29-56. 10.3366/cor.2014.0050

[img]
Preview
PDF - Accepted Post-Print Version
Download (1MB) | Preview

Abstract

This paper reports on the construction of the Cambridge and Nottingham e-language Corpus (CANELC). CANELC is a one-million word corpus of digital communication in English, taken from online discussion boards, blogs, tweets, e-mails and Short Message Services (SMS). The paper outlines the approaches used when planning the corpus: obtaining consent, collecting the data and compiling the corpus database. This is followed by a detailed analysis of some of the patterns of language used in the corpus. The analysis includes a discussion of the key words and phrases used, as well as the common themes and semantic associations connected with the data. These discussions form the basis of an investigation into how e-language operates in ways that are both similar to and different from spoken and written records of communication (as evidenced by the British National Corpus, BNC).

Item Type: Article
Date Type: Publication
Status: Published
Schools: English, Communication and Philosophy
Subjects: P Language and Literature > P Philology. Linguistics
Q Science > QA Mathematics > QA76 Computer software
Uncontrolled Keywords: blogs, tweets, SMS, discussion boards, e-language, corpus linguistics
Publisher: Edinburgh University Press
ISSN: 1749-5032
Date of First Compliant Deposit: 30 March 2016
Last Modified: 26 Feb 2019 16:04
URI: http://orca.cf.ac.uk/id/eprint/72349

Citation Data

Cited 9 times in Scopus. View in Scopus. Powered By Scopus® Data

Actions (repository staff only)

Edit Item Edit Item

Downloads

Downloads per month over past year

View more statistics