Cardiff University | Prifysgol Caerdydd ORCA
Online Research @ Cardiff 
WelshClear Cookie - decide language by browser settings

Corpus linguistics: One size fits all? Exploring (and exploiting) methods to analyse small and large corpora of public and professional discourse

Potts, Amanda 2017. Corpus linguistics: One size fits all? Exploring (and exploiting) methods to analyse small and large corpora of public and professional discourse. Presented at: Quantitative Lexicology and Variational Linguistics (QLVL) Meeting, KU Leuven, Belgium, 30 August 2017.

Full text not available from this repository.

Abstract

Over the past decade, rapid technological evolution has revolutionised the study of language; we have access to more powerful tools and more data than ever before. But what is happening at the extremes of the size spectrum? In using corpora in the tens of millions (or even billions) of words, how can we be sure that we are placing enough stress on 'linguistic' analysis? In analysing small datasets in the tens of thousands of words, how can we be sure that this is 'corpus' linguistics? In my work at Lancaster University and Cardiff University, I have been experimenting with methods of 'downscaling' and 'upscaling' results from very large representative corpora and very small opportunistic corpora. These methods include triangulation, semantic collocation, and XML markup on corpora of print journalism, legal sentencing remarks, YouTube comments, activist tweets, doctors' blogs, and Google autocomplete results. In this talk, I will introduce my work, discuss the highs and lows, and talk about the research that I am currently undertaking.

Item Type: Conference or Workshop Item (Lecture)
Date Type: Completion
Status: Unpublished
Schools: English, Communication and Philosophy
Subjects: P Language and Literature > P Philology. Linguistics
Last Modified: 13 Sep 2017 12:39
URI: http://orca.cf.ac.uk/id/eprint/104576

Actions (repository staff only)

Edit Item Edit Item