Corpus linguistic methodology, Corpus linguistics, Corpus tools, Digital humanities, Early modern English, English, English grammar, English language, European languages, Grammar, Grammatical theory and description, Historical and diachronic corpora, Historical GIS, Humanities computing, India, Language, Linguistics, Metaphor, Multilingual corpora, Quantitative linguistics, Semantics, South Asia, Statistics, Syntax
English Language and Linguistics
Dr Andrew Hardie
UCREL - University Centre for Computer Corpus Research on Language
My major specialism is corpus linguistics - specifically, the methodology of corpus linguistics, and how it can be applied to different areas of study in linguistics and beyond. I am currently working on applications of corpus methods in the social sciences and humanities. I am also very interested in the use of corpus-based methods to study languages other than English, especially the languages of Asia, with an especial focus on issues in descriptive and theoretical grammar.
PhD Supervision Interests
I am willing to consider PhD applications in areas consistent with my research interests. I am especially eager to supervise students in the following two areas:
- The development of new corpus-based methods, or the extension of existing methodologies;
- The application of these methods in different areas of the humanities and social sciences.
I am also interested in supervising projects that extend established corpus methods to "new" languages - non-European languages and minority languages in particular - especially with regard to topics in descriptive or theoretical grammar.
See below for an indicative list of topics studied by my current and previous PhD supervisees.
As well as holding the position of Lecturer in Linguistics, I am Deputy Director of the ESRC Centre for Corpus Approaches to Social Science, a major research project running for five years from April 2013.
I am also currently the Chair of UCREL, the corpus research centre which brings together researchers from the Linguistics and Computing departments. (From 2005 to 2012 I was Project Development Officer.)
My primary research specialism is the corpus-based methodology and its applications. In particular, I am interested in a range of areas relating to corpus design and construction and corpus analysis methods and software tools, and how these may be applied to my own subject area (broadly: the grammar of English and other languages), to other fields of linguistics such as discourse analysis or language teaching, and to other disciplines in the humanities and social sciences.
Most of my current research work is focused on a series of projects in which corpus methods are adapted to the needs of social scientists and humanists, in a range of subject areas including Psychology, Geography, History and English Literature.
Areas that I have worked on (and published in) earlier in my career include:
- quantitative and collocational approaches to grammar in English and other languages;
- historical text-mining, with particular regard to the journalism of the Early Modern English period;
- part-of-speech tagging and the theory of morphosyntactic categories;
- keyness and frequency phenomena in texts;
- the languages and writing systems of South Asia;
- text and corpus encoding and processing (with particular reference to Unicode).
Languages that I have worked on or am currently interested in include:
- Nepali (see my Nepali Grammar Project)
A major part of my work involves software development to support the corpus methodologies listed above. I am one of the lead developers of Corpus Workbench, a powerful, open-source system for corpus indexing and querying. Furthermore, I created (and continue to develop) the CQPweb system as a user-friendly front-end to the Corpus Workbench.
As part of my work on the EMILLE corpus of South Asian languages, I created the Unicodify software. While working on part-of-speech tagging for South Asian languages including Urdu and Nepali, I developed the Unitag framework.
A list of my research publications is available on this website.
I am currently attached full-time to a research project and therefore am not active in our general undergraduate and postgraduate teaching. I still supervise research students and teach on our postgraduate Summer School programmes.
I previously taught corpus linguistics, English grammar, grammatical theory, typology, language acquisition, psycholinguistics, and other topics at undergraduate and postgraduate level.
PhD Supervisions Completed
Here is a list of the topics that my current and former PhD students have worked on:
- The processing of learner speakers' collocational errors
- Using cluster analysis to study text typology in the British National Corpus
- Statistical analysis of closure in sublanguages
- Structural and ideological aspects of collocation in Modern Standard Arabic
- Valency-changing constructions in Javanese
- Sociolinguistics of swearing in Arabic
A comprehensive analysis of the form, content and impact of communications between large, publicly traded corporations and their key stakeholder groups concerning the following three key aspects of co ...
CASS is a Centre designed to bring a new method in the study of language – the corpus approach – to a range of social sciences. In doing so it provides an insight into the use and manipulation of lan ...
The primary aim of this project is to investigate the use of metaphor in the experience of end-of-life care in the UK. We will study the metaphors used by members of different stakeholder groups (pati ...
This five-year project runs from 2012-16, funded by the European Research Council under a Starting Researcher Grant. The project aims to create a step-change in the way that place, space and geography ...
Summary: We are an interdisciplinary research group which is combining established areas of research excellence at Lancaster University. The emergent synthesis is generating unique methods and approac ...
CQPweb is a web-based corpus analysis system which provides a user-friendly interface to the Corpus Workbench (CWB) system. This interface is compatible with any corpus, but is especially useful for l ...
Corpus-based grammar in contrast: The cross-linguistic distributional analysis of Nepali grammatical categories (01/10/2007 → 30/09/2009)
This project is a feasibility and pilot study on the exploitation of the Child Language Survey. It is led by a cross- ...
The project, which has gone through several stages since late 2005, represents an investigation into the computer-assisted analysis of metaphoric patterns across discourses and genres. Our overall aim ...
CASS is delighted to announce a successful ESRC application for funding on a project entitled "Twitter rape threats and the discourse of online misogyny" (ES/L008874/1). The award of £191,245.25 was ...
The seventh international Corpus Linguistics conference (CL2013) will be held at Lancaster University from Tuesday 23rd July 2013 to Friday 26th July 2013. The main conference will be preceded by a wo ...