Contact Details
Tel: +44 (0)1524 593024
Room: Bowland, B85
Dr Andrew Hardie

Lecturer in Corpus Linguistics
Degree: BA (Lancaster), PhD (Lancaster)
Associated research centres and groups: University Centre for Computer Corpus Research on Language (UCREL)
Current Teaching
This year, my undergraduate teaching includes the following (NB links are accessible to current students only):
- Director of Studies for Linguistics and the joint English Language and Linguistics degree schemes
- Course convenor and lecturer on LING 101
- Course convenor and lecturer on LING 208
- Course convenor and lecturer on LING 206
- Lecturer on LING 130
- Lecturer on LING 151
- Seminar tutor and occasional lecturer on LING 202
In previous years I have also taught on LING 152 and LING 203.
My postgraduate teaching includes the MA module in Corpus Linguistics and the New Route PhD module in Child Language Acquisition. I also supervise a number of PhD students.
Research Interests
Since September 2005 I have held the post of Lecturer in Corpus Linguistics in the department. I am currently doing research in a range of areas relating to corpus annotation, multilingual corpus linguistics and corpus-driven grammar and textual studies.
My particular research interests currently include:
- quantitative approaches to grammar in English and beyond;
- historical text-mining, with particular regard to the journalism of the Early Modern English period;
- part-of-speech tagging and the theory of morphosyntactic categories;
- keyness and frequency phenomena in texts;
- the languages and writing systems of South Asia, in particular the grammar of Nepali;
- text and corpus encoding and processing (with particular reference to Unicode).
My research into South Asian languages is currently focussed on Nepali. In co-operation with scholars at a number of institutions including the Open University, the University of Gothenburg, and Tribhuvan University, Kathmandu, I am working on the EU-funded Nelralec project, contributing expertise on corpus construction, encoding and annotation to the creation of the Nepali National Corpus. My assorted other work on Nepali, including a project to build a corpus of spoken Nepali funded by the British Academy, is grouped together as the Nepali Grammar Project.
Previously I have worked on the EMILLE corpus, which consists of 93 million words of text resources for fifteen South Asian languages. I was also involved in constructing the Lancaster Newsbooks Corpus.
My work on corpus encoding and annotation has at times involved the creation of software tools. As part of my work on EMILLE, I created the Unicodify software. While working on part-of-speech tagging for South Asian languages including Urdu and Nepali, I developed the Unitag framework.
A list of my research publications is available on this website.
Potential Doctoral Proposals
I would be especially interested in supervising PhD candidates working in the following areas:
- corpora of languages other than English - and in alphabets other than Latin;
- development and applications of corpus annotation ("tagging" at various levels);
- corpus-based grammatical analysis, especially cross-linguistic and/or quantitative approaches to grammar;
- investigating languageusing statistical collocation;
- the exploitation of corpus methods and resources in the other fields of the humanities and social sciences (e.g. history);
- or, more generally, in any area coherent with my research interests.
Other professional activities
I am the Project Development Officer of UCREL, the corpus research centre which beings together researchers from the Linguistics and Computing departments.
I am on the editorial boards of Corporaand the Journal of Quantitative Linguistics.
Associated Keywords: Corpus linguistic methodology, Corpus linguistics, Corpus tools, Digital humanities, Early modern English, Early modern writing, English, English grammar, English language, European languages, Grammar, Grammatical theory and description, Historical and diachronic corpora, Historical GIS, Humanities computing, India, Language, Linguistics, Metaphor, Multilingual corpora, Quantitative linguistics, Semantics, Seventeenth century, South Asia, Statistics, Swearing, Syntax
View all research activities, news and events associated with Andrew Hardie.

