Sample Language Interaction Corpora available from this Website

(See Chapter 7 of the LIDES CODING MANUAL)



Click on the link in the last row to download the sample corpus as a .cha file.



 

Corpus Languages

Multilingual 

(8 languages)

Dutch / Turkish

Dutch / French

Alsatian German / 

French

Researcher

Eva Codó <Eva.Codo@uab.es>

Ad Backus <A.M.Backus@kub.nl>

Jeanine Treffers-Daller <J-TREFFERSDALLER
@wpg.uwe.ac.uk>

Penelope Gardner-Chloros <p.gardner-chloros
@app-ling.bbk.ac.uk>

Location of study

Barcelona

Tilburg, Netherlands

Brussels

Strasbourg

Year of data collection

1997 - 1998

1989 and 1990

1985-1991

1983 - 1985

Research goals

Language negotiation patterns

General account

Language interaction phenomena (code-switching/borrowing)

General account

Type of study

Service sites

Community

Community

Community

Sampling

348 encounters

Networks

social network 

approach

3 families

3 workplaces

Type of data

Service encounters

Conversation

Conversation

Conversation

Amount of data recorded

15 hours

10 hours

70 hours

10 hours

Amount of data 

transcribed

5 hours 

(128 encounters)

10 hours

30 hours

10 ours

CHAT transcription?

Yes

No

Yes (6 hours)

Yes

Data file

corpus03.cha

corpus01.cha

corpus10.cha

corpus05.cha



 

Corpus Languages

Catalan / Spanish

Catalan / Spanish

English / Spanish

English/ German 

Researcher

M. Teresa Turell <teresa.turell@trad.upf.es> 
and Montserrat Forcadell
<montserrat.forcadell
@trad.upf.es>

Maria Carme Torras <mctorras@seneca.uab.es>

Melissa Moyer <Melissa.Moyer@uab.es>

Eva Eppler <evieppler@hotmail.com>

Location of study

La Canonja (Tarragona)

Barcelona

Gibraltar

London

Year of data collection

1987 - 1992

1997

1989 - 1990

1993

Research goals

Lang. interaction 

by community

Service encounters

General account

Grammar, 

morpho-syntax

Type of study

Community

Service sites

Community

Community

Sampling

30 subjects by age and 

ethnicity

Many sites

(290 encounters)

Many settings

Network

Type of data

Sociolinguistic interview

Conversation

Conversation

Conversation

Amount of data recorded

60 hours

4 hours

180 hours

46 hours

Amount of data 

transcribed

60 hours

90 encounters

40 hours

8 hours

CHAT transcription?

˝ hour

Yes

2 hours

Yes

Data file

 corpus11a.cha
corpus11b.cha

corpus09.cha

corpus07a.cha

corpus07b.cha

corpus04.cha (small sample)

NEW in 2002! Much larger sample of data.


 

Corpus Languages

English / Punjabi

English / Greek 

CypriotDialect 

English / Jamaican 

Creole

Catalan / Spanish / 

English

Researcher

Reeva   Charles r.charles@gold.ac.uk

Penelope Gardner-Chloros <p.gardner-chloros
@app-ling.bbk.ac.uk>

Mark Sebba <M.Sebba
@lancaster.ac.uk>

M. Teresa Turell <teresa.turell@trad.upf.es> 
and Montserrat Forcadell
<montserrat.forcadell
@trad.upf.es>

Location of study

London

London

London

Barcelona

Year of data collection

1994 -1995

1989 - 1990

1982 - 1984

1992

Research goals

General account

Language interaction 

by generations

General account

Lang. interaction

phenomena

Type of study

Community

Community

Community

Community

Sampling

4 women

30 subjects by age 

and gender

Family homes, schools

6 informants

Type of data

Conversation

Informal interviews

Conversation

Conversation

Amount of data recorded

6 hours

25 hours

6 hours

5 hours

Amount of data 

transcribed

6 hours

25 hours

1 hour

1˝ hours

CHAT transcription?

2 hours

Yes

Sample only

1˝ hours

Data file

corpus02.cha

corpus06.cha

corpus08.cha

corpus12.cha


 

Return to LIPPS home page