Welcome to LANGSNAP

Publications Reference

Researchers using this repository are asked to acknowledge the LANGSNAP project in any publication arising from their work. They are also asked to acknowledge the CHILDES project as outlined at

The aim of this repository is to promote research on the learning of French and Spanish as L2, by making parallel learner corpora for each language freely available to the research community.

The learner corpora made available here were collected during 2011-13 as part of the research project "Social networks, target language interaction, and second language acquisition during the year abroad: A longitudinal study" (LANGSNAP Project: ESRC research award number RES-062-23-2996). The LANGSNAP corpora include both spoken and written data produced over a 21-month period by advanced (university-level) learners of French and Spanish.

All participants were speakers of English as their L1/dominant language and were studying languages at a British university. To complete their course requirements, all participants were required to spend the third year of their four-year programme in a French- or Spanish-speaking country. The participants chose among three types of placement abroad: a language teaching assistantship, enrolment at a partner university, or a workplace internship. The L2 production data were collected on 6 occasions: before, during and after the period of residence abroad.

The publicly available repository includes three types of data for each language, collected on all 6 occasions:

  • Oral interviews (where participants took part in a semi-structured interview led by a member of the research team);
  • Story retelling (where participants retold a story guided by a sequence of pictures);
  • Argumentative writing (where participants wrote a timed 200-word response to a stimulus question).

The data available in this repository include anonymised audio recordings of the oral interview and story retelling data, as well as transcripts of all data in CHAT format. To access and explore the data, click on the "Browse" tab above.

The repository adds new advanced learner corpora to the existing Southampton University collections of L2 French available at, and of learner Spanish available at

Researchers making use of the repository are expected to adhere generally to the ground rules for researchers and users developed by the CHILDES project at Carnegie Mellon University, USA.

LANGSNAP is powered by EPrints 3, which is developed by the School of Electronics and Computer Science at the University of Southampton.