London–Lund Corpus 2
A corpus of spoken British English
Welcome to the official website of the London–Lund Corpus 2 (LLC–2)!
LLC–2 is a half-million-word corpus of spoken British English, recorded between 2014 and 2019 with educated adult speakers. On the one hand, the corpus is a resource for studying contemporary speech from a synchronic perspective, across different registers and groups of speakers. On the other hand, it is designed according to the same principles as the original London–Lund Corpus (LLC–1), whose data were recorded from the 1950s to the 1980s. It thereby facilitates principled comparisons of spoken English across time periods roughly 50 years apart. The corpus design includes face-to-face conversation, phone/CMC conversation (landline telephone calls in LLC–1), broadcast media, parliamentary proceedings, spontaneous commentary, legal proceedings and prepared speech.
News about LLC–2
July 2021: Publication alert!
We are happy to announce the publication of the article "Challenges of releasing audio material for spoken data: The case of the London–Lund Corpus 2". The publication is part of the special issue "Challenges of combining structured and unstructured data in corpus compilation", co-edited by Tanja Säily and Jukka Tyrkkö for the journal Research in Corpus Linguistics. We thank Tanja and Jukka, as well as the general editors of RICL, for making this possible, and we hope that you enjoy reading the article! The full reference is listed under Publications.
February 2021: Talk at the UCL Survey of English Usage seminar
We are thankful to the Survey of English Usage at University College London for inviting Nele to talk at their virtual seminar. The talk was entitled "The London–Lund Corpus 2: A new resource for spoken English" and outlined some recent developments in the compilation of the corpus. UCL was also the main site of recording for LLC–2, so we were happy to be "back". Slides available on request!