Participation Type
Paper
Presentation #1 Title
Initiating the Young Appalachians’ Living Language (YALL) Corpus
Presentation #1 Abstract or Summary
The varieties of English spoken in Appalachia are some of the most widely-recognized varieties of American English (Wolfram and Christian 1976, Montgomery 2004), and popular conceptions of the varieties, which cast speakers as ignorant and backwards, and their language as inherently incorrect, are based on the mythologies of Appalachian people and culture as isolated and homogenous (Montgomery 1999). These negative notions about Appalachian Englishes are pervasive and powerful.
With this context as a backdrop, we have initiated a collaborative research endeavor – the Young Appalachians’ Living Language (YALL) Corpus – to build a corpus of audio-aligned transcripts of interviews with young people from across Appalachia, providing a large collection of sociolinguistic data from the Appalachian dialect region and a representation of Appalachian speech to the interested public through a website designed for outreach and education. As a monitor corpus (Davies 2010) that gathers comparable data sets at regular temporal intervals to build a longitudinal collection of data for a language community, the YALL Corpus holds the potential to reveal ongoing, long-range changes in Appalachian Englishes that have been difficult to capture with standard sociolinguistic methodologies.
Our main goal in this presentation is to discuss our methods and rationale for the corpus. We will give a brief glimpse into the kinds of data that have been collected so far, but data analysis will not figure strongly in our presentation. We hope to present the case for the project as a whole, soliciting input on our data collection, processing, and analysis techniques.
At-A-Glance Bio- Presenter #1
Jennifer Cramer is associate professor of linguistics at the University of Kentucky. She received a Ph.D. in linguistics from the University of Illinois in 2010. Her research focuses on the perception and production of linguistic variation at dialect and regional borders, with a specific interest in the dialects spoken in Kentucky, located at the border between the Southern and Midwestern regions of the United States.
Mark Richard Lauersdorf is associate professor of linguistics at the University of Kentucky. His research in historical sociolinguistics concentrates on data-driven, corpus-based methods to investigate language variation, change, and normalization in contexts of high varietal differentiation and strong contact between varieties. His work also includes the conceptualization and development of digital tools and resources to pursue such investigations.
Initiating the Young Appalachians’ Living Language (YALL) Corpus
The varieties of English spoken in Appalachia are some of the most widely-recognized varieties of American English (Wolfram and Christian 1976, Montgomery 2004), and popular conceptions of the varieties, which cast speakers as ignorant and backwards, and their language as inherently incorrect, are based on the mythologies of Appalachian people and culture as isolated and homogenous (Montgomery 1999). These negative notions about Appalachian Englishes are pervasive and powerful.
With this context as a backdrop, we have initiated a collaborative research endeavor – the Young Appalachians’ Living Language (YALL) Corpus – to build a corpus of audio-aligned transcripts of interviews with young people from across Appalachia, providing a large collection of sociolinguistic data from the Appalachian dialect region and a representation of Appalachian speech to the interested public through a website designed for outreach and education. As a monitor corpus (Davies 2010) that gathers comparable data sets at regular temporal intervals to build a longitudinal collection of data for a language community, the YALL Corpus holds the potential to reveal ongoing, long-range changes in Appalachian Englishes that have been difficult to capture with standard sociolinguistic methodologies.
Our main goal in this presentation is to discuss our methods and rationale for the corpus. We will give a brief glimpse into the kinds of data that have been collected so far, but data analysis will not figure strongly in our presentation. We hope to present the case for the project as a whole, soliciting input on our data collection, processing, and analysis techniques.