Exploring vernacular reorganisation in a longitudinal corpus of pre-schoolers’ speech

Joshua Wilson Black and Lynn Clark

Overview

Overview

  1. Background and Research Questions
  2. Data
  3. Analysis
  4. Results
  5. Interpretation

The Team

Joshua Wilson Black (Postdoc)

Lynn Clark (PI)

Robert Fromont (Software)

Gail Gillon (AI)

Brigid McNeill (AI)

Amy Scott (AI)

Grant: 20-UOC-064

Background

Vernacular reorganisation

[…] we necessarily begin with the phonetics, phonology, morphology, and syntax that we acquired from our first caretaker, normally female. The general condition for linguistic change can then be stated in a very simple way: children must learn to talk differently from their mothers. Let us refer to this process as vernacular re-organization. (Labov 2001, 415)

We investigate the onset of vernacular reorganisation.

Vernacular reorganisation

  1. Transmission: Acquire caregiver’s vernacular.

  2. Incrementation: Advance linguistic changes underway in the community.

Trigger: shift from caregiver-dominated norms to peer-dominated norms around 4-5 (usually at school).

Research Questions

  1. Is there evidence of vernacular reorganisation in the speech of preschool aged children?
  2. Are all changing accent features advanced at the same time and at the same rate during incrementation?

Data

Story Retell

  • Source: the UC Child Well-Being Research Institute’s Better Start Literacy Approach | Te Ara Reo Matatini.

  • Children were presented with a story and asked to retell it in their own words.

  • Two stories were used: Hana and the Tūī and Tama and the Playground (Gillon, McNeill, and Scott 2019).

  • Both stories were written by the CWRI to match the NZ cultural context and contain a wide variety of literacy-relevant linguistic features (Gillon et al. 2023; Scott et al. 2022).

Tell:

Retell:

Preschooler corpus

  • 18 centres
  • 132 children
  • F: 76, M: 65
  • 60 with two recordings
  • 19 with three recordings
  • 2 with four recordings

Preschooler corpus

  • 101 NZ European
  • 19 Māori
  • 2 Pasifika
  • 6 Asian
  • 4 Other
  • Age at recording: 3;11 - 5;5
  • Median age: 4;6

Preschooler corpus

  • dress: 968 (F1), 961 (F2)
  • fleece: 1150 (F1), 1159 (F2)
  • kit: 1888 (F1), 1848 (F2)
  • nurse 444 (F1), 447 (F2)
  • trap 987 (F1), 963 (F2)

QuakeBox

Source: Wayne Williams (Port Hills Productions
  • 431 single-speaker recordings
  • Prompt: “tell us your earthquake story”.
  • Recorded across a range of sites in Christchurch in 2011-2012.
  • Our community baseline (caregivers: F 18-35, Oldest generation: F 76+)
  • High quality audio and video recordings.

Analysis

Preprocessing

  1. Transcription from CWRI.
  2. Correction of timestamps.
  3. Forced alignment (see Fromont et al. 2023).
  4. Formant tracking (via FastTrack (Barreda 2021) with manual checking).
  5. Normalisation
  • Watch out: Kids data raises many problems — be careful! ⚠️
    • See supplementary materials of full paper.

Modelling Approach

  1. We don’t have all the data we need to directly test core hypotheses
    • …so we go exploratory.
  2. We want to discern an onset in sound change
    • …so we use GAMMs.
  3. Data is sparse and model convergence difficult

Model structure

For dress, fleece, kit, nurse, and trap (the ‘extended short front vowel shift’), F1 and F2 we fit the following model:

formant_value ~ gender + stopword + s(age_s, by=gender, k = 4) + (1|word/unstressed) + (1|participant/collect)
  • Similar models fit to QuakeBox.

Results

e.g.:

Results summary

  • Is there evidence of vernacular reorganisation in the speech of pre-school aged children?
    • Both stability and change.
    • No ‘elbow’ in the smooths indicate rapid acceleration of change (incrementation).
    • Difficult to interpret these straightforwardly as vernacular reorganisation.
      • fleece is going the wrong way!
  • Are all changing accent features advanced at the same time and at the same rate during incrementation?
    • Not all features are changing.
    • Some are changing faster than others.
    • Some are changing differently for males and females.

Interpretation

Developmental?

CDS/Story-book speech

  • Kids vowel spaces suggest hyper-articulation.
  • First thought: kids primed by storyteller?
  • Second thought: kids learn hyper-articulated vowels before developing the ability to hypo-articulate (see Ménard et al. 2020).

  • We can’t exclude development effects. (More inter(sub)disciplinary is work needed).

Upshot

  • We’ve found sound change before kids start school.
  • No ‘elbow’, or point at which incrementation begins.
  • Some changes, e.g. dress raising look like vernacular reorganisation.
  • But fleece looks entirely different.
  • Possible influence of CDS/Story-book speech and developmental hyperarticulation.
  • Focus on multiple variables at once is vital for this kind of project.

References

Barreda, Santiago. 2021. “Fast Track: Fast (Nearly) Automatic Formant-Tracking Using Praat.” Linguistics Vanguard 7 (1). https://doi.org/10.1515/lingvan-2020-0051.
Brosseau-Lapré, Françoise, and Elizabeth Roepke. 2019. “Speech Errors and Phonological Awareness in Children Ages 4 and 5 Years with and Without Speech Sound Disorder.” Journal of Speech, Language, and Hearing Research 62 (September): 3276–89. https://doi.org/10.1044/2019_jslhr-s-17-0461.
Bürkner, Paul-Christian. 2017. brms: An R Package for Bayesian Multilevel Models Using Stan.” Journal of Statistical Software 80 (1). https://doi.org/10.18637/jss.v080.i01.
Donegan, Patricia. 2012. “Normal Vowel Development.” In Handbook of Vowels and Vowel Disorders. Taylor & Francis.
Fromont, Robert, Lynn Clark, Joshua Wilson Black, and Margaret Blackwood. 2023. “Maximizing Accuracy of Forced Alignment for Spontaneous Child Speech.” Language Development Research 3 (1). https://doi.org/10.34842/shrr-sv10.
Gillon, Gail, Brigid McNeill, and Amy Scott. 2019. Tama and the Playground. University of Canterbury Child Wellbeing Research Institute.
Gillon, Gail, Brigid McNeill, Amy Scott, Megan Gath, and Marleen Westerveld. 2023. “Retelling Stories: The Validity of an Online Oral Narrative Task.” Child Language Teaching and Therapy, 02656590231155861.
Labov, William. 2001. Principles of Linguistic Change: Social Factors. Wiley-Blackwell.
Ménard, Lucie, Amélie Prémont, Pamela Trudeau-Fisette, Christine Turgeon, and Mark Tiede. 2020. “Phonetic Implementation of Prosodic Emphasis in Preschool-Aged Children and Adults: Probing the Development of Sensorimotor Speech Goals.” Journal of Speech, Language, and Hearing Research 63 (June): 1658–74. https://doi.org/10.1044/2020\_jslhr-20-00017.
Roepke, Elizabeth, and Françoise Brosseau-Lapré. 2021. “Vowel Errors Produced by Preschool-Age Children on a Single-Word Test of Articulation.” Clinical Linguistics &Amp; Phonetics 35 (January): 1161–83. https://doi.org/10.1080/02699206.2020.1869834.
Scott, Amy, Gail Gillon, Brigid McNeill, and Alex Kopach. 2022. “The Evolution of an Innovative Online Task to Monitor Childrens Oral Narrative Development.” Frontiers in Psychology 13 (July). https://doi.org/10.3389/fpsyg.2022.903124.
Vorperian, Houri K., and Ray D. Kent. 2007. “Vowel Acoustic Space Development in Children: A Synthesis of Acoustic and Anatomic Data.” Journal of Speech, Language, and Hearing Research. https://doi.org/10.1044/1092-4388(2007/104).

Extra: Formant tracking settings

Upper limits:

  • Front vowels get 6000-9000 Hz.

  • Back vowels get 5000-8000 Hz.

  • Other vowels get 5500-8500 Hz

  • ‘Front vowels’ (NZE) = fleece, dress, nurse, goose, trap, kit

  • ‘Back vowels’ = lot, thought, foot

  • ‘Others’ = strut, start

By-vowel limits:

  • FLEECE: f2 > 1500,
  • DRESS: f2 > 1500,
  • GOOSE: f2 > 1000,
  • NURSE: f2 > 1200,
  • THOUGHT: f2 < 2250,
  • LOT: f2 < 2500
  • FOOT: f2 > 900
  • KIT: f2 > 1250

Formant bounds:

label f1lower f1upper f2lower f2upper f3lower f3upper START 350 1500 1200 3500 0 5000 THOUGHT 350 1500 1200 2250 0 5000 TRAP 350 1500 1200 3500 0 5000 NURSE 350 1500 1200 3500 0 5000 DRESS 350 1500 1500 4000 0 5000 FLEECE 350 1500 1500 4000 0 5000 KIT 350 1500 1250 3500 0 5000 LOT 350 1500 1200 2500 0 5000 GOOSE 350 1500 1000 3500 0 5000 FOOT 350 1500 900 3500 0 5000 STRUT 350 1500 1200 3500 0 5000