Neurons in auditory brain areas use the rapid rise in speech volume at the start of each vowel sound to mark syllables, a new study published in Science Advances has found.
UC San Francisco neuroscientists have identified how the listening brain scans speech to break it down into syllables. The findings provide for the first time a neural basis for the fundamental atoms of language and insights into our perception of the rhythmic poetry of speech. For decades, speech neuroscientists have looked for evidence that neurons in auditory brain areas use fluctuations in speech volume to identify the beginnings and ends of syllables -- like a lin-guis-tics pro-fes-sor di-a-gram-ming a sen-tence. So far, these efforts have met with little luck.
The new study found that the brain instead responds to a marker of vocal stress in the middle of each syllable, more like a poet scanning the sonnets of Shakespeare.
The researchers showed that this signal -- in an area of speech cortex called the middle superior temporal gyrus (mSTG) -- is specifically based on the rising volume at the start of each vowel sound, which is a universal feature of human languages. Notably, the authors say, this simple syllabic marker could also provide the brain with direct information about patterns of stress, timing, and rhythm that are so central to conveying meaning and emotional context in English and many other languages.
"What I find most exciting about this work is that it shows a simple neural coding principle for the sense of rhythm that is absolutely fundamental to how our brains process speech," said neuroscientist Yulia Oganian, PhD, who led the new research. "Could this explain why humans are so sensitive to the sequence of stressed and unstressed syllables that make up spoken poetry, or even oral storytelling?"
Oganian is a postdoctoral researcher in the lab of UCSF Health neurosurgeon Eddie Chang, MD, PhD, Bowes Biomedical Investigator at UCSF, member of the UCSF Weill Institute for Neurosciences, and a Howard Hughes Medical Institute (HHMI) Faculty Scholar, whose research laboratory studies the neural basis of human speech, movement, and emotion.
"What really excites me is that we now understand how a simple sound cue, the rapid increase in loudness that happens at the onset of vowels, serves as a critical landmark for speech because it tells a listener when a syllable occurs and whether it is stressed. This is a rather central discovery about how the brain extracts syllable units from speech," said Chang.
Oganian recruited 11 volunteers whose seizure-mapping electrodes happened to overlap with areas of the brain involved in speech processing and who were happy to participate in a research study during their down-time in the hospital. She played each participant a selection of speech recordings from a variety of different speakers while recording patterns of brain activity in their auditory speech centers, then analyzed the data to identify neural patterns reflecting the syllabic structure of what they had heard.
To make it possible to identify what features of the audio recordings were driving the new-found syllable markers, Oganian asked four of her research volunteers to listen to recorded speech that was slowed down four-fold. These ultra-slow speech recordings let Oganian see that the syllable signals were occurring consistently at the moment of rising stress at the start of each vowel sound (e.g. as 'b' turns to 'a' in the syllable 'ba'), and not at the peak of each syllable as other scientists had theorized.
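For readers who want to see the idea in concrete terms, the cue described above, the moment when loudness is rising fastest, can be illustrated on a toy signal. The sketch below is only an illustration under stated assumptions, not the researchers' analysis: the function name `fastest_rise_times`, the 10 percent threshold, and the synthetic two-syllable loudness curve are all hypothetical. It takes a loudness envelope, computes how quickly it changes from sample to sample, and reports the local maxima of that rate of change.

```python
import numpy as np

def fastest_rise_times(envelope, sample_rate, min_fraction=0.1):
    """Return (times, rates) where the envelope is rising fastest.

    Hypothetical helper for illustration only. A point counts as a
    landmark if the rate of change is a local maximum and exceeds
    min_fraction of the largest rate in the recording.
    """
    rate = np.diff(envelope) * sample_rate        # loudness change per second
    middle, left, right = rate[1:-1], rate[:-2], rate[2:]
    threshold = min_fraction * rate.max()
    is_peak = (middle > left) & (middle >= right) & (middle > threshold)
    idx = np.nonzero(is_peak)[0] + 1              # index into `rate`
    return idx / sample_rate, rate[idx]

# Toy loudness curve (an assumption, not real speech): a stressed
# "syllable" peaking at 0.3 s and a weaker one peaking at 0.7 s.
sample_rate = 1000
t = np.arange(1000) / sample_rate
width = 0.05
envelope = (np.exp(-(t - 0.3) ** 2 / (2 * width ** 2))
            + 0.5 * np.exp(-(t - 0.7) ** 2 / (2 * width ** 2)))

times, rates = fastest_rise_times(envelope, sample_rate)
print(times, rates)
```

In this toy example each landmark falls on the rising edge of a loudness bump, before the bump reaches its peak, and the louder bump produces the steeper rise. That mirrors, in a very simplified way, the two properties the article attributes to the neural signal: it marks when a syllable occurs, and its size tracks how strongly the syllable is stressed.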
The syllabic marker Oganian discovered in the mSTG also varied with the emphasis the speaker placed on a particular syllable. This suggested that this first stage of speech processing allows the brain to split speech into syllabic units and, at the same time, to track the patterns of stress that are critical for meaning in English and many other languages (e.g. "computer console" vs. "console a friend", or the question "Did I do that?" spoken with the stress on different words).
The syllabic signal also provides a simple metronome for the brain to track the rhythm and speed of speech. "Some people speak fast; others speak slow. People change how quickly they speak when they are excited or sad. The brain needs to be able to adjust to that," Oganian said. "By marking whenever a new syllable is occurring, this signal acts as an internal pacemaker within the speech signal itself."
The researchers are continuing to study how brain signals in the mSTG are interpreted to enable the brain to process speech rhythmicity and meaning. They also hope to explore how the brain's interpretation of these signals varies in languages other than English that put more or less emphasis on the stress patterns of speech.
Source: EurekAlert