Article

Attentional Modulation of Hierarchical Speech Representations in a Multitalker Environment

Journal

CEREBRAL CORTEX
Volume 31, Issue 11, Pages 4986-5005

Publisher

OXFORD UNIV PRESS INC
DOI: 10.1093/cercor/bhab136

Keywords

cocktail-party; dorsal and ventral stream; encoding model; fMRI; natural speech

Funding

  1. National Eye Institute [EY019684]
  2. European Molecular Biology Organization Installation [IG 3028]
  3. TUBA GEBIP
  4. Science Academy BAGEP

Attention plays a crucial role in selectively listening to desired speakers in noisy environments, causing broad modulations in speech representations at multiple levels and growing stronger towards later stages of processing. Unattended speech is still represented up to the semantic level in parabelt auditory cortex.
Humans are remarkably adept at listening to a desired speaker in a crowded environment while filtering out nontarget speakers in the background. Attention is key to solving this difficult cocktail-party task, yet a detailed characterization of attentional effects on speech representations is lacking. It remains unclear at which levels of speech features attentional modulation occurs, and how strong it is, in each brain area during the cocktail-party task. To address these questions, we recorded whole-brain blood-oxygen-level-dependent (BOLD) responses in separate experiments while subjects either passively listened to single-speaker stories or selectively attended to a male or a female speaker in temporally overlaid stories. Spectral, articulatory, and semantic models of the natural stories were constructed. Intrinsic selectivity profiles were identified via voxelwise models fit to passive-listening responses. Attentional modulations were then quantified based on model predictions for attended and unattended stories in the cocktail-party task. We find that attention causes broad modulations at multiple levels of speech representations, growing stronger toward later stages of processing, and that unattended speech is represented up to the semantic level in parabelt auditory cortex. These results provide insights into the attentional mechanisms that underlie the ability to selectively listen to a desired speaker in noisy multispeaker environments.
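
The general logic described in the abstract (fit a voxelwise encoding model on passive-listening data, then compare how well it predicts cocktail-party responses from the attended versus the unattended story) can be illustrated with a minimal sketch on synthetic data. This is not the authors' pipeline: the closed-form ridge fit, the fixed `alpha`, the simulated features and responses, and the `attention_index` formula are all assumptions made for illustration, and the helper names are hypothetical.

```python
import numpy as np

def fit_ridge(X, Y, alpha=1.0):
    """Closed-form ridge regression; returns weights (n_features, n_voxels)."""
    n_features = X.shape[1]
    return np.linalg.solve(X.T @ X + alpha * np.eye(n_features), X.T @ Y)

def voxelwise_corr(Y_pred, Y_true):
    """Pearson correlation between predicted and measured responses, per voxel."""
    Yp = Y_pred - Y_pred.mean(axis=0)
    Yt = Y_true - Y_true.mean(axis=0)
    denom = np.linalg.norm(Yp, axis=0) * np.linalg.norm(Yt, axis=0)
    return (Yp * Yt).sum(axis=0) / denom

def attention_index(r_att, r_unatt):
    """Hypothetical index in [-1, 1]; positive values favor the attended story."""
    r_att = np.clip(r_att, 0, None)      # ignore negative prediction scores
    r_unatt = np.clip(r_unatt, 0, None)
    total = r_att + r_unatt
    return np.divide(r_att - r_unatt, total,
                     out=np.zeros_like(total), where=total > 0)

# Synthetic stand-ins for stimulus features and BOLD responses (assumed sizes).
rng = np.random.default_rng(0)
n_time, n_features, n_voxels = 400, 10, 5
W_true = rng.normal(size=(n_features, n_voxels))

# Passive listening: estimate each voxel's intrinsic selectivity.
X_passive = rng.normal(size=(n_time, n_features))
Y_passive = X_passive @ W_true + 0.5 * rng.normal(size=(n_time, n_voxels))
W_hat = fit_ridge(X_passive, Y_passive, alpha=1.0)

# Cocktail-party task: simulated responses weight the attended story more.
X_att = rng.normal(size=(n_time, n_features))
X_unatt = rng.normal(size=(n_time, n_features))
Y_cocktail = (X_att @ W_true + 0.3 * (X_unatt @ W_true)
              + 0.5 * rng.normal(size=(n_time, n_voxels)))

# Score the passive-listening model against each story's features.
r_att = voxelwise_corr(X_att @ W_hat, Y_cocktail)
r_unatt = voxelwise_corr(X_unatt @ W_hat, Y_cocktail)
idx = attention_index(r_att, r_unatt)
print(idx)
```

In this toy setup the simulated voxels respond more strongly to the attended story, so the index comes out positive for every voxel. With real data, one such model would be fit per feature space (spectral, articulatory, semantic) and the regularization strength would be chosen by cross-validation rather than fixed.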
