The findings point to an important nuance about when cognition helps. When speech signals are reasonably intact, people with stronger cognitive scores gained greater benefits from spatial and spectral separation. When the target voice was heavily degraded so that little useful signal remained, cognitive ability mattered less. That pattern fits a view in which listening can shift along a continuum: sometimes our mental resources rescue a difficult task, and sometimes the signal quality limits what cognition can do.

For anyone interested in hearing, learning, or designing inclusive audio environments, these results matter. They suggest which kinds of signal-processing or seating arrangements might help different listeners, and they raise questions about tailoring technology or spaces to support people whose cognitive resources are taxed. Follow the link to read how these experiments tie into broader ideas about human potential, accessibility, and the limits of attention.

Abstract
Speech-on-speech listening involves selectively attending to a target talker while ignoring a simultaneous competing talker. Spatially separating the talkers improves performance, a phenomenon known as spatial release from masking (spatial RM). The same is true of spectral separation, that is, filtering the talkers into non-overlapping frequency bands (spectral RM). The relative benefit of spatial versus spectral RM is currently unknown. Furthermore, it is unclear how listeners’ ability to exploit spatial versus spectral cues is related to individual differences in cognition. The resource-limit account suggests that cognitive resources are required to support the processing of degraded speech, implying the strongest cognition/performance relationship when RM is limited or absent. However, an alternative claim, referred to as the data-limit account, suggests that cognitive resources cease to be useful when the target is severely degraded. In this study, participants (N = 240) completed a selective listening task in which they transcribed the speech of one of two simultaneously presented talkers. The speech was filtered into interleaved or overlapping frequency bands (spectral RM vs. no spectral RM) and presented dichotically or collocated (a proxy for spatial RM vs. no spatial RM). A battery of cognitive tasks was administered to assess working memory/attention. Spectral RM provided at least as much benefit as spatial RM, with the best performance when both RM types were present. Cognitive scores were significantly positively correlated with RM benefits. However, the weakest correlation between cognitive scores and performance was observed in the no-RM condition. The results therefore support an account of speech-on-speech listening that lies on a continuum from data-limited to resource-limited processing as a function of the quality of the target speech signal.

Read Full Article (External Site)