%A Dahl,Christoph
%A Logothetis,Nikos
%A Kayser,Christoph
%D 2010
%J Frontiers in Integrative Neuroscience
%C
%F
%G English
%K cross-modal,sensory integration,Temporal Lobe,visual scene
%Q
%R 10.3389/fnint.2010.00010
%W
%L
%M
%P
%7
%8 2010-April-13
%9 Original Research
%+ Dr Christoph Kayser,Max Planck Institute for Biological Cybernetics,Department for Physiology of Cognitive Processes,Tübingen,Germany,christoph.kayser@uni-bielefeld.de
%#
%! STS neurons and audio-visual congruency
%*
%<
%T Modulation of visual responses in the superior temporal sulcus by audio-visual congruency
%U https://www.frontiersin.org/articles/10.3389/fnint.2010.00010
%V 4
%0 JOURNAL ARTICLE
%@ 1662-5145
%X Our ability to identify or recognize visual objects is often enhanced by evidence provided by other sensory modalities. Yet, where and how visual object processing benefits from the information received by the other senses remains unclear. One candidate region is the temporal lobe, which features neural representations of visual objects, and in which previous studies have provided evidence for multisensory influences on neural responses. In the present study we directly tested whether visual representations in the lower bank of the superior temporal sulcus (STS) benefit from acoustic information. To this end, we recorded neural responses in alert monkeys passively watching audio-visual scenes, and quantified the impact of simultaneously presented sounds on responses elicited by the presentation of naturalistic visual scenes. Using methods of stimulus decoding and information theory, we then asked whether the responses of STS neurons become more reliable and informative in multisensory contexts. Our results demonstrate that STS neurons are indeed sensitive to the modality composition of the sensory stimulus.
Importantly, information provided by STS neurons’ responses about the particular visual stimulus being presented was highest during congruent audio-visual and unimodal visual stimulation, but was reduced during incongruent bimodal stimulation. Together, these findings demonstrate that higher visual representations in the STS not only convey information about the visual input but also depend on the acoustic context of a visual scene.