A convolutional neural network provides a generalizable model of natural sound coding by neural populations in auditory cortex

Jacob R. Pennington; Stephen V. David

doi:10.1371/journal.pcbi.1011110

Otolaryngology

Research output: Contribution to journal › Article › peer-review

1 Scopus citation

Abstract

Convolutional neural networks (CNNs) can provide powerful and flexible models of neural sensory processing. However, the utility of CNNs in studying the auditory system has been limited by their requirement for large datasets and the complex response properties of single auditory neurons. To address these limitations, we developed a population encoding model: a CNN that simultaneously predicts activity of several hundred neurons recorded during presentation of a large set of natural sounds. This approach defines a shared spectro-temporal space and pools statistical power across neurons. Population models of varying architecture performed consistently and substantially better than traditional linear-nonlinear models on data from primary and non-primary auditory cortex. Moreover, population models were highly generalizable. The output layer of a model pre-trained on one population of neurons could be fit to data from novel single units, achieving performance equivalent to that of neurons in the original fit data. This ability to generalize suggests that population encoding models capture a complete representational space across neurons in an auditory cortical field.
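The transfer step described in the abstract (keep the shared layers of a pre-trained population model and fit only the output layer for a novel unit) can be illustrated with a minimal sketch. This is not the authors' implementation: the array sizes, the random "pre-trained" kernels, the simulated neuron, and the helper names shared_features, fit_readout and predict are all hypothetical, and ridge regression stands in for whatever fitting procedure the paper actually uses.

```python
import numpy as np

rng = np.random.default_rng(0)

def shared_features(spectrogram, kernels):
    """Causal spectro-temporal convolution followed by ReLU.

    spectrogram: (n_freq, n_time); kernels: (n_feat, n_freq, n_lags).
    Returns (n_time, n_feat) features shared by every neuron.
    """
    n_lags = kernels.shape[2]
    # Zero-pad the past so the output at time t uses only bins up to t.
    padded = np.pad(spectrogram, ((0, 0), (n_lags - 1, 0)))
    # windows has shape (n_freq, n_time, n_lags).
    windows = np.lib.stride_tricks.sliding_window_view(padded, n_lags, axis=1)
    return np.maximum(np.einsum("ftl,kfl->tk", windows, kernels), 0.0)

def fit_readout(features, responses, ridge=1e-3):
    """Ridge-regression fit of a linear output layer (weights plus bias)."""
    X = np.hstack([features, np.ones((features.shape[0], 1))])
    A = X.T @ X + ridge * np.eye(X.shape[1])
    return np.linalg.solve(A, X.T @ responses)

def predict(features, readout):
    """Apply a fitted output layer to shared features."""
    X = np.hstack([features, np.ones((features.shape[0], 1))])
    return X @ readout

# Hypothetical sizes and random stand-ins; none of these values come from the paper.
n_freq, n_time, n_lags, n_feat = 18, 500, 15, 8
spectrogram = rng.random((n_freq, n_time))
kernels = rng.normal(size=(n_feat, n_freq, n_lags))  # stands in for pre-trained shared layers

features = shared_features(spectrogram, kernels)

# Simulated "novel" neuron: a noisy mixture of the shared features.
true_w = rng.normal(size=(n_feat, 1))
novel_response = features @ true_w + 0.1 * rng.normal(size=(n_time, 1))

# Fit only the output layer on the first 400 bins; the shared layers stay frozen.
readout = fit_readout(features[:400], novel_response[:400])
predicted = predict(features[400:], readout)
corr = np.corrcoef(predicted[:, 0], novel_response[400:, 0])[0, 1]
print(f"held-out prediction correlation: {corr:.3f}")
```

Because the simulated neuron is built from the same shared features, the refit output layer predicts it well by construction; the sketch shows only the mechanics of freezing shared layers and refitting a per-neuron readout, not the paper's results.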

Original language	English (US)
Article number	e1011110
Journal	PLOS Computational Biology
Volume	19
Issue number	5
DOIs	https://doi.org/10.1371/journal.pcbi.1011110
State	Published - May 2023

ASJC Scopus subject areas

Ecology, Evolution, Behavior and Systematics
Modeling and Simulation
Ecology
Molecular Biology
Genetics
Cellular and Molecular Neuroscience
Computational Theory and Mathematics

Access to Document

10.1371/journal.pcbi.1011110

Cite this

@article{68a64c16f3e1401abf1151ffda2a7e6b,

title = "A convolutional neural network provides a generalizable model of natural sound coding by neural populations in auditory cortex",

abstract = "Convolutional neural networks (CNNs) can provide powerful and flexible models of neural sensory processing. However, the utility of CNNs in studying the auditory system has been limited by their requirement for large datasets and the complex response properties of single auditory neurons. To address these limitations, we developed a population encoding model: a CNN that simultaneously predicts activity of several hundred neurons recorded during presentation of a large set of natural sounds. This approach defines a shared spectro-temporal space and pools statistical power across neurons. Population models of varying architecture performed consistently and substantially better than traditional linear-nonlinear models on data from primary and non-primary auditory cortex. Moreover, population models were highly generalizable. The output layer of a model pre-trained on one population of neurons could be fit to data from novel single units, achieving performance equivalent to that of neurons in the original fit data. This ability to generalize suggests that population encoding models capture a complete representational space across neurons in an auditory cortical field.",

author = "Pennington, {Jacob R.} and David, {Stephen V.}",

note = "Publisher Copyright: {\textcopyright} 2023 Pennington, David. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.",

year = "2023",

month = may,

doi = "10.1371/journal.pcbi.1011110",

language = "English (US)",

volume = "19",

journal = "PLOS Computational Biology",

issn = "1553-734X",

publisher = "Public Library of Science",

number = "5",

}

TY - JOUR

T1 - A convolutional neural network provides a generalizable model of natural sound coding by neural populations in auditory cortex

AU - Pennington, Jacob R.

AU - David, Stephen V.

N1 - Publisher Copyright: © 2023 Pennington, David. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

PY - 2023/5

Y1 - 2023/5

N2 - Convolutional neural networks (CNNs) can provide powerful and flexible models of neural sensory processing. However, the utility of CNNs in studying the auditory system has been limited by their requirement for large datasets and the complex response properties of single auditory neurons. To address these limitations, we developed a population encoding model: a CNN that simultaneously predicts activity of several hundred neurons recorded during presentation of a large set of natural sounds. This approach defines a shared spectro-temporal space and pools statistical power across neurons. Population models of varying architecture performed consistently and substantially better than traditional linear-nonlinear models on data from primary and non-primary auditory cortex. Moreover, population models were highly generalizable. The output layer of a model pre-trained on one population of neurons could be fit to data from novel single units, achieving performance equivalent to that of neurons in the original fit data. This ability to generalize suggests that population encoding models capture a complete representational space across neurons in an auditory cortical field.

AB - Convolutional neural networks (CNNs) can provide powerful and flexible models of neural sensory processing. However, the utility of CNNs in studying the auditory system has been limited by their requirement for large datasets and the complex response properties of single auditory neurons. To address these limitations, we developed a population encoding model: a CNN that simultaneously predicts activity of several hundred neurons recorded during presentation of a large set of natural sounds. This approach defines a shared spectro-temporal space and pools statistical power across neurons. Population models of varying architecture performed consistently and substantially better than traditional linear-nonlinear models on data from primary and non-primary auditory cortex. Moreover, population models were highly generalizable. The output layer of a model pre-trained on one population of neurons could be fit to data from novel single units, achieving performance equivalent to that of neurons in the original fit data. This ability to generalize suggests that population encoding models capture a complete representational space across neurons in an auditory cortical field.

UR - http://www.scopus.com/inward/record.url?scp=85159762968&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85159762968&partnerID=8YFLogxK

U2 - 10.1371/journal.pcbi.1011110

DO - 10.1371/journal.pcbi.1011110

M3 - Article

C2 - 37146065

AN - SCOPUS:85159762968

SN - 1553-734X

VL - 19

JO - PLOS Computational Biology

JF - PLOS Computational Biology

IS - 5

M1 - e1011110

ER -
