TY - JOUR
T1 - Reverse engineering cellular networks
AU - Margolin, Adam A.
AU - Wang, Kai
AU - Lim, Wei Keat
AU - Kustagi, Manjunath
AU - Nemenman, Ilya
AU - Califano, Andrea
N1 - Funding Information:
ACKNOWLEDGMENTS This work was supported by the National Cancer Institute, the National Institute of Allergy and Infectious Diseases, and the National Centers for Biomedical Computing NIH Roadmap Initiative. A.A.M. is supported by an IBM Ph.D. fellowship and by the National Library of Medicine Medical Informatics Research Training Program. I.N. is supported by the Department of Energy/National Nuclear Security Administration. We would like to thank R. Dalla-Favera for continued support and insight, K. Basso and U. Klein for contributions to the experimental validation of the original ARACNE algorithm, and K. Smith for help in reviewing the manuscript.
PY - 2006/7
Y1 - 2006/7
N2 - We describe a computational protocol for the ARACNE algorithm, an information-theoretic method for identifying transcriptional interactions between gene products using microarray expression profile data. Similar to other algorithms, ARACNE predicts potential functional associations among genes, or novel functions for uncharacterized genes, by identifying statistical dependencies between gene products. However, based on biochemical validation, literature searches and DNA binding site enrichment analysis, ARACNE has also proven effective in identifying bona fide transcriptional targets, even in complex mammalian networks. Thus we envision that predictions made by ARACNE, especially when supplemented with prior knowledge or additional data sources, can provide appropriate hypotheses for the further investigation of cellular networks. While the examples in this protocol use only gene expression profile data, the algorithm's theoretical basis readily extends to a variety of other high-throughput measurements, such as pathway-specific or genome-wide proteomics, microRNA and metabolomics data. As these data become readily available, we expect that ARACNE might prove increasingly useful in elucidating the underlying interaction models. For a microarray data set containing ∼10,000 probes, reconstructing the network around a single probe completes in several minutes using a desktop computer with a Pentium 4 processor. Reconstructing a genome-wide network generally requires a computational cluster, especially if the recommended bootstrapping procedure is used.
AB - We describe a computational protocol for the ARACNE algorithm, an information-theoretic method for identifying transcriptional interactions between gene products using microarray expression profile data. Similar to other algorithms, ARACNE predicts potential functional associations among genes, or novel functions for uncharacterized genes, by identifying statistical dependencies between gene products. However, based on biochemical validation, literature searches and DNA binding site enrichment analysis, ARACNE has also proven effective in identifying bona fide transcriptional targets, even in complex mammalian networks. Thus we envision that predictions made by ARACNE, especially when supplemented with prior knowledge or additional data sources, can provide appropriate hypotheses for the further investigation of cellular networks. While the examples in this protocol use only gene expression profile data, the algorithm's theoretical basis readily extends to a variety of other high-throughput measurements, such as pathway-specific or genome-wide proteomics, microRNA and metabolomics data. As these data become readily available, we expect that ARACNE might prove increasingly useful in elucidating the underlying interaction models. For a microarray data set containing ∼10,000 probes, reconstructing the network around a single probe completes in several minutes using a desktop computer with a Pentium 4 processor. Reconstructing a genome-wide network generally requires a computational cluster, especially if the recommended bootstrapping procedure is used.
UR - http://www.scopus.com/inward/record.url?scp=33750353888&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33750353888&partnerID=8YFLogxK
U2 - 10.1038/nprot.2006.106
DO - 10.1038/nprot.2006.106
M3 - Article
C2 - 17406294
AN - SCOPUS:33750353888
SN - 1754-2189
VL - 1
SP - 662
EP - 671
JO - Nature protocols
JF - Nature protocols
IS - 2
ER -