TY - JOUR
T1 - Biology-inspired graph neural network encodes reactome and reveals biochemical reactions of disease
AU - Burkhart, Joshua G.
AU - Wu, Guanming
AU - Song, Xubo
AU - Raimondi, Francesco
AU - McWeeney, Shannon
AU - Wong, Melissa H.
AU - Deng, Youping
N1 - Publisher Copyright:
© 2023 The Author(s)
PY - 2023/7/14
Y1 - 2023/7/14
N2 - Functional heterogeneity of healthy human tissues complicates interpretation of molecular studies, impeding precision therapeutic target identification and treatment. Considering this, we generated a graph neural network with Reactome-based architecture and trained it using 9,115 samples from Genotype-Tissue Expression (GTEx). Our graph neural network (GNN) achieves adjusted Rand index (ARI) = 0.7909, while a Resnet18 control model achieves ARI = 0.7781, on 370 held-out healthy human tissue samples from The Cancer Genome Atlas (TCGA), despite the Resnet18 using over 600 times the parameters. Our GNN also succeeds in separating 83 healthy skin samples from 95 lesional psoriasis samples, revealing that upregulation of 26S- and NUB1-mediated degradation of NEDD8, UBD, and their conjugates is central to the largest perturbed reaction network component in psoriasis. We show that our results are not discoverable using traditional differential expression and hypergeometric pathway enrichment analyses yet are supported by separate human multi-omics and small-molecule mouse studies, suggesting future molecular disease studies may benefit from similar GNN analytical approaches.
AB - Functional heterogeneity of healthy human tissues complicates interpretation of molecular studies, impeding precision therapeutic target identification and treatment. Considering this, we generated a graph neural network with Reactome-based architecture and trained it using 9,115 samples from Genotype-Tissue Expression (GTEx). Our graph neural network (GNN) achieves adjusted Rand index (ARI) = 0.7909, while a Resnet18 control model achieves ARI = 0.7781, on 370 held-out healthy human tissue samples from The Cancer Genome Atlas (TCGA), despite the Resnet18 using over 600 times the parameters. Our GNN also succeeds in separating 83 healthy skin samples from 95 lesional psoriasis samples, revealing that upregulation of 26S- and NUB1-mediated degradation of NEDD8, UBD, and their conjugates is central to the largest perturbed reaction network component in psoriasis. We show that our results are not discoverable using traditional differential expression and hypergeometric pathway enrichment analyses yet are supported by separate human multi-omics and small-molecule mouse studies, suggesting future molecular disease studies may benefit from similar GNN analytical approaches.
KW - DSML2: Proof-of-concept: Data science output has been formulated, implemented, and tested for one domain/problem
KW - biochemical reaction
KW - differential gene expression
KW - graph neural network
KW - healthy tissue
KW - human reactome
KW - molecular disease study
KW - pathway enrichment analysis
KW - reaction network
UR - http://www.scopus.com/inward/record.url?scp=85163510263&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85163510263&partnerID=8YFLogxK
U2 - 10.1016/j.patter.2023.100758
DO - 10.1016/j.patter.2023.100758
M3 - Article
AN - SCOPUS:85163510263
SN - 2666-3899
VL - 4
JO - Patterns
JF - Patterns
IS - 7
M1 - 100758
ER -