TY - JOUR
T1 - Unifying the identification of biomedical entities with the Bioregistry
AU - Hoyt, Charles Tapley
AU - Balk, Meghan
AU - Callahan, Tiffany J.
AU - Domingo-Fernández, Daniel
AU - Haendel, Melissa A.
AU - Hegde, Harshad B.
AU - Himmelstein, Daniel S.
AU - Karis, Klas
AU - Kunze, John
AU - Lubiana, Tiago
AU - Matentzoglu, Nicolas
AU - McMurry, Julie
AU - Moxon, Sierra
AU - Mungall, Christopher J.
AU - Rutz, Adriano
AU - Unni, Deepak R.
AU - Willighagen, Egon
AU - Winston, Donald
AU - Gyori, Benjamin M.
N1 - Funding Information:
CTH, KK, BMG were funded under the Defense Advanced Research Projects Agency (DARPA) Young Faculty Award [W911NF-20-1-0255]. EW received funding by NWO grant [203.001.121]. TL received funding from the São Paulo Research Foundation (FAPESP) grant 2019/26284-1. CJM, MAH, and JM were supported by a NIH Office of the Director Grant #5R24OD011883. The authors would like to acknowledge the developers, maintainers, and curators of each of the external registries referenced throughout this manuscript and used in the Bioregistry, whose prior work enabled ours. More detailed “live” acknowledgments for these registries can be found at https://bioregistry.io/acknowledgements .
Publisher Copyright:
© 2022, The Author(s).
PY - 2022/12
Y1 - 2022/12
N2 - The standardized identification of biomedical entities is a cornerstone of interoperability, reuse, and data integration in the life sciences. Several registries have been developed to catalog resources maintaining identifiers for biomedical entities such as small molecules, proteins, cell lines, and clinical trials. However, existing registries have struggled to provide sufficient coverage and metadata standards that meet the evolving needs of modern life sciences researchers. Here, we introduce the Bioregistry, an integrative, open, community-driven metaregistry that synthesizes and substantially expands upon 23 existing registries. The Bioregistry addresses the need for a sustainable registry by leveraging public infrastructure and automation, and employing a progressive governance model centered around open code and open data to foster community contribution. The Bioregistry can be used to support the standardized annotation of data, models, ontologies, and scientific literature, thereby promoting their interoperability and reuse. The Bioregistry can be accessed through https://bioregistry.io and its source code and data are available under the MIT and CC0 Licenses at https://github.com/biopragmatics/bioregistry.
AB - The standardized identification of biomedical entities is a cornerstone of interoperability, reuse, and data integration in the life sciences. Several registries have been developed to catalog resources maintaining identifiers for biomedical entities such as small molecules, proteins, cell lines, and clinical trials. However, existing registries have struggled to provide sufficient coverage and metadata standards that meet the evolving needs of modern life sciences researchers. Here, we introduce the Bioregistry, an integrative, open, community-driven metaregistry that synthesizes and substantially expands upon 23 existing registries. The Bioregistry addresses the need for a sustainable registry by leveraging public infrastructure and automation, and employing a progressive governance model centered around open code and open data to foster community contribution. The Bioregistry can be used to support the standardized annotation of data, models, ontologies, and scientific literature, thereby promoting their interoperability and reuse. The Bioregistry can be accessed through https://bioregistry.io and its source code and data are available under the MIT and CC0 Licenses at https://github.com/biopragmatics/bioregistry.
UR - http://www.scopus.com/inward/record.url?scp=85142281057&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85142281057&partnerID=8YFLogxK
U2 - 10.1038/s41597-022-01807-3
DO - 10.1038/s41597-022-01807-3
M3 - Article
C2 - 36402838
AN - SCOPUS:85142281057
SN - 2052-4463
VL - 9
JO - Scientific Data
JF - Scientific Data
IS - 1
M1 - 714
ER -