Cloud computing for genomic data analysis and collaboration

Ben Langmead, Abhinav Nellore

Research output: Contribution to journalReview articlepeer-review

140 Scopus citations


Next-generation sequencing has made major strides in the past decade. Studies based on large sequencing data sets are growing in number, and public archives for raw sequencing data have been doubling in size every 18 months. Leveraging these data requires researchers to use large-scale computational resources. Cloud computing, a model whereby users rent computers and storage from large data centres, is a solution that is gaining traction in genomics research. Here, we describe how cloud computing is used in genomics for research and large-scale collaborations, and argue that its elasticity, reproducibility and privacy features make it ideally suited for the large-scale reanalysis of publicly available archived data, including privacy-protected data.

Original languageEnglish (US)
Pages (from-to)208-219
Number of pages12
JournalNature Reviews Genetics
Issue number4
StatePublished - Apr 1 2018

ASJC Scopus subject areas

  • Molecular Biology
  • Genetics
  • Genetics(clinical)


Dive into the research topics of 'Cloud computing for genomic data analysis and collaboration'. Together they form a unique fingerprint.

Cite this