Deepfakes in Ophthalmology: Applications and Realism of Synthetic Retinal Images from Generative Adversarial Networks

Jimmy S. Chen, Aaron S. Coyner, R. V.Paul Chan, M. Elizabeth Hartnett, Darius M. Moshfeghi, Leah A. Owen, Jayashree Kalpathy-Cramer, Michael F. Chiang, J. Peter Campbell

Research output: Contribution to journalArticlepeer-review

21 Scopus citations

Abstract

Purpose: Generative adversarial networks (GANs) are deep learning (DL) models that can create and modify realistic-appearing synthetic images, or deepfakes, from real images. The purpose of our study was to evaluate the ability of experts to discern synthesized retinal fundus images from real fundus images and to review the current uses and limitations of GANs in ophthalmology. Design: Development and expert evaluation of a GAN and an informal review of the literature. Participants: A total of 4282 image pairs of fundus images and retinal vessel maps acquired from a multicenter ROP screening program. Methods: Pix2Pix HD, a high-resolution GAN, was first trained and validated on fundus and vessel map image pairs and subsequently used to generate 880 images from a held-out test set. Fifty synthetic images from this test set and 50 different real images were presented to 4 expert ROP ophthalmologists using a custom online system for evaluation of whether the images were real or synthetic. Literature was reviewed on PubMed and Google Scholars using combinations of the terms ophthalmology, GANs, generative adversarial networks, ophthalmology, images, deepfakes, and synthetic. Ancestor search was performed to broaden results. Main Outcome Measures: Expert ability to discern real versus synthetic images was evaluated using percent accuracy. Statistical significance was evaluated using a Fisher exact test, with P values ≤ 0.05 thresholded for significance. Results: The expert majority correctly identified 59% of images as being real or synthetic (P = 0.1). Experts 1 to 4 correctly identified 54%, 58%, 49%, and 61% of images (P = 0.505, 0.158, 1.000, and 0.043, respectively). These results suggest that the majority of experts could not discern between real and synthetic images. Additionally, we identified 20 implementations of GANs in the ophthalmology literature, with applications in a variety of imaging modalities and ophthalmic diseases. Conclusions: Generative adversarial networks can create synthetic fundus images that are indiscernible from real fundus images by expert ROP ophthalmologists. Synthetic images may improve dataset augmentation for DL, may be used in trainee education, and may have implications for patient privacy.

Original languageEnglish (US)
Article number100079
JournalOphthalmology Science
Volume1
Issue number4
DOIs
StatePublished - Dec 2021

Keywords

  • Deep learning
  • Generative adversarial networks
  • Ophthalmology
  • Synthetic images

ASJC Scopus subject areas

  • Ophthalmology

Fingerprint

Dive into the research topics of 'Deepfakes in Ophthalmology: Applications and Realism of Synthetic Retinal Images from Generative Adversarial Networks'. Together they form a unique fingerprint.

Cite this