Pairwise Comparison Versus Likert Scale for Biomedical Image Assessment

Andrew S. Phelps; David M. Naeger; Jesse L. Courtier; Jack W. Lambert; Peter A. Marcovici; Javier E. Villanueva-Meyer; John D. MacKenzie

doi:10.2214/AJR.14.13022

Pairwise Comparison Versus Likert Scale for Biomedical Image Assessment

Andrew S. Phelps, David M. Naeger, Jesse L. Courtier, Jack W. Lambert, Peter A. Marcovici, Javier E. Villanueva-Meyer, John D. MacKenzie

Research output: Contribution to journal › Article › peer-review

51 Scopus citations

Abstract

OBJECTIVE Biomedical imaging research relies heavily on the subjective and semiquantitative reader analysis of images. Current methods are limited hy interreader variability and fixed upper and lower limits. The purpose of this study was to compare the performance of two assessment methods, pairwise comparison and Likert scale, for improved analysis of biomedical images. MATERIALS AND METHODS. A set of 10 images with varying degrees of image sharpness was created by digitally blurring a normal clinical chest radiograph. Readers assessed the degree of image sharpness using two different methods: pairwise comparison and a 10-poinl Likert scale. Reader agreement with actual chest radiograph sharpness was calculated for each method by use of the Lin concordance correlation coefficient (CCC). RESULTS. Reader accuracy was highest for pairwise comparison (CCC, 1.0) and ranked Likert (CCC, 0.99) scores and lowest for nonranked Likert scores (CCC, 0.83). Accuracy improved slightly when readers repeated their assessments (CCC, 0.87) or had reference images available (CCC, 0.91). CONCLUSION. Pairwise comparison and ranked Likert scores yield more accurate reader assessments than nonranked Likert scores.

Original language	English (US)
Pages (from-to)	8-14
Number of pages	7
Journal	American Journal of Roentgenology
Volume	204
Issue number	1
DOIs	https://doi.org/10.2214/AJR.14.13022
State	Published - Jan 1 2015
Externally published	Yes

Keywords

Image assessment
Likert scale
Pairwise comparison

ASJC Scopus subject areas

Radiology Nuclear Medicine and imaging

Access to Document

10.2214/AJR.14.13022

Cite this

@article{1001cbebd8d048cd83697cb7155d87dd,

title = "Pairwise Comparison Versus Likert Scale for Biomedical Image Assessment",

abstract = "OBJECTIVE Biomedical imaging research relies heavily on the subjective and semiquantitative reader analysis of images. Current methods are limited hy interreader variability and fixed upper and lower limits. The purpose of this study was to compare the performance of two assessment methods, pairwise comparison and Likert scale, for improved analysis of biomedical images. MATERIALS AND METHODS. A set of 10 images with varying degrees of image sharpness was created by digitally blurring a normal clinical chest radiograph. Readers assessed the degree of image sharpness using two different methods: pairwise comparison and a 10-poinl Likert scale. Reader agreement with actual chest radiograph sharpness was calculated for each method by use of the Lin concordance correlation coefficient (CCC). RESULTS. Reader accuracy was highest for pairwise comparison (CCC, 1.0) and ranked Likert (CCC, 0.99) scores and lowest for nonranked Likert scores (CCC, 0.83). Accuracy improved slightly when readers repeated their assessments (CCC, 0.87) or had reference images available (CCC, 0.91). CONCLUSION. Pairwise comparison and ranked Likert scores yield more accurate reader assessments than nonranked Likert scores.",

keywords = "Image assessment, Likert scale, Pairwise comparison",

author = "Phelps, {Andrew S.} and Naeger, {David M.} and Courtier, {Jesse L.} and Lambert, {Jack W.} and Marcovici, {Peter A.} and Villanueva-Meyer, {Javier E.} and MacKenzie, {John D.}",

note = "Publisher Copyright: {\textcopyright} American Roentgen Ray Society.",

year = "2015",

month = jan,

day = "1",

doi = "10.2214/AJR.14.13022",

language = "English (US)",

volume = "204",

pages = "8--14",

journal = "American Journal of Roentgenology",

issn = "0361-803X",

publisher = "American Roentgen Ray Society",

number = "1",

}

TY - JOUR

T1 - Pairwise Comparison Versus Likert Scale for Biomedical Image Assessment

AU - Phelps, Andrew S.

AU - Naeger, David M.

AU - Courtier, Jesse L.

AU - Lambert, Jack W.

AU - Marcovici, Peter A.

AU - Villanueva-Meyer, Javier E.

AU - MacKenzie, John D.

N1 - Publisher Copyright: © American Roentgen Ray Society.

PY - 2015/1/1

Y1 - 2015/1/1

N2 - OBJECTIVE Biomedical imaging research relies heavily on the subjective and semiquantitative reader analysis of images. Current methods are limited hy interreader variability and fixed upper and lower limits. The purpose of this study was to compare the performance of two assessment methods, pairwise comparison and Likert scale, for improved analysis of biomedical images. MATERIALS AND METHODS. A set of 10 images with varying degrees of image sharpness was created by digitally blurring a normal clinical chest radiograph. Readers assessed the degree of image sharpness using two different methods: pairwise comparison and a 10-poinl Likert scale. Reader agreement with actual chest radiograph sharpness was calculated for each method by use of the Lin concordance correlation coefficient (CCC). RESULTS. Reader accuracy was highest for pairwise comparison (CCC, 1.0) and ranked Likert (CCC, 0.99) scores and lowest for nonranked Likert scores (CCC, 0.83). Accuracy improved slightly when readers repeated their assessments (CCC, 0.87) or had reference images available (CCC, 0.91). CONCLUSION. Pairwise comparison and ranked Likert scores yield more accurate reader assessments than nonranked Likert scores.

AB - OBJECTIVE Biomedical imaging research relies heavily on the subjective and semiquantitative reader analysis of images. Current methods are limited hy interreader variability and fixed upper and lower limits. The purpose of this study was to compare the performance of two assessment methods, pairwise comparison and Likert scale, for improved analysis of biomedical images. MATERIALS AND METHODS. A set of 10 images with varying degrees of image sharpness was created by digitally blurring a normal clinical chest radiograph. Readers assessed the degree of image sharpness using two different methods: pairwise comparison and a 10-poinl Likert scale. Reader agreement with actual chest radiograph sharpness was calculated for each method by use of the Lin concordance correlation coefficient (CCC). RESULTS. Reader accuracy was highest for pairwise comparison (CCC, 1.0) and ranked Likert (CCC, 0.99) scores and lowest for nonranked Likert scores (CCC, 0.83). Accuracy improved slightly when readers repeated their assessments (CCC, 0.87) or had reference images available (CCC, 0.91). CONCLUSION. Pairwise comparison and ranked Likert scores yield more accurate reader assessments than nonranked Likert scores.

KW - Image assessment

KW - Likert scale

KW - Pairwise comparison

UR - http://www.scopus.com/inward/record.url?scp=84924928742&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84924928742&partnerID=8YFLogxK

U2 - 10.2214/AJR.14.13022

DO - 10.2214/AJR.14.13022

M3 - Article

C2 - 25539230

AN - SCOPUS:84924928742

SN - 0361-803X

VL - 204

SP - 8

EP - 14

JO - American Journal of Roentgenology

JF - American Journal of Roentgenology

IS - 1

ER -

Pairwise Comparison Versus Likert Scale for Biomedical Image Assessment

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this