Current Practices in Voice Data Collection and Limitations to Voice AI Research: A National Survey

Bridge2AI-Voice

doi:10.1002/lary.31052

Current Practices in Voice Data Collection and Limitations to Voice AI Research: A National Survey

Bridge2AI-Voice

Medical Informatics and Clinical Epidemiology

Research output: Contribution to journal › Article › peer-review

Abstract

Introduction: Accuracy and validity of voice AI algorithms rely on substantial quality voice data. Although commensurable amounts of voice data are captured daily in voice centers across North America, there is no standardized protocol for acoustic data management, which limits the usability of these datasets for voice artificial intelligence (AI) research. Objective: The aim was to capture current practices of voice data collection, storage, analysis, and perceived limitations to collaborative voice research. Methods: A 30-question online survey was developed with expert guidance from the voicecollab.ai members, an international collaborative of voice AI researchers. The survey was disseminated via REDCap to an estimated 200 practitioners at North American voice centers. Survey questions assessed respondents' current practices in terms of acoustic data collection, storage, and retrieval as well as limitations to collaborative voice research. Results: Seventy-two respondents completed the survey of which 81.7% were laryngologists and 18.3% were speech language pathologists (SLPs). Eighteen percent of respondents reported seeing 40%–60% and 55% reported seeing >60 patients with voice disorders weekly (conservative estimate of over 4000 patients/week). Only 28% of respondents reported utilizing standardized protocols for collection and storage of acoustic data. Although, 87% of respondents conduct voice research, only 38% of respondents report doing so on a multi-institutional level. Perceived limitations to conducting collaborative voice research include lack of standardized methodology for collection (30%) and lack of human resources to prepare and label voice data adequately (55%). Conclusion: To conduct large-scale multi-institutional voice research with AI, there is a pertinent need for standardization of acoustic data management, as well as an infrastructure for secure and efficient data sharing. Level of Evidence: 5 Laryngoscope, 134:1333–1339, 2024.

Original language	English (US)
Pages (from-to)	1333-1339
Number of pages	7
Journal	Laryngoscope
Volume	134
Issue number	3
DOIs	https://doi.org/10.1002/lary.31052
State	Published - Mar 2024

Keywords

artificial intelligence
current practices
data collection
voice

ASJC Scopus subject areas

Otorhinolaryngology

Access to Document

10.1002/lary.31052

Cite this

@article{44e2554e7ed34d378221695add2f0537,

title = "Current Practices in Voice Data Collection and Limitations to Voice AI Research: A National Survey",

abstract = "Introduction: Accuracy and validity of voice AI algorithms rely on substantial quality voice data. Although commensurable amounts of voice data are captured daily in voice centers across North America, there is no standardized protocol for acoustic data management, which limits the usability of these datasets for voice artificial intelligence (AI) research. Objective: The aim was to capture current practices of voice data collection, storage, analysis, and perceived limitations to collaborative voice research. Methods: A 30-question online survey was developed with expert guidance from the voicecollab.ai members, an international collaborative of voice AI researchers. The survey was disseminated via REDCap to an estimated 200 practitioners at North American voice centers. Survey questions assessed respondents' current practices in terms of acoustic data collection, storage, and retrieval as well as limitations to collaborative voice research. Results: Seventy-two respondents completed the survey of which 81.7% were laryngologists and 18.3% were speech language pathologists (SLPs). Eighteen percent of respondents reported seeing 40%–60% and 55% reported seeing >60 patients with voice disorders weekly (conservative estimate of over 4000 patients/week). Only 28% of respondents reported utilizing standardized protocols for collection and storage of acoustic data. Although, 87% of respondents conduct voice research, only 38% of respondents report doing so on a multi-institutional level. Perceived limitations to conducting collaborative voice research include lack of standardized methodology for collection (30%) and lack of human resources to prepare and label voice data adequately (55%). Conclusion: To conduct large-scale multi-institutional voice research with AI, there is a pertinent need for standardization of acoustic data management, as well as an infrastructure for secure and efficient data sharing. Level of Evidence: 5 Laryngoscope, 134:1333–1339, 2024.",

keywords = "artificial intelligence, current practices, data collection, voice",

author = "Bridge2AI-Voice and Emily Evangelista and Rohan Kale and Desiree McCutcheon and Anais Rameau and Alexander Gelbard and Maria Powell and Michael Johns and Anthony Law and Phillip Song and Matthew Naunheim and Stephanie Watts and Bryson, {Paul C.} and Crowson, {Matthew G.} and Jeremy Pinto and {Bensoussan Yael}, E. and Elemento Olivier and Rameau Ana{\"i}s and Sigaras Alexandros and Ghosh Satrajit and {Powell Maria}, E. and Johnson Alistair and Ravitsky Vardit and Jean-Christophe, {B{\'e}lisle Pipon} and Dorr David and Payne Phillip and Yael Bensoussan",

note = "Publisher Copyright: {\textcopyright} 2023 The American Laryngological, Rhinological and Otological Society, Inc.",

year = "2024",

month = mar,

doi = "10.1002/lary.31052",

language = "English (US)",

volume = "134",

pages = "1333--1339",

journal = "Laryngoscope",

issn = "0023-852X",

publisher = "John Wiley and Sons Inc.",

number = "3",

}

TY - JOUR

T1 - Current Practices in Voice Data Collection and Limitations to Voice AI Research

T2 - A National Survey

AU - Bridge2AI-Voice

AU - Evangelista, Emily

AU - Kale, Rohan

AU - McCutcheon, Desiree

AU - Rameau, Anais

AU - Gelbard, Alexander

AU - Powell, Maria

AU - Johns, Michael

AU - Law, Anthony

AU - Song, Phillip

AU - Naunheim, Matthew

AU - Watts, Stephanie

AU - Bryson, Paul C.

AU - Crowson, Matthew G.

AU - Pinto, Jeremy

AU - Bensoussan Yael, E.

AU - Olivier, Elemento

AU - Anaïs, Rameau

AU - Alexandros, Sigaras

AU - Satrajit, Ghosh

AU - Powell Maria, E.

AU - Alistair, Johnson

AU - Vardit, Ravitsky

AU - Jean-Christophe, Bélisle Pipon

AU - David, Dorr

AU - Phillip, Payne

AU - Bensoussan, Yael

PY - 2024/3

Y1 - 2024/3

N2 - Introduction: Accuracy and validity of voice AI algorithms rely on substantial quality voice data. Although commensurable amounts of voice data are captured daily in voice centers across North America, there is no standardized protocol for acoustic data management, which limits the usability of these datasets for voice artificial intelligence (AI) research. Objective: The aim was to capture current practices of voice data collection, storage, analysis, and perceived limitations to collaborative voice research. Methods: A 30-question online survey was developed with expert guidance from the voicecollab.ai members, an international collaborative of voice AI researchers. The survey was disseminated via REDCap to an estimated 200 practitioners at North American voice centers. Survey questions assessed respondents' current practices in terms of acoustic data collection, storage, and retrieval as well as limitations to collaborative voice research. Results: Seventy-two respondents completed the survey of which 81.7% were laryngologists and 18.3% were speech language pathologists (SLPs). Eighteen percent of respondents reported seeing 40%–60% and 55% reported seeing >60 patients with voice disorders weekly (conservative estimate of over 4000 patients/week). Only 28% of respondents reported utilizing standardized protocols for collection and storage of acoustic data. Although, 87% of respondents conduct voice research, only 38% of respondents report doing so on a multi-institutional level. Perceived limitations to conducting collaborative voice research include lack of standardized methodology for collection (30%) and lack of human resources to prepare and label voice data adequately (55%). Conclusion: To conduct large-scale multi-institutional voice research with AI, there is a pertinent need for standardization of acoustic data management, as well as an infrastructure for secure and efficient data sharing. Level of Evidence: 5 Laryngoscope, 134:1333–1339, 2024.

AB - Introduction: Accuracy and validity of voice AI algorithms rely on substantial quality voice data. Although commensurable amounts of voice data are captured daily in voice centers across North America, there is no standardized protocol for acoustic data management, which limits the usability of these datasets for voice artificial intelligence (AI) research. Objective: The aim was to capture current practices of voice data collection, storage, analysis, and perceived limitations to collaborative voice research. Methods: A 30-question online survey was developed with expert guidance from the voicecollab.ai members, an international collaborative of voice AI researchers. The survey was disseminated via REDCap to an estimated 200 practitioners at North American voice centers. Survey questions assessed respondents' current practices in terms of acoustic data collection, storage, and retrieval as well as limitations to collaborative voice research. Results: Seventy-two respondents completed the survey of which 81.7% were laryngologists and 18.3% were speech language pathologists (SLPs). Eighteen percent of respondents reported seeing 40%–60% and 55% reported seeing >60 patients with voice disorders weekly (conservative estimate of over 4000 patients/week). Only 28% of respondents reported utilizing standardized protocols for collection and storage of acoustic data. Although, 87% of respondents conduct voice research, only 38% of respondents report doing so on a multi-institutional level. Perceived limitations to conducting collaborative voice research include lack of standardized methodology for collection (30%) and lack of human resources to prepare and label voice data adequately (55%). Conclusion: To conduct large-scale multi-institutional voice research with AI, there is a pertinent need for standardization of acoustic data management, as well as an infrastructure for secure and efficient data sharing. Level of Evidence: 5 Laryngoscope, 134:1333–1339, 2024.

KW - artificial intelligence

KW - current practices

KW - data collection

KW - voice

UR - http://www.scopus.com/inward/record.url?scp=85179984218&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85179984218&partnerID=8YFLogxK

U2 - 10.1002/lary.31052

DO - 10.1002/lary.31052

M3 - Article

C2 - 38087983

AN - SCOPUS:85179984218

SN - 0023-852X

VL - 134

SP - 1333

EP - 1339

JO - Laryngoscope

JF - Laryngoscope

IS - 3

ER -

Current Practices in Voice Data Collection and Limitations to Voice AI Research: A National Survey

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this