Current Practices in Voice Data Collection and Limitations to Voice AI Research: A National Survey

Bridge2AI-Voice

Research output: Contribution to journalArticlepeer-review

Abstract

Introduction: Accuracy and validity of voice AI algorithms rely on substantial quality voice data. Although commensurable amounts of voice data are captured daily in voice centers across North America, there is no standardized protocol for acoustic data management, which limits the usability of these datasets for voice artificial intelligence (AI) research. Objective: The aim was to capture current practices of voice data collection, storage, analysis, and perceived limitations to collaborative voice research. Methods: A 30-question online survey was developed with expert guidance from the voicecollab.ai members, an international collaborative of voice AI researchers. The survey was disseminated via REDCap to an estimated 200 practitioners at North American voice centers. Survey questions assessed respondents' current practices in terms of acoustic data collection, storage, and retrieval as well as limitations to collaborative voice research. Results: Seventy-two respondents completed the survey of which 81.7% were laryngologists and 18.3% were speech language pathologists (SLPs). Eighteen percent of respondents reported seeing 40%–60% and 55% reported seeing >60 patients with voice disorders weekly (conservative estimate of over 4000 patients/week). Only 28% of respondents reported utilizing standardized protocols for collection and storage of acoustic data. Although, 87% of respondents conduct voice research, only 38% of respondents report doing so on a multi-institutional level. Perceived limitations to conducting collaborative voice research include lack of standardized methodology for collection (30%) and lack of human resources to prepare and label voice data adequately (55%). Conclusion: To conduct large-scale multi-institutional voice research with AI, there is a pertinent need for standardization of acoustic data management, as well as an infrastructure for secure and efficient data sharing. Level of Evidence: 5 Laryngoscope, 134:1333–1339, 2024.

Original languageEnglish (US)
Pages (from-to)1333-1339
Number of pages7
JournalLaryngoscope
Volume134
Issue number3
DOIs
StatePublished - Mar 2024

Keywords

  • artificial intelligence
  • current practices
  • data collection
  • voice

ASJC Scopus subject areas

  • Otorhinolaryngology

Fingerprint

Dive into the research topics of 'Current Practices in Voice Data Collection and Limitations to Voice AI Research: A National Survey'. Together they form a unique fingerprint.

Cite this