Combining automatic speech recognition with semantic natural language processing in schizophrenia

S. Ciampelli*, A. E. Voppel, J. N. de Boer, S. Koops, I. E.C. Sommer

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

16 Citations (Scopus)
180 Downloads (Pure)

Abstract

Natural language processing (NLP) tools are increasingly used to quantify semantic anomalies in schizophrenia. Automatic speech recognition (ASR) technology, if robust enough, could significantly speed up the NLP research process. In this study, we assessed the performance of a state-of-the-art ASR tool and its impact on diagnostic classification accuracy based on a NLP model. We compared ASR to human transcripts quantitatively (Word Error Rate (WER)) and qualitatively by analyzing error type and position. Subsequently, we evaluated the impact of ASR on classification accuracy using semantic similarity measures. Two random forest classifiers were trained with similarity measures derived from automatic and manual transcriptions, and their performance was compared. The ASR tool had a mean WER of 30.4%. Pronouns and words in sentence-final position had the highest WERs. The classification accuracy was 76.7% (sensitivity 70%; specificity 86%) using automated transcriptions and 79.8% (sensitivity 75%; specificity 86%) for manual transcriptions. The difference in performance between the models was not significant. These findings demonstrate that using ASR for semantic analysis is associated with only a small decrease in accuracy in classifying schizophrenia, compared to manual transcripts. Thus, combining ASR technology with semantic NLP models qualifies as a robust and efficient method for diagnosing schizophrenia.

Original languageEnglish
Article number115252
JournalPsychiatry Research
Volume325
DOIs
Publication statusPublished - Jul-2023

Keywords

  • Psychosis
  • Semantic similarity
  • Speech analysis
  • Word error rate

Fingerprint

Dive into the research topics of 'Combining automatic speech recognition with semantic natural language processing in schizophrenia'. Together they form a unique fingerprint.

Cite this