Using pupil dilation to measure cognitive load when listening to text-to-speech in quiet and in noise

Avashna Govender, Anita E. Wagner, Simon King

    Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

    11 Citations (Scopus)
    10 Downloads (Pure)

    Abstract

    With increased use of text-to-speech (TTS) systems in real-world applications, evaluating how such systems influence the human cognitive processing system becomes important. Particularly in situations where cognitive load is high, there may be negative implications such as fatigue. For example, noisy situations generally require the listener to exert increased mental effort. A better understanding of this could eventually suggest new ways of generating synthetic speech that demands low cognitive load. In our previous study, pupil dilation was used as an index of cognitive effort. Pupil dilation was shown to be sensitive to the quality of synthetic speech, but there were some uncertainties regarding exactly what was being measured. The current study resolves some of those uncertainties. Additionally, we investigate how the pupil dilates when listening to synthetic speech in the presence of speech-shaped noise. Our results show that, in quiet listening conditions, pupil dilation does not reflect listening effort but rather attention and engagement. In noisy conditions, increased pupil dilation indicates that listening effort increases as signal-to-noise ratio decreases, under all conditions tested.

    Original languageEnglish
    Title of host publication20th Annual Conference of the International Speech Communication Association
    Subtitle of host publicationCrossroads of Speech and Language
    PublisherISCA
    Pages1551-1555
    Number of pages5
    DOIs
    Publication statusPublished - 2019
    Event20th Annual Conference of the International Speech Communication Association: Crossroads of Speech and Language, INTERSPEECH 2019 - Graz, Austria
    Duration: 15-Sept-201919-Sept-2019

    Publication series

    NameProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
    PublisherISCA
    Volume2019-September
    ISSN (Print)2308-457X

    Conference

    Conference20th Annual Conference of the International Speech Communication Association: Crossroads of Speech and Language, INTERSPEECH 2019
    Country/TerritoryAustria
    CityGraz
    Period15/09/201919/09/2019

    Keywords

    • Adverse conditions
    • Cognitive load
    • Pupillometry
    • Text-to-speech

    Fingerprint

    Dive into the research topics of 'Using pupil dilation to measure cognitive load when listening to text-to-speech in quiet and in noise'. Together they form a unique fingerprint.

    Cite this