On the reliability of feature attribution methods for speech classification

  • Gaofei Shen
  • , Hosein Mohebbi
  • , Arianna Bisazza
  • , Afra Alishahi
  • , Grzegorz Chrupała

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

Abstract

As the capabilities of large-scale pre-trained models evolve, understanding the determinants of their outputs becomes more important. Feature attribution aims to reveal which parts of the input elements contribute the most to model outputs. In speech processing, the unique characteristics of the input signal make the application of feature attribution methods challenging. We study how factors such as input type and aggregation and perturbation timespan impact the reliability of standard feature attribution methods, and how these factors interact with characteristics of each classification task. We find that standard approaches to feature attribution are generally unreliable when applied to the speech domain, with the exception of word-aligned perturbation methods when applied to word-based classification tasks.

Original languageEnglish
Title of host publicationProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
PublisherISCA
Pages266-270
Number of pages5
DOIs
Publication statusPublished - 2025
Event26th Interspeech Conference 2025 - Rotterdam, Netherlands
Duration: 17-Aug-202521-Aug-2025

Publication series

NameProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
ISSN (Print)2308-457X

Conference

Conference26th Interspeech Conference 2025
Country/TerritoryNetherlands
CityRotterdam
Period17/08/202521/08/2025

Keywords

  • feature attribution
  • interpretability
  • speech processing

Fingerprint

Dive into the research topics of 'On the reliability of feature attribution methods for speech classification'. Together they form a unique fingerprint.

Cite this