TY - GEN
T1 - On the reliability of feature attribution methods for speech classification
AU - Shen, Gaofei
AU - Mohebbi, Hosein
AU - Bisazza, Arianna
AU - Alishahi, Afra
AU - Chrupała, Grzegorz
N1 - Publisher Copyright: © 2025 International Speech Communication Association. All rights reserved.
PY - 2025
Y1 - 2025
N2 - As the capabilities of large-scale pre-trained models evolve, understanding the determinants of their outputs becomes more important. Feature attribution aims to reveal which parts of the input elements contribute the most to model outputs. In speech processing, the unique characteristics of the input signal make the application of feature attribution methods challenging. We study how factors such as input type and aggregation and perturbation timespan impact the reliability of standard feature attribution methods, and how these factors interact with characteristics of each classification task. We find that standard approaches to feature attribution are generally unreliable when applied to the speech domain, with the exception of word-aligned perturbation methods when applied to word-based classification tasks.
AB - As the capabilities of large-scale pre-trained models evolve, understanding the determinants of their outputs becomes more important. Feature attribution aims to reveal which parts of the input elements contribute the most to model outputs. In speech processing, the unique characteristics of the input signal make the application of feature attribution methods challenging. We study how factors such as input type and aggregation and perturbation timespan impact the reliability of standard feature attribution methods, and how these factors interact with characteristics of each classification task. We find that standard approaches to feature attribution are generally unreliable when applied to the speech domain, with the exception of word-aligned perturbation methods when applied to word-based classification tasks.
KW - feature attribution
KW - interpretability
KW - speech processing
UR - https://www.scopus.com/pages/publications/105020026871
U2 - 10.21437/Interspeech.2025-1911
DO - 10.21437/Interspeech.2025-1911
M3 - Conference contribution
AN - SCOPUS:105020026871
T3 - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
SP - 266
EP - 270
BT - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
PB - ISCA
T2 - 26th Interspeech Conference 2025
Y2 - 17 August 2025 through 21 August 2025
ER -