SarcasticSpeech: Speech Synthesis for Sarcasm in Low-Resource Scenarios

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

Abstract

Sarcastic speech synthesis, the ability to generate speech that conveys sarcasm, can have several significant implications in various contexts, such as entertainment and better human-computer interaction. This study presents a first attempt to apply transfer learning techniques from a diverse speech style dataset to the challenging domain of sarcastic speech synthesis. The limited availability of specific sarcastic speech data poses significant challenges in capturing the expressive nature of sarcasm. By leveraging transfer learning, a pre-trained model is fine-tuned using a dataset encompassing various speech styles, including sarcastic speech. The synthesized sound contains some robotic elements, indicating moderate performance improvements in sarcastic speech synthesis through transfer learning. Future work will explore the application of multi-modal approaches to improve sarcastic speech synthesis and further enhance the expressiveness and naturalness of generated sarcastic speech.

Original languageEnglish
Title of host publicationProceedings 12th ISCA Speech Synthesis Workshop (SSW2023)
PublisherISCA
Pages242-243
Number of pages2
Publication statusPublished - Aug-2023
Event12th ISCA Speech Synthesis Workshop (SSW2023) - Grenoble, France
Duration: 26-Aug-202328-Aug-2023

Conference

Conference12th ISCA Speech Synthesis Workshop (SSW2023)
Country/TerritoryFrance
CityGrenoble
Period26/08/202328/08/2023

Fingerprint

Dive into the research topics of 'SarcasticSpeech: Speech Synthesis for Sarcasm in Low-Resource Scenarios'. Together they form a unique fingerprint.

Cite this