What if HAL breathed? Enhancing Empathy in Human-AI Interactions with Breathing Speech Synthesis

Publication date

2024

Authors

Loddo, Nicolò
Pessanha, FranciscaORCID 0000-0002-3711-7814ISNI 0000000524640122
Akdağ Salah, A. A.ORCID 0000-0002-7204-5633ISNI 0000000050543653

Editors

Advisors

Supervisors

Document Type

/dk/atira/pure/researchoutput/researchoutputtypes/contributiontojournal/conferencearticle
Open Access logo

License

Abstract

AI Agents increasingly leverage speech synthesis models to communicate with their users. This study explores the integration of breathing patterns into synthesized speech to deepen empathy towards AI agents. The research methodologically diverges from traditional empathy studies and speech evaluation standards by proposing to the participants the resolution of an emotional dilemma within a cooperative game scenario, where they face a choice reflecting their empathetic engagement with an AI partner. The introduction of a novel speech assessment method that takes into account the interactive and contextual aspects of conversational speech is the first novelty of the paper. The second novelty is in the findings which indicate that breathing in synthesized speech significantly enhances agents' perceived naturalness and users' empathy towards them.

Keywords

breathing, empathy, speech synthesis, virtual agents, Language and Linguistics, Human-Computer Interaction, Signal Processing, Software, Modelling and Simulation

Citation

Loddo, N, Pessanha, F & Akdag, A 2024, 'What if HAL breathed? Enhancing Empathy in Human-AI Interactions with Breathing Speech Synthesis', Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 2625-2629. https://doi.org/10.21437/Interspeech.2024-1083