Jun 30,2018 البحث العلمي والدراسات العليا, الهندسة المعلوماتية والاتصالات

Emotional Audio Visual Arabic Text to Speech

Author

  1. Abou Zliekha, S. Al-Moubayed, O. Al-Dakkak, N. Ghneim

Published in

Conference Paper, Conference: 14th European Signal Processing Conference (EUSIPCO), At Florence, Italy, September 2006

Abstract

The goal of this paper is to present an emotional audio-visual Text to speech system for the Arabic Language. The system is based on two entities: un emotional audio text to speech system which generates speech depending on the input text and the desired emotion type, and un emotional Visual model which generates the talking heads, by forming the corresponding visemes. The phonemes to visemes mapping, and the emotion shaping use a 3-paramertic face model, based on the Abstract Muscle Model. We have thirteen viseme models and five emotions as parameters to the face model. The TTS produces the phonemes corresponding to the input text, the speech with the suitable prosody to include the prescribed emotion. In parallel the system generates the visemes and sends the controls to the facial model to get the animation of the talking head in real time.

Link to read full paper

https://www.researchgate.net/publication/228944294_Emotional_audio_visual_arabic_text_to_speech