End-to-End Multi-Speaker FastSpeech2 With Hierarchical Decoder
Multi-speaker text-to-speech (TTS) systems play a crucial role in different applications, such as personalized voice assistants, audiobooks, and multilingual speech synthesis. These systems aim to generate high-quality, natural-sounding speech while preserving the distinct characteristics of differe...
Saved in:
| Main Authors: | Majid Adibian, Hossein Zeinali |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
IEEE
2025-01-01
|
| Series: | IEEE Access |
| Subjects: | |
| Online Access: | https://ieeexplore.ieee.org/document/11080147/ |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
ECE-TTS: A Zero-Shot Emotion Text-to-Speech Model with Simplified and Precise Control
by: Shixiong Liang, et al.
Published: (2025-05-01) -
Assessment of the Speech Material Usability for Forensic Speaker Identification by Voice and Sounding Speech
by: T. N. Svirava, et al.
Published: (2025-04-01) -
In Memoriam Professor Wojciech Majewski
by: Andrzej Bogdan DOBRUCKI
Published: (2021-08-01) -
Assessing the effectiveness of diarization algorithms in costa rican children-adult speech according to age group and gender
by: Alejandro Chacón-Vargas, et al.
Published: (2022-11-01) -
Privacy-Preserving Deep Speaker Separation for Smartphone-Based Passive Speech Assessment
by: Apiwat Ditthapron, et al.
Published: (2021-01-01)