Indonesian Voice Cloning Text-to-Speech System With Vall-E-Based Model and Speech Enhancement

Indonesian Voice Cloning Text-to-Speech System With Vall-E-Based Model and Speech Enhancement

In recent years, Text-to-Speech (TTS) technology has advanced, with research focusing on multi-speaker TTS capable of voice cloning. In 2023, Wang et al. introduced Vall-E, a Transformer-based neural codec language model, achieving state-of-the-art results in voice cloning. However, limited research...

Full description

Saved in:

Bibliographic Details
Main Authors:	Hizkia Raditya Pratama Roosadi, Rizki Rivai Ginanjar, Dessi Puji Lestari
Format:	Article
Language:	English
Published:	IEEE 2024-01-01
Series:	IEEE Access
Subjects:	Neural codec language model speech enhancement transformer text-to-speech Vall-E voice cloning
Online Access:	https://ieeexplore.ieee.org/document/10806715/
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Silent-Hidden-Voice Attack on Speech Recognition System
by: Hyun Kwon, et al.
Published: (2024-01-01)

Voice, speech and gender:
by: Erwan Pépiot
Published: (2015-06-01)

Performance Analysis: AI-based VIST Audio Player by Microsoft Speech API
by: Ribwar Bakhtyar Ibrahim
Published: (2021-07-01)

Speech Signal Analysis in Patients with Parkinson’s Disease, Taking into Account Phonation, Articulation, and Prosody of Speech
by: Ewelina Majda-Zdancewicz, et al.
Published: (2024-11-01)

Advances in Automated Voice Pathology Detection: A Comprehensive Review of Speech Signal Analysis Techniques
by: Anitha Sankaran, et al.
Published: (2024-01-01)

Perception of vocoded speech in domestic dogs
by: Amritha Mallikarjun, et al.
Published: (2024-04-01)

CODE-SWITCHING: STUDY ON THE SPEECH OF INDONESIAN JAVANESE EDUCATED BILINGUALS
by: Sudarsono Sudarsono
Published: (2021-10-01)

HILLARY CLINTON’S CONCESSION SPEECH : A CONSTRUCTIVISM STUDY OF DISCOURSE
by: Khairunnisa Khairunnisa, et al.
Published: (2018-01-01)

ZeST: A Zero-Resourced Speech-to-Speech Translation Approach for Unknown, Unpaired, and Untranscribed Languages
by: Luan Thanh Nguyen, et al.
Published: (2025-01-01)

Speech Recognition for the Sterile Interaction with Information Systems in the Surgical Area
by: Schrüfer Katrin V., et al.
Published: (2024-09-01)

Improving Speech Recognition Rate through Analysis Parameters
by: Eringis Deividas, et al.
Published: (2014-05-01)

Le “Written Speech” yeatsien et ses expressions scéniques
by: Pierre Longuenesse
Published: (2013-06-01)

Chinese Alaryngeal Speech Rehabilitation and Their Acoustical Characteristics: A Comprehensive Review
by: Lawrence Ng Manwa, et al.
Published: (2015-08-01)

Measuring Speech Intelligibility with Romanian Synthetic Unpredictable Sentences in Normal Hearing
by: Oana Astefanei, et al.
Published: (2024-12-01)

Recent advancements in automatic disordered speech recognition: A survey paper
by: Nada Gohider, et al.
Published: (2024-12-01)

Deep Speech Synthesis and Its Implications for News Verification: Lessons Learned in the RTVE-UGR Chair
by: Daniel Calderón-González, et al.
Published: (2024-10-01)

The voice of depression: speech features as biomarkers for major depressive disorder
by: Felix Menne, et al.
Published: (2024-11-01)

AN ERROR ANALYSIS OF THE TRANSFORMATION FROM DIRECT TO INDIRECT SPEECH
by: Zahratul Idami
Published: (2018-01-01)

Suitability of Speech Files for Automatic Speech Recognition Systems after Noise Reduction Procedures
by: R.Kh. Latypov, et al.
Published: (2015-12-01)

His Master’s Voice: Sound Devices in Bram Stoker’s Dracula
by: Maria Parrino
Published: (2021-11-01)

Tackling hate speech online: The effect of counter-speech on subsequent bystander behavioral intentions
by: Yue Jia, et al.
Published: (2025-01-01)

Influence of the Sound Source’s Position on the Stage on the Speech Perception in the Auditorium
by: Stefan Brachmański, et al.
Published: (2024-12-01)

Speech Perception as a Function of the Number of Channels and Channel Interaction in Cochlear Implant Simulation
by: Mustafa YUKSEL, et al.
Published: (2023-12-01)

Research on Safety Risks of Speech Information
by: H. V. Davydau, et al.
Published: (2022-11-01)

Backdoor Defence for Voice Print Recognition Model Based on Speech Enhancement and Weight Pruning
by: Jiawei Zhu, et al.
Published: (2022-01-01)

Co-speech gestures influence the magnitude and stability of articulatory movements: evidence for coupling-based enhancement
by: Karee Garvin, et al.
Published: (2025-01-01)

Dataset of speech produced with delayed auditory feedbackOpen Science FrameworkOpen Science Framework
by: Matthias Heyne, et al.
Published: (2025-04-01)

PERCEPTION AND RECOGNITION OF CONCEPTS OF SPEECH ACTS IN VOCAL COMMUNICATION
by: E. I. Grigoriev
Published: (2013-10-01)

Quantifying and Characterizing Phonetic Reduction in Italian Natural Speech
by: Loredana Schettino, et al.
Published: (2025-01-01)

Materializations of oricha voice through divinations in Cuban Santería
by: Kristina Wirtz
Published: (2018-06-01)

Voice Quality in British English. Its Nature, Functions and Applications
by: Stephan Wilhelm
Published: (2019-11-01)

Speech-language pathology
by: Maryluz Camargo-Mendoza
Published: (2015-07-01)

Parliamentary rules of procedure and parliamentary speech
by: Milić Dejan A.
Published: (2023-01-01)

Teaching coherent descriptive speech to children of senior preschool age with speech underdevelopment problems
by: L. G. Shadrina, et al.
Published: (2020-05-01)

Syllable as a Synchronization Mechanism That Makes Human Speech Possible
by: Yi Xu
Published: (2024-12-01)

“A medley of voices”, polyphonie et discours rapportés dans Lolita de Nabokov
by: Yannicke Chupin
Published: (2010-12-01)

Freedom of speech in the United States /
by: Tedford, Thomas L.
Published: (2001)

Speech Disorders of the Type of Aphasia after Right Hemisphere Stroke in Right — handed People — a Case Study
by: Jolanta Zielińska
Published: (2015-06-01)

SpeechMatch—A novel digital approach to supporting communication for neurodiverse groups
by: Sarah Lennard, et al.
Published: (2024-12-01)

8～64kbit/s super-wideband embedded speech and audio coding algorithm
by: JIA Mao-shen, et al.
Published: (2009-01-01)