Tibetan–Chinese speech-to-speech translation based on discrete units
Abstract Speech-to-speech translation (S2ST) has evolved from cascade systems which integrate Automatic Speech Recognition (ASR), Machine Translation (MT), and Text-to-Speech (TTS), to end-to-end models. This evolution has been driven by advancements in model performance and the expansion of cross-l...
Saved in:
Main Authors: | Zairan Gong, Xiaona Xu, Yue Zhao |
---|---|
Format: | Article |
Language: | English |
Published: |
Nature Portfolio
2025-01-01
|
Series: | Scientific Reports |
Online Access: | https://doi.org/10.1038/s41598-025-85782-w |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
End-to-End Speech Synthesis for Tibetan Multidialect
by: Xiaona Xu, et al.
Published: (2021-01-01) -
Multitask Learning with Local Attention for Tibetan Speech Recognition
by: Hui Wang, et al.
Published: (2020-01-01) -
Freedom of speech in the United States /
by: Tedford, Thomas L.
Published: (2001) -
ZeST: A Zero-Resourced Speech-to-Speech Translation Approach for Unknown, Unpaired, and Untranscribed Languages
by: Luan Thanh Nguyen, et al.
Published: (2025-01-01) -
Speech Writing and Types of Speeches
by: Ricky Telg
Published: (2011-08-01)