Tibetan–Chinese speech-to-speech translation based on discrete units

Abstract: Speech-to-speech translation (S2ST) has evolved from cascade systems, which integrate Automatic Speech Recognition (ASR), Machine Translation (MT), and Text-to-Speech (TTS), to end-to-end models. This evolution has been driven by advancements in model performance and the expansion of cross-l...

Bibliographic Details
Main Authors: Zairan Gong, Xiaona Xu, Yue Zhao
Format: Article
Language: English
Published: Nature Portfolio 2025-01-01
Series: Scientific Reports
Online Access: https://doi.org/10.1038/s41598-025-85782-w