Advancing Spanish Speech Emotion Recognition: A Comprehensive Benchmark of Pre-Trained Models
Feature extraction for speech emotion recognition (SER) has evolved from handcrafted techniques through deep learning methods to embeddings derived from pre-trained models (PTMs). This study presents the first comparative analysis focused on using PTMs for Spanish SER, evaluating six models—Whisper,...
Saved in:
| Main Authors: | Alex Mares, Gerardo Diaz-Arango, Jorge Perez-Jacome-Friscione, Hector Vazquez-Leal, Luis Hernandez-Martinez, Jesus Huerta-Chua, Andres Felipe Jaramillo-Alvarado, Alfonso Dominguez-Chavez |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
MDPI AG
2025-04-01
|
| Series: | Applied Sciences |
| Subjects: | |
| Online Access: | https://www.mdpi.com/2076-3417/15/8/4340 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Empathetic Deep Learning: Transferring Adult Speech Emotion Models to Children With Gender-Specific Adaptations Using Neural Embeddings
by: Elina Lesyk, et al.
Published: (2024-12-01) -
Do L1 Chinese speakers use melodic strategies to convey sadness and joy in L2 Spanish? A melodic analysis of speech of L2 acted emotional speech
by: Shaohua Sun, et al.
Published: (2025-05-01) -
Advanced Identification of Prosodic Boundaries, Speakers, and Accents Through Multi-Task Audio Pre-Processing and Speech Language Models
by: Francisco Javier Lima Florido, et al.
Published: (2025-03-01) -
Transformer-based language-independent gender recognition in noisy audio environments
by: Or Haim Anidjar, et al.
Published: (2025-04-01) -
Wav2Lip Bridges Communication Gap: Automating Lip Sync and Language Translation for Indian Languages
by: Vaishnavi Venkataraghavan, et al.
Published: (2025-01-01)