Cross-Attention Fusion of Visual and Geometric Features for Large-Vocabulary Arabic Lipreading

Lipreading involves recognizing spoken words by analyzing the movements of the lips and surrounding area using visual data. It is an emerging research topic with many potential applications, such as human–machine interaction and enhancing audio-based speech recognition. Recent deep learning approach...

Full description

Saved in:
Bibliographic Details
Main Authors: Samar Daou, Achraf Ben-Hamadou, Ahmed Rekik, Abdelaziz Kallel
Format: Article
Language:English
Published: MDPI AG 2025-01-01
Series:Technologies
Subjects:
Online Access:https://www.mdpi.com/2227-7080/13/1/26
Tags: Add Tag
No Tags, Be the first to tag this record!

Similar Items