Cross-Attention Fusion of Visual and Geometric Features for Large-Vocabulary Arabic Lipreading
Lipreading involves recognizing spoken words by analyzing the movements of the lips and surrounding area using visual data. It is an emerging research topic with many potential applications, such as human–machine interaction and enhancing audio-based speech recognition. Recent deep learning approach...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: | MDPI AG, 2025-01-01 |
Series: | Technologies |
Online Access: | https://www.mdpi.com/2227-7080/13/1/26 |