LipBengal: Pioneering Bengali lip-reading dataset for pronunciation mapping through lip gesturesHugging Face

The LipBengal dataset represents a significant advancement in Bengali lip-reading and visual speech recognition research, poised to drive future applications and technological progress. Despite Bengali's global status as the seventh most spoken language with approximately 265 million speakers,...

Full description

Saved in:
Bibliographic Details
Main Authors: Md. Tanvir Rahman Sahed, Md. Tanjil Islam Aronno, Hussain Nyeem, Md. Abdul Wahed, Tashrif Ahsan, R Rafiul Islam, Tareque Bashar Ovi, Manab Kumar Kundu, Jane Alam Sadeef
Format: Article
Language:English
Published: Elsevier 2025-02-01
Series:Data in Brief
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2352340924012162
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832576520393064448
author Md. Tanvir Rahman Sahed
Md. Tanjil Islam Aronno
Hussain Nyeem
Md. Abdul Wahed
Tashrif Ahsan
R Rafiul Islam
Tareque Bashar Ovi
Manab Kumar Kundu
Jane Alam Sadeef
author_facet Md. Tanvir Rahman Sahed
Md. Tanjil Islam Aronno
Hussain Nyeem
Md. Abdul Wahed
Tashrif Ahsan
R Rafiul Islam
Tareque Bashar Ovi
Manab Kumar Kundu
Jane Alam Sadeef
author_sort Md. Tanvir Rahman Sahed
collection DOAJ
description The LipBengal dataset represents a significant advancement in Bengali lip-reading and visual speech recognition research, poised to drive future applications and technological progress. Despite Bengali's global status as the seventh most spoken language with approximately 265 million speakers, linguistically rich and widely spoken languages like Bengali have been largely overlooked by the research community. LipBengal fills this gap by offering a pioneering dataset tailored for Bengali lip-reading, comprising visual data from 150 speakers across 54 classes, encompassing Bengali phonemes, alphabets, and symbols. Captured under diverse and uncontrolled conditions, LipBengal stands as the most extensive Bengali lip-reading dataset to date, designed to facilitate robust benchmarking and validation of novel deep learning architectures. Detailed annotations extend from phoneme- level classifications to full sentence constructions, providing a granular and comprehensive dataset. The primary potential of LipBengal lies in its thorough coverage of Bengali phonemes, capturing diverse lip movements linked to distinct sounds. This rich dataset holds promise for training accurate lip-reading models, with implications for improved accessibility, enhanced speech recognition, silent speech interfaces, and linguistic research. The dataset's diversity in speaker backgrounds enhances its utility, ensuring broader representation of Bengali pronunciation patterns. Meticulous annotation and curation further bolster its quality and reliability, making LipBengal a valuable asset for researchers and developers in the field.
format Article
id doaj-art-f745a061be3a437782f00830b7f05d28
institution Kabale University
issn 2352-3409
language English
publishDate 2025-02-01
publisher Elsevier
record_format Article
series Data in Brief
spelling doaj-art-f745a061be3a437782f00830b7f05d282025-01-31T05:11:41ZengElsevierData in Brief2352-34092025-02-0158111254LipBengal: Pioneering Bengali lip-reading dataset for pronunciation mapping through lip gesturesHugging FaceMd. Tanvir Rahman Sahed0Md. Tanjil Islam Aronno1Hussain Nyeem2Md. Abdul Wahed3Tashrif Ahsan4R Rafiul Islam5Tareque Bashar Ovi6Manab Kumar Kundu7Jane Alam Sadeef8Department of Electrical, Electronic and Communication Engineering, Military Institute of Science and Technology (MIST), Dhaka 1216, BangladeshDepartment of Electrical, Electronic and Communication Engineering, Military Institute of Science and Technology (MIST), Dhaka 1216, BangladeshCorresponding author.; Department of Electrical, Electronic and Communication Engineering, Military Institute of Science and Technology (MIST), Dhaka 1216, BangladeshDepartment of Electrical, Electronic and Communication Engineering, Military Institute of Science and Technology (MIST), Dhaka 1216, BangladeshDepartment of Electrical, Electronic and Communication Engineering, Military Institute of Science and Technology (MIST), Dhaka 1216, BangladeshDepartment of Electrical, Electronic and Communication Engineering, Military Institute of Science and Technology (MIST), Dhaka 1216, BangladeshDepartment of Electrical, Electronic and Communication Engineering, Military Institute of Science and Technology (MIST), Dhaka 1216, BangladeshDepartment of Electrical, Electronic and Communication Engineering, Military Institute of Science and Technology (MIST), Dhaka 1216, BangladeshDepartment of Electrical, Electronic and Communication Engineering, Military Institute of Science and Technology (MIST), Dhaka 1216, BangladeshThe LipBengal dataset represents a significant advancement in Bengali lip-reading and visual speech recognition research, poised to drive future applications and technological progress. Despite Bengali's global status as the seventh most spoken language with approximately 265 million speakers, linguistically rich and widely spoken languages like Bengali have been largely overlooked by the research community. LipBengal fills this gap by offering a pioneering dataset tailored for Bengali lip-reading, comprising visual data from 150 speakers across 54 classes, encompassing Bengali phonemes, alphabets, and symbols. Captured under diverse and uncontrolled conditions, LipBengal stands as the most extensive Bengali lip-reading dataset to date, designed to facilitate robust benchmarking and validation of novel deep learning architectures. Detailed annotations extend from phoneme- level classifications to full sentence constructions, providing a granular and comprehensive dataset. The primary potential of LipBengal lies in its thorough coverage of Bengali phonemes, capturing diverse lip movements linked to distinct sounds. This rich dataset holds promise for training accurate lip-reading models, with implications for improved accessibility, enhanced speech recognition, silent speech interfaces, and linguistic research. The dataset's diversity in speaker backgrounds enhances its utility, ensuring broader representation of Bengali pronunciation patterns. Meticulous annotation and curation further bolster its quality and reliability, making LipBengal a valuable asset for researchers and developers in the field.http://www.sciencedirect.com/science/article/pii/S2352340924012162Bengali LanguageLip-readingVisual speech recognition (VSR)Lip gesturesPhonemeDataset
spellingShingle Md. Tanvir Rahman Sahed
Md. Tanjil Islam Aronno
Hussain Nyeem
Md. Abdul Wahed
Tashrif Ahsan
R Rafiul Islam
Tareque Bashar Ovi
Manab Kumar Kundu
Jane Alam Sadeef
LipBengal: Pioneering Bengali lip-reading dataset for pronunciation mapping through lip gesturesHugging Face
Data in Brief
Bengali Language
Lip-reading
Visual speech recognition (VSR)
Lip gestures
Phoneme
Dataset
title LipBengal: Pioneering Bengali lip-reading dataset for pronunciation mapping through lip gesturesHugging Face
title_full LipBengal: Pioneering Bengali lip-reading dataset for pronunciation mapping through lip gesturesHugging Face
title_fullStr LipBengal: Pioneering Bengali lip-reading dataset for pronunciation mapping through lip gesturesHugging Face
title_full_unstemmed LipBengal: Pioneering Bengali lip-reading dataset for pronunciation mapping through lip gesturesHugging Face
title_short LipBengal: Pioneering Bengali lip-reading dataset for pronunciation mapping through lip gesturesHugging Face
title_sort lipbengal pioneering bengali lip reading dataset for pronunciation mapping through lip gestureshugging face
topic Bengali Language
Lip-reading
Visual speech recognition (VSR)
Lip gestures
Phoneme
Dataset
url http://www.sciencedirect.com/science/article/pii/S2352340924012162
work_keys_str_mv AT mdtanvirrahmansahed lipbengalpioneeringbengalilipreadingdatasetforpronunciationmappingthroughlipgestureshuggingface
AT mdtanjilislamaronno lipbengalpioneeringbengalilipreadingdatasetforpronunciationmappingthroughlipgestureshuggingface
AT hussainnyeem lipbengalpioneeringbengalilipreadingdatasetforpronunciationmappingthroughlipgestureshuggingface
AT mdabdulwahed lipbengalpioneeringbengalilipreadingdatasetforpronunciationmappingthroughlipgestureshuggingface
AT tashrifahsan lipbengalpioneeringbengalilipreadingdatasetforpronunciationmappingthroughlipgestureshuggingface
AT rrafiulislam lipbengalpioneeringbengalilipreadingdatasetforpronunciationmappingthroughlipgestureshuggingface
AT tarequebasharovi lipbengalpioneeringbengalilipreadingdatasetforpronunciationmappingthroughlipgestureshuggingface
AT manabkumarkundu lipbengalpioneeringbengalilipreadingdatasetforpronunciationmappingthroughlipgestureshuggingface
AT janealamsadeef lipbengalpioneeringbengalilipreadingdatasetforpronunciationmappingthroughlipgestureshuggingface