LipBengal: Pioneering Bengali lip-reading dataset for pronunciation mapping through lip gesturesHugging Face
The LipBengal dataset represents a significant advancement in Bengali lip-reading and visual speech recognition research, poised to drive future applications and technological progress. Despite Bengali's global status as the seventh most spoken language with approximately 265 million speakers,...
Saved in:
Main Authors: | , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2025-02-01
|
Series: | Data in Brief |
Subjects: | |
Online Access: | http://www.sciencedirect.com/science/article/pii/S2352340924012162 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832576520393064448 |
---|---|
author | Md. Tanvir Rahman Sahed Md. Tanjil Islam Aronno Hussain Nyeem Md. Abdul Wahed Tashrif Ahsan R Rafiul Islam Tareque Bashar Ovi Manab Kumar Kundu Jane Alam Sadeef |
author_facet | Md. Tanvir Rahman Sahed Md. Tanjil Islam Aronno Hussain Nyeem Md. Abdul Wahed Tashrif Ahsan R Rafiul Islam Tareque Bashar Ovi Manab Kumar Kundu Jane Alam Sadeef |
author_sort | Md. Tanvir Rahman Sahed |
collection | DOAJ |
description | The LipBengal dataset represents a significant advancement in Bengali lip-reading and visual speech recognition research, poised to drive future applications and technological progress. Despite Bengali's global status as the seventh most spoken language with approximately 265 million speakers, linguistically rich and widely spoken languages like Bengali have been largely overlooked by the research community. LipBengal fills this gap by offering a pioneering dataset tailored for Bengali lip-reading, comprising visual data from 150 speakers across 54 classes, encompassing Bengali phonemes, alphabets, and symbols. Captured under diverse and uncontrolled conditions, LipBengal stands as the most extensive Bengali lip-reading dataset to date, designed to facilitate robust benchmarking and validation of novel deep learning architectures. Detailed annotations extend from phoneme- level classifications to full sentence constructions, providing a granular and comprehensive dataset. The primary potential of LipBengal lies in its thorough coverage of Bengali phonemes, capturing diverse lip movements linked to distinct sounds. This rich dataset holds promise for training accurate lip-reading models, with implications for improved accessibility, enhanced speech recognition, silent speech interfaces, and linguistic research. The dataset's diversity in speaker backgrounds enhances its utility, ensuring broader representation of Bengali pronunciation patterns. Meticulous annotation and curation further bolster its quality and reliability, making LipBengal a valuable asset for researchers and developers in the field. |
format | Article |
id | doaj-art-f745a061be3a437782f00830b7f05d28 |
institution | Kabale University |
issn | 2352-3409 |
language | English |
publishDate | 2025-02-01 |
publisher | Elsevier |
record_format | Article |
series | Data in Brief |
spelling | doaj-art-f745a061be3a437782f00830b7f05d282025-01-31T05:11:41ZengElsevierData in Brief2352-34092025-02-0158111254LipBengal: Pioneering Bengali lip-reading dataset for pronunciation mapping through lip gesturesHugging FaceMd. Tanvir Rahman Sahed0Md. Tanjil Islam Aronno1Hussain Nyeem2Md. Abdul Wahed3Tashrif Ahsan4R Rafiul Islam5Tareque Bashar Ovi6Manab Kumar Kundu7Jane Alam Sadeef8Department of Electrical, Electronic and Communication Engineering, Military Institute of Science and Technology (MIST), Dhaka 1216, BangladeshDepartment of Electrical, Electronic and Communication Engineering, Military Institute of Science and Technology (MIST), Dhaka 1216, BangladeshCorresponding author.; Department of Electrical, Electronic and Communication Engineering, Military Institute of Science and Technology (MIST), Dhaka 1216, BangladeshDepartment of Electrical, Electronic and Communication Engineering, Military Institute of Science and Technology (MIST), Dhaka 1216, BangladeshDepartment of Electrical, Electronic and Communication Engineering, Military Institute of Science and Technology (MIST), Dhaka 1216, BangladeshDepartment of Electrical, Electronic and Communication Engineering, Military Institute of Science and Technology (MIST), Dhaka 1216, BangladeshDepartment of Electrical, Electronic and Communication Engineering, Military Institute of Science and Technology (MIST), Dhaka 1216, BangladeshDepartment of Electrical, Electronic and Communication Engineering, Military Institute of Science and Technology (MIST), Dhaka 1216, BangladeshDepartment of Electrical, Electronic and Communication Engineering, Military Institute of Science and Technology (MIST), Dhaka 1216, BangladeshThe LipBengal dataset represents a significant advancement in Bengali lip-reading and visual speech recognition research, poised to drive future applications and technological progress. Despite Bengali's global status as the seventh most spoken language with approximately 265 million speakers, linguistically rich and widely spoken languages like Bengali have been largely overlooked by the research community. LipBengal fills this gap by offering a pioneering dataset tailored for Bengali lip-reading, comprising visual data from 150 speakers across 54 classes, encompassing Bengali phonemes, alphabets, and symbols. Captured under diverse and uncontrolled conditions, LipBengal stands as the most extensive Bengali lip-reading dataset to date, designed to facilitate robust benchmarking and validation of novel deep learning architectures. Detailed annotations extend from phoneme- level classifications to full sentence constructions, providing a granular and comprehensive dataset. The primary potential of LipBengal lies in its thorough coverage of Bengali phonemes, capturing diverse lip movements linked to distinct sounds. This rich dataset holds promise for training accurate lip-reading models, with implications for improved accessibility, enhanced speech recognition, silent speech interfaces, and linguistic research. The dataset's diversity in speaker backgrounds enhances its utility, ensuring broader representation of Bengali pronunciation patterns. Meticulous annotation and curation further bolster its quality and reliability, making LipBengal a valuable asset for researchers and developers in the field.http://www.sciencedirect.com/science/article/pii/S2352340924012162Bengali LanguageLip-readingVisual speech recognition (VSR)Lip gesturesPhonemeDataset |
spellingShingle | Md. Tanvir Rahman Sahed Md. Tanjil Islam Aronno Hussain Nyeem Md. Abdul Wahed Tashrif Ahsan R Rafiul Islam Tareque Bashar Ovi Manab Kumar Kundu Jane Alam Sadeef LipBengal: Pioneering Bengali lip-reading dataset for pronunciation mapping through lip gesturesHugging Face Data in Brief Bengali Language Lip-reading Visual speech recognition (VSR) Lip gestures Phoneme Dataset |
title | LipBengal: Pioneering Bengali lip-reading dataset for pronunciation mapping through lip gesturesHugging Face |
title_full | LipBengal: Pioneering Bengali lip-reading dataset for pronunciation mapping through lip gesturesHugging Face |
title_fullStr | LipBengal: Pioneering Bengali lip-reading dataset for pronunciation mapping through lip gesturesHugging Face |
title_full_unstemmed | LipBengal: Pioneering Bengali lip-reading dataset for pronunciation mapping through lip gesturesHugging Face |
title_short | LipBengal: Pioneering Bengali lip-reading dataset for pronunciation mapping through lip gesturesHugging Face |
title_sort | lipbengal pioneering bengali lip reading dataset for pronunciation mapping through lip gestureshugging face |
topic | Bengali Language Lip-reading Visual speech recognition (VSR) Lip gestures Phoneme Dataset |
url | http://www.sciencedirect.com/science/article/pii/S2352340924012162 |
work_keys_str_mv | AT mdtanvirrahmansahed lipbengalpioneeringbengalilipreadingdatasetforpronunciationmappingthroughlipgestureshuggingface AT mdtanjilislamaronno lipbengalpioneeringbengalilipreadingdatasetforpronunciationmappingthroughlipgestureshuggingface AT hussainnyeem lipbengalpioneeringbengalilipreadingdatasetforpronunciationmappingthroughlipgestureshuggingface AT mdabdulwahed lipbengalpioneeringbengalilipreadingdatasetforpronunciationmappingthroughlipgestureshuggingface AT tashrifahsan lipbengalpioneeringbengalilipreadingdatasetforpronunciationmappingthroughlipgestureshuggingface AT rrafiulislam lipbengalpioneeringbengalilipreadingdatasetforpronunciationmappingthroughlipgestureshuggingface AT tarequebasharovi lipbengalpioneeringbengalilipreadingdatasetforpronunciationmappingthroughlipgestureshuggingface AT manabkumarkundu lipbengalpioneeringbengalilipreadingdatasetforpronunciationmappingthroughlipgestureshuggingface AT janealamsadeef lipbengalpioneeringbengalilipreadingdatasetforpronunciationmappingthroughlipgestureshuggingface |