A dataset of annotated free comments on the sensory perception of madeleines for benchmarking text mining techniquesRecherche.Data.Gouv

This dataset was created to investigate the impact of data collection modes and pre-processing techniques on the quality of free comment data related to consumers' sensory perceptions. A total of 200 consumers were recruited and divided into two groups of 100. Each group evaluated six madeleine...

Full description

Saved in:
Bibliographic Details
Main Authors: Michel Visalli, Ronan Symoneaux, Cécile Mursic, Margaux Touret, Flore Lourtioux, Kipédène Coulibaly, Benjamin Mahieu
Format: Article
Language:English
Published: Elsevier 2025-02-01
Series:Data in Brief
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2352340924012125
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832576453881888768
author Michel Visalli
Ronan Symoneaux
Cécile Mursic
Margaux Touret
Flore Lourtioux
Kipédène Coulibaly
Benjamin Mahieu
author_facet Michel Visalli
Ronan Symoneaux
Cécile Mursic
Margaux Touret
Flore Lourtioux
Kipédène Coulibaly
Benjamin Mahieu
author_sort Michel Visalli
collection DOAJ
description This dataset was created to investigate the impact of data collection modes and pre-processing techniques on the quality of free comment data related to consumers' sensory perceptions. A total of 200 consumers were recruited and divided into two groups of 100. Each group evaluated six madeleine samples (five distinct samples and one replicate) in a sensory analysis laboratory, using different free comment data collection modes. Consumers in the first group provided only words or short expressions, while those in the second group used complete sentences. Additionally, participants reported their liking for each sample.The collected data provided valuable insights into the effectiveness of the free comment method in sensory evaluation of food products. They emphasized the importance of data pre-processing and demonstrated how the chosen techniques can impact the quality of the results. The dataset is based on real-world consumer data, showcasing how individuals naturally express their subjective perceptions. It features descriptions that reflect authentic consumer language, including informal expressions, incorrect phrasing, spelling errors, and unstructured sentences. This raw textual data has been annotated and translated into English. The dataset can therefore be repurposed to assess and compare the performance of different text mining, natural language processing and sentiment analysis algorithms in both French and English, as well as to drive innovations in AI tools for sensory and consumer research.
format Article
id doaj-art-08f37d9d095b4536bf87f7dbc8efe5c0
institution Kabale University
issn 2352-3409
language English
publishDate 2025-02-01
publisher Elsevier
record_format Article
series Data in Brief
spelling doaj-art-08f37d9d095b4536bf87f7dbc8efe5c02025-01-31T05:11:40ZengElsevierData in Brief2352-34092025-02-0158111250A dataset of annotated free comments on the sensory perception of madeleines for benchmarking text mining techniquesRecherche.Data.GouvMichel Visalli0Ronan Symoneaux1Cécile Mursic2Margaux Touret3Flore Lourtioux4Kipédène Coulibaly5Benjamin Mahieu6Centre des Sciences du Goût et de l'Alimentation, AgroSup Dijon, CNRS, INRAE, Université Bourgogne, F-21000 Dijon, France; INRAE, PROBE research infrastructure, ChemoSens facility, F-21000 Dijon, France; Corresponding author at: Centre des Sciences du Goût et de l'Alimentation, L'Institut Agro Dijon, CNRS, INRAE, Université Bourgogne, F-21000 Dijon, France.GRAPPE, ESA, USC 1422 INRAE, SensoVeg, SFR 4207 QUASAV, 55 rue Rabelais, F-49007 Angers, FranceTechni'Sens, 17000 La Rochelle, FranceTechni'Sens, 17000 La Rochelle, FranceGRAPPE, ESA, USC 1422 INRAE, SensoVeg, SFR 4207 QUASAV, 55 rue Rabelais, F-49007 Angers, FranceCentre des Sciences du Goût et de l'Alimentation, AgroSup Dijon, CNRS, INRAE, Université Bourgogne, F-21000 Dijon, France; INRAE, PROBE research infrastructure, ChemoSens facility, F-21000 Dijon, FranceOniris, INRAE, StatSC, 44300 Nantes, FranceThis dataset was created to investigate the impact of data collection modes and pre-processing techniques on the quality of free comment data related to consumers' sensory perceptions. A total of 200 consumers were recruited and divided into two groups of 100. Each group evaluated six madeleine samples (five distinct samples and one replicate) in a sensory analysis laboratory, using different free comment data collection modes. Consumers in the first group provided only words or short expressions, while those in the second group used complete sentences. Additionally, participants reported their liking for each sample.The collected data provided valuable insights into the effectiveness of the free comment method in sensory evaluation of food products. They emphasized the importance of data pre-processing and demonstrated how the chosen techniques can impact the quality of the results. The dataset is based on real-world consumer data, showcasing how individuals naturally express their subjective perceptions. It features descriptions that reflect authentic consumer language, including informal expressions, incorrect phrasing, spelling errors, and unstructured sentences. This raw textual data has been annotated and translated into English. The dataset can therefore be repurposed to assess and compare the performance of different text mining, natural language processing and sentiment analysis algorithms in both French and English, as well as to drive innovations in AI tools for sensory and consumer research.http://www.sciencedirect.com/science/article/pii/S2352340924012125Open ended questionsNatural language processingSensory evaluationDrivers of liking
spellingShingle Michel Visalli
Ronan Symoneaux
Cécile Mursic
Margaux Touret
Flore Lourtioux
Kipédène Coulibaly
Benjamin Mahieu
A dataset of annotated free comments on the sensory perception of madeleines for benchmarking text mining techniquesRecherche.Data.Gouv
Data in Brief
Open ended questions
Natural language processing
Sensory evaluation
Drivers of liking
title A dataset of annotated free comments on the sensory perception of madeleines for benchmarking text mining techniquesRecherche.Data.Gouv
title_full A dataset of annotated free comments on the sensory perception of madeleines for benchmarking text mining techniquesRecherche.Data.Gouv
title_fullStr A dataset of annotated free comments on the sensory perception of madeleines for benchmarking text mining techniquesRecherche.Data.Gouv
title_full_unstemmed A dataset of annotated free comments on the sensory perception of madeleines for benchmarking text mining techniquesRecherche.Data.Gouv
title_short A dataset of annotated free comments on the sensory perception of madeleines for benchmarking text mining techniquesRecherche.Data.Gouv
title_sort dataset of annotated free comments on the sensory perception of madeleines for benchmarking text mining techniquesrecherche data gouv
topic Open ended questions
Natural language processing
Sensory evaluation
Drivers of liking
url http://www.sciencedirect.com/science/article/pii/S2352340924012125
work_keys_str_mv AT michelvisalli adatasetofannotatedfreecommentsonthesensoryperceptionofmadeleinesforbenchmarkingtextminingtechniquesrecherchedatagouv
AT ronansymoneaux adatasetofannotatedfreecommentsonthesensoryperceptionofmadeleinesforbenchmarkingtextminingtechniquesrecherchedatagouv
AT cecilemursic adatasetofannotatedfreecommentsonthesensoryperceptionofmadeleinesforbenchmarkingtextminingtechniquesrecherchedatagouv
AT margauxtouret adatasetofannotatedfreecommentsonthesensoryperceptionofmadeleinesforbenchmarkingtextminingtechniquesrecherchedatagouv
AT florelourtioux adatasetofannotatedfreecommentsonthesensoryperceptionofmadeleinesforbenchmarkingtextminingtechniquesrecherchedatagouv
AT kipedenecoulibaly adatasetofannotatedfreecommentsonthesensoryperceptionofmadeleinesforbenchmarkingtextminingtechniquesrecherchedatagouv
AT benjaminmahieu adatasetofannotatedfreecommentsonthesensoryperceptionofmadeleinesforbenchmarkingtextminingtechniquesrecherchedatagouv
AT michelvisalli datasetofannotatedfreecommentsonthesensoryperceptionofmadeleinesforbenchmarkingtextminingtechniquesrecherchedatagouv
AT ronansymoneaux datasetofannotatedfreecommentsonthesensoryperceptionofmadeleinesforbenchmarkingtextminingtechniquesrecherchedatagouv
AT cecilemursic datasetofannotatedfreecommentsonthesensoryperceptionofmadeleinesforbenchmarkingtextminingtechniquesrecherchedatagouv
AT margauxtouret datasetofannotatedfreecommentsonthesensoryperceptionofmadeleinesforbenchmarkingtextminingtechniquesrecherchedatagouv
AT florelourtioux datasetofannotatedfreecommentsonthesensoryperceptionofmadeleinesforbenchmarkingtextminingtechniquesrecherchedatagouv
AT kipedenecoulibaly datasetofannotatedfreecommentsonthesensoryperceptionofmadeleinesforbenchmarkingtextminingtechniquesrecherchedatagouv
AT benjaminmahieu datasetofannotatedfreecommentsonthesensoryperceptionofmadeleinesforbenchmarkingtextminingtechniquesrecherchedatagouv