Digital chefs and intelligent cooking systems based on multimodal large language model

A digital chef and an intelligent cooking method were proposed to achieve high-quality, precise cooking results. In the offline phase, visual, auditory and thermal sensors record professional chefs' continuous cooking operations. The collected frame-by-frame images and multi-round Q&A t...

Full description

Saved in:
Bibliographic Details
Main Authors: LI Xinyuan, LI Bai, SUN Yueshuo, ZHANG Tantan, TIAN Yonglin, YIN Zhuyan, WANG Fei-Yue
Format: Article
Language:zho
Published: POSTS&TELECOM PRESS Co., LTD 2024-12-01
Series:智能科学与技术学报
Subjects:
Online Access:http://www.cjist.com.cn/zh/article/doi/10.11959/j.issn.2096-6652.202448/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832586368948109312
author LI Xinyuan
LI Bai
SUN Yueshuo
ZHANG Tantan
TIAN Yonglin
YIN Zhuyan
WANG Fei-Yue
author_facet LI Xinyuan
LI Bai
SUN Yueshuo
ZHANG Tantan
TIAN Yonglin
YIN Zhuyan
WANG Fei-Yue
author_sort LI Xinyuan
collection DOAJ
description A digital chef and an intelligent cooking method were proposed to achieve high-quality, precise cooking results. In the offline phase, visual, auditory and thermal sensors record professional chefs' continuous cooking operations. The collected frame-by-frame images and multi-round Q&A texts form a culinary expert knowledge base. A low-rank adaptation method was applied to fine-tune a pretrained multimodal large language model, enabling it to understand cooking intentions. In the online phase, real-time sensory data were converted into image-text inputs for the fine-tuned model, which then generated cooking instructions to guide users through the cooking steps. A hardware-software cooking system was implemented and tested with a pan-frying steak task. Experimental results show that the fine-tuned system effectively controls the steak's doneness and quality, and significantly improves the accuracy and rationality of cooking instructions compared to the model before fine-tuning.
format Article
id doaj-art-c771741ac1014dc98fa4d946e92abbe3
institution Kabale University
issn 2096-6652
language zho
publishDate 2024-12-01
publisher POSTS&TELECOM PRESS Co., LTD
record_format Article
series 智能科学与技术学报
spelling doaj-art-c771741ac1014dc98fa4d946e92abbe32025-01-25T19:00:53ZzhoPOSTS&TELECOM PRESS Co., LTD智能科学与技术学报2096-66522024-12-01642944481046628Digital chefs and intelligent cooking systems based on multimodal large language modelLI XinyuanLI BaiSUN YueshuoZHANG TantanTIAN YonglinYIN ZhuyanWANG Fei-YueA digital chef and an intelligent cooking method were proposed to achieve high-quality, precise cooking results. In the offline phase, visual, auditory and thermal sensors record professional chefs' continuous cooking operations. The collected frame-by-frame images and multi-round Q&A texts form a culinary expert knowledge base. A low-rank adaptation method was applied to fine-tune a pretrained multimodal large language model, enabling it to understand cooking intentions. In the online phase, real-time sensory data were converted into image-text inputs for the fine-tuned model, which then generated cooking instructions to guide users through the cooking steps. A hardware-software cooking system was implemented and tested with a pan-frying steak task. Experimental results show that the fine-tuned system effectively controls the steak's doneness and quality, and significantly improves the accuracy and rationality of cooking instructions compared to the model before fine-tuning.http://www.cjist.com.cn/zh/article/doi/10.11959/j.issn.2096-6652.202448/multimodal large language modeldigital chefintelligent cookingcooking robotexpert systemartificial intelligence
spellingShingle LI Xinyuan
LI Bai
SUN Yueshuo
ZHANG Tantan
TIAN Yonglin
YIN Zhuyan
WANG Fei-Yue
Digital chefs and intelligent cooking systems based on multimodal large language model
智能科学与技术学报
multimodal large language model
digital chef
intelligent cooking
cooking robot
expert system
artificial intelligence
title Digital chefs and intelligent cooking systems based on multimodal large language model
title_full Digital chefs and intelligent cooking systems based on multimodal large language model
title_fullStr Digital chefs and intelligent cooking systems based on multimodal large language model
title_full_unstemmed Digital chefs and intelligent cooking systems based on multimodal large language model
title_short Digital chefs and intelligent cooking systems based on multimodal large language model
title_sort digital chefs and intelligent cooking systems based on multimodal large language model
topic multimodal large language model
digital chef
intelligent cooking
cooking robot
expert system
artificial intelligence
url http://www.cjist.com.cn/zh/article/doi/10.11959/j.issn.2096-6652.202448/
work_keys_str_mv AT lixinyuan digitalchefsandintelligentcookingsystemsbasedonmultimodallargelanguagemodel
AT libai digitalchefsandintelligentcookingsystemsbasedonmultimodallargelanguagemodel
AT sunyueshuo digitalchefsandintelligentcookingsystemsbasedonmultimodallargelanguagemodel
AT zhangtantan digitalchefsandintelligentcookingsystemsbasedonmultimodallargelanguagemodel
AT tianyonglin digitalchefsandintelligentcookingsystemsbasedonmultimodallargelanguagemodel
AT yinzhuyan digitalchefsandintelligentcookingsystemsbasedonmultimodallargelanguagemodel
AT wangfeiyue digitalchefsandintelligentcookingsystemsbasedonmultimodallargelanguagemodel