Sentiment analysis of emoji fused reviews using machine learning and Bert

Abstract The usage of Natural Language Processing (NLP) technology powered by Artificial Intelligence in processing of customer feedback has helped in making critical decisions for business growth in the aviation sector. It is observed that in many of the cases, emojis and emoticons are found to con...

Full description

Saved in:
Bibliographic Details
Main Authors: Amit Khan, Dipankar Majumdar, Bikromadittya Mondal
Format: Article
Language:English
Published: Nature Portfolio 2025-03-01
Series:Scientific Reports
Subjects:
Online Access:https://doi.org/10.1038/s41598-025-92286-0
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Abstract The usage of Natural Language Processing (NLP) technology powered by Artificial Intelligence in processing of customer feedback has helped in making critical decisions for business growth in the aviation sector. It is observed that in many of the cases, emojis and emoticons are found to convey a lot of significant information about the user’s opinion or experience regarding a certain product, a service or an event. Consequently, it is very much essential that these emojis/emoticons are considered for processing because they are found to play a vital role in sentiment expression, often conveying more explicit information than the text alone. Their inclusion helps in capturing nuanced sentiments, improving the overall accuracy of sentiment classification. In Spite of the fact that these elements are a significant part of the review comment provided by the customer, it is a common practice among the contemporary researchers to eliminate them right at the data-cleaning or the preprocessing stage. With an objective to provide a solution to the above drawback, we present a novel approach that performs sentiment analysis, with effective utilization of emojis and emoticons, upon the US Airline tweet dataset using various Machine Learning classifiers and the BERT model. Finally, the proposed model was evaluated using various performance metrics and achieved 92% accuracy, outperforming contemporary state-of-the-art frameworks by 9%.
ISSN:2045-2322