Revisiting Weimar Film Reviewers’ Sentiments: Integrating Lexicon-Based Sentiment Analysis with Large Language Models

Film reviews are an obvious area for the application of sentiment analysis, but while this is common in the field of computer science, it has been mostly absent in film studies. Film scholars have quite rightly been skeptical of such techniques due to their inability to grasp nuanced critical texts....

Full description

Saved in:

Bibliographic Details
Main Authors:	Isadora Campregher Paiva, Josephine Diecke
Format:	Article
Language:	English
Published:	Department of Languages, Literatures, and Cultures at McGill University 2024-07-01
Series:	Journal of Cultural Analytics
Online Access:	https://doi.org/10.22148/001c.118497
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1832583269785272320
author	Isadora Campregher Paiva Josephine Diecke
author_facet	Isadora Campregher Paiva Josephine Diecke
author_sort	Isadora Campregher Paiva
collection	DOAJ
description	Film reviews are an obvious area for the application of sentiment analysis, but while this is common in the field of computer science, it has been mostly absent in film studies. Film scholars have quite rightly been skeptical of such techniques due to their inability to grasp nuanced critical texts. Recent technological developments have, however, given us cause to re-evaluate the usefulness of automated sentiment analysis for historical film reviews. The release of ever more sophisticated Large Language Models (LLMs) has shown that their capacity to handle nuanced language could overcome some of the shortcomings of lexicon-based sentiment analysis. Applying it to historical film reviews seemed logical and promising to us. Some of our early optimism was misplaced: while LLMs, and in particular ChatGPT, proved indeed to be much more adept at dealing with nuanced language, they are also difficult to control and implement in a consistent and reproducible way -- two things that lexicon-based sentiment analysis excels at. Given these contrasting sets of strengths and weaknesses, we propose an innovative solution which combines the two, and has more accurate results. In a two-step process, we first harness ChatGPT's more nuanced grasp of language to undertake a verbose sentiment analysis, in which the model is prompted to explain its judgment of the film reviews at length. We then apply a lexicon-based sentiment analysis (with Python's NLTK library and its VADER lexicon) to the result of ChatGPT's analysis, thus achieving systematic results. When applied to a corpus of 80 reviews of three canonical Weimar films (Das Cabinet des Dr. Caligari, Metropolis and Nosferatu), this approach successfully recognized the sentiments of 88.75% of reviews, a considerable improvement when compared to the accuracy rate of the direct application of VADER to the reviews (66.25%). These results are particularly impressive given that this corpus is especially challenging for automated sentiment analysis, with a prevalence of macabre themes, which can easily trigger falsely negative results, and a high number of mixed reviews. We believe this hybrid approach could prove useful for application in large corpora, for which close reading of all reviews would be humanly impossible.
format	Article
id	doaj-art-f870b81b92184704a5a9cdd95f05f990
institution	Kabale University
issn	2371-4549
language	English
publishDate	2024-07-01
publisher	Department of Languages, Literatures, and Cultures at McGill University
record_format	Article
series	Journal of Cultural Analytics
spelling	doaj-art-f870b81b92184704a5a9cdd95f05f9902025-01-28T22:11:32ZengDepartment of Languages, Literatures, and Cultures at McGill UniversityJournal of Cultural Analytics2371-45492024-07-0194Revisiting Weimar Film Reviewers’ Sentiments: Integrating Lexicon-Based Sentiment Analysis with Large Language ModelsIsadora Campregher PaivaJosephine DieckeFilm reviews are an obvious area for the application of sentiment analysis, but while this is common in the field of computer science, it has been mostly absent in film studies. Film scholars have quite rightly been skeptical of such techniques due to their inability to grasp nuanced critical texts. Recent technological developments have, however, given us cause to re-evaluate the usefulness of automated sentiment analysis for historical film reviews. The release of ever more sophisticated Large Language Models (LLMs) has shown that their capacity to handle nuanced language could overcome some of the shortcomings of lexicon-based sentiment analysis. Applying it to historical film reviews seemed logical and promising to us. Some of our early optimism was misplaced: while LLMs, and in particular ChatGPT, proved indeed to be much more adept at dealing with nuanced language, they are also difficult to control and implement in a consistent and reproducible way -- two things that lexicon-based sentiment analysis excels at. Given these contrasting sets of strengths and weaknesses, we propose an innovative solution which combines the two, and has more accurate results. In a two-step process, we first harness ChatGPT's more nuanced grasp of language to undertake a verbose sentiment analysis, in which the model is prompted to explain its judgment of the film reviews at length. We then apply a lexicon-based sentiment analysis (with Python's NLTK library and its VADER lexicon) to the result of ChatGPT's analysis, thus achieving systematic results. When applied to a corpus of 80 reviews of three canonical Weimar films (Das Cabinet des Dr. Caligari, Metropolis and Nosferatu), this approach successfully recognized the sentiments of 88.75% of reviews, a considerable improvement when compared to the accuracy rate of the direct application of VADER to the reviews (66.25%). These results are particularly impressive given that this corpus is especially challenging for automated sentiment analysis, with a prevalence of macabre themes, which can easily trigger falsely negative results, and a high number of mixed reviews. We believe this hybrid approach could prove useful for application in large corpora, for which close reading of all reviews would be humanly impossible.https://doi.org/10.22148/001c.118497
spellingShingle	Isadora Campregher Paiva Josephine Diecke Revisiting Weimar Film Reviewers’ Sentiments: Integrating Lexicon-Based Sentiment Analysis with Large Language Models Journal of Cultural Analytics
title	Revisiting Weimar Film Reviewers’ Sentiments: Integrating Lexicon-Based Sentiment Analysis with Large Language Models
title_full	Revisiting Weimar Film Reviewers’ Sentiments: Integrating Lexicon-Based Sentiment Analysis with Large Language Models
title_fullStr	Revisiting Weimar Film Reviewers’ Sentiments: Integrating Lexicon-Based Sentiment Analysis with Large Language Models
title_full_unstemmed	Revisiting Weimar Film Reviewers’ Sentiments: Integrating Lexicon-Based Sentiment Analysis with Large Language Models
title_short	Revisiting Weimar Film Reviewers’ Sentiments: Integrating Lexicon-Based Sentiment Analysis with Large Language Models
title_sort	revisiting weimar film reviewers sentiments integrating lexicon based sentiment analysis with large language models
url	https://doi.org/10.22148/001c.118497
work_keys_str_mv	AT isadoracampregherpaiva revisitingweimarfilmreviewerssentimentsintegratinglexiconbasedsentimentanalysiswithlargelanguagemodels AT josephinediecke revisitingweimarfilmreviewerssentimentsintegratinglexiconbasedsentimentanalysiswithlargelanguagemodels

Revisiting Weimar Film Reviewers’ Sentiments: Integrating Lexicon-Based Sentiment Analysis with Large Language Models

Similar Items