Revisiting Weimar Film Reviewers’ Sentiments: Integrating Lexicon-Based Sentiment Analysis with Large Language Models

Film reviews are an obvious area for the application of sentiment analysis, but while this is common in the field of computer science, it has been mostly absent in film studies. Film scholars have quite rightly been skeptical of such techniques due to their inability to grasp nuanced critical texts....

Full description

Saved in:
Bibliographic Details
Main Authors: Isadora Campregher Paiva, Josephine Diecke
Format: Article
Language:English
Published: Department of Languages, Literatures, and Cultures at McGill University 2024-07-01
Series:Journal of Cultural Analytics
Online Access:https://doi.org/10.22148/001c.118497
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832583269785272320
author Isadora Campregher Paiva
Josephine Diecke
author_facet Isadora Campregher Paiva
Josephine Diecke
author_sort Isadora Campregher Paiva
collection DOAJ
description Film reviews are an obvious area for the application of sentiment analysis, but while this is common in the field of computer science, it has been mostly absent in film studies. Film scholars have quite rightly been skeptical of such techniques due to their inability to grasp nuanced critical texts. Recent technological developments have, however, given us cause to re-evaluate the usefulness of automated sentiment analysis for historical film reviews. The release of ever more sophisticated Large Language Models (LLMs) has shown that their capacity to handle nuanced language could overcome some of the shortcomings of lexicon-based sentiment analysis. Applying it to historical film reviews seemed logical and promising to us. Some of our early optimism was misplaced: while LLMs, and in particular ChatGPT, proved indeed to be much more adept at dealing with nuanced language, they are also difficult to control and implement in a consistent and reproducible way -- two things that lexicon-based sentiment analysis excels at. Given these contrasting sets of strengths and weaknesses, we propose an innovative solution which combines the two, and has more accurate results. In a two-step process, we first harness ChatGPT's more nuanced grasp of language to undertake a verbose sentiment analysis, in which the model is prompted to explain its judgment of the film reviews at length. We then apply a lexicon-based sentiment analysis (with Python's NLTK library and its VADER lexicon) to the result of ChatGPT's analysis, thus achieving systematic results. When applied to a corpus of 80 reviews of three canonical Weimar films (*Das Cabinet des Dr. Caligari*, *Metropolis* and *Nosferatu*), this approach successfully recognized the sentiments of 88.75% of reviews, a considerable improvement when compared to the accuracy rate of the direct application of VADER to the reviews (66.25%). These results are particularly impressive given that this corpus is especially challenging for automated sentiment analysis, with a prevalence of macabre themes, which can easily trigger falsely negative results, and a high number of mixed reviews. We believe this hybrid approach could prove useful for application in large corpora, for which close reading of all reviews would be humanly impossible.
format Article
id doaj-art-f870b81b92184704a5a9cdd95f05f990
institution Kabale University
issn 2371-4549
language English
publishDate 2024-07-01
publisher Department of Languages, Literatures, and Cultures at McGill University
record_format Article
series Journal of Cultural Analytics
spelling doaj-art-f870b81b92184704a5a9cdd95f05f9902025-01-28T22:11:32ZengDepartment of Languages, Literatures, and Cultures at McGill UniversityJournal of Cultural Analytics2371-45492024-07-0194Revisiting Weimar Film Reviewers’ Sentiments: Integrating Lexicon-Based Sentiment Analysis with Large Language ModelsIsadora Campregher PaivaJosephine DieckeFilm reviews are an obvious area for the application of sentiment analysis, but while this is common in the field of computer science, it has been mostly absent in film studies. Film scholars have quite rightly been skeptical of such techniques due to their inability to grasp nuanced critical texts. Recent technological developments have, however, given us cause to re-evaluate the usefulness of automated sentiment analysis for historical film reviews. The release of ever more sophisticated Large Language Models (LLMs) has shown that their capacity to handle nuanced language could overcome some of the shortcomings of lexicon-based sentiment analysis. Applying it to historical film reviews seemed logical and promising to us. Some of our early optimism was misplaced: while LLMs, and in particular ChatGPT, proved indeed to be much more adept at dealing with nuanced language, they are also difficult to control and implement in a consistent and reproducible way -- two things that lexicon-based sentiment analysis excels at. Given these contrasting sets of strengths and weaknesses, we propose an innovative solution which combines the two, and has more accurate results. In a two-step process, we first harness ChatGPT's more nuanced grasp of language to undertake a verbose sentiment analysis, in which the model is prompted to explain its judgment of the film reviews at length. We then apply a lexicon-based sentiment analysis (with Python's NLTK library and its VADER lexicon) to the result of ChatGPT's analysis, thus achieving systematic results. When applied to a corpus of 80 reviews of three canonical Weimar films (*Das Cabinet des Dr. Caligari*, *Metropolis* and *Nosferatu*), this approach successfully recognized the sentiments of 88.75% of reviews, a considerable improvement when compared to the accuracy rate of the direct application of VADER to the reviews (66.25%). These results are particularly impressive given that this corpus is especially challenging for automated sentiment analysis, with a prevalence of macabre themes, which can easily trigger falsely negative results, and a high number of mixed reviews. We believe this hybrid approach could prove useful for application in large corpora, for which close reading of all reviews would be humanly impossible.https://doi.org/10.22148/001c.118497
spellingShingle Isadora Campregher Paiva
Josephine Diecke
Revisiting Weimar Film Reviewers’ Sentiments: Integrating Lexicon-Based Sentiment Analysis with Large Language Models
Journal of Cultural Analytics
title Revisiting Weimar Film Reviewers’ Sentiments: Integrating Lexicon-Based Sentiment Analysis with Large Language Models
title_full Revisiting Weimar Film Reviewers’ Sentiments: Integrating Lexicon-Based Sentiment Analysis with Large Language Models
title_fullStr Revisiting Weimar Film Reviewers’ Sentiments: Integrating Lexicon-Based Sentiment Analysis with Large Language Models
title_full_unstemmed Revisiting Weimar Film Reviewers’ Sentiments: Integrating Lexicon-Based Sentiment Analysis with Large Language Models
title_short Revisiting Weimar Film Reviewers’ Sentiments: Integrating Lexicon-Based Sentiment Analysis with Large Language Models
title_sort revisiting weimar film reviewers sentiments integrating lexicon based sentiment analysis with large language models
url https://doi.org/10.22148/001c.118497
work_keys_str_mv AT isadoracampregherpaiva revisitingweimarfilmreviewerssentimentsintegratinglexiconbasedsentimentanalysiswithlargelanguagemodels
AT josephinediecke revisitingweimarfilmreviewerssentimentsintegratinglexiconbasedsentimentanalysiswithlargelanguagemodels