Quantum many-body physics calculations with large language models

Abstract Large language models (LLMs) have demonstrated abilities to perform complex tasks in multiple domains, including mathematical and scientific reasoning. We demonstrate that with carefully designed prompts, LLMs can accurately carry out key calculations in research papers in theoretical physi...

Full description

Saved in:
Bibliographic Details
Main Authors: Haining Pan, Nayantara Mudur, William Taranto, Maria Tikhanovskaya, Subhashini Venugopalan, Yasaman Bahri, Michael P. Brenner, Eun-Ah Kim
Format: Article
Language:English
Published: Nature Portfolio 2025-01-01
Series:Communications Physics
Online Access:https://doi.org/10.1038/s42005-025-01956-y
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832571613365665792
author Haining Pan
Nayantara Mudur
William Taranto
Maria Tikhanovskaya
Subhashini Venugopalan
Yasaman Bahri
Michael P. Brenner
Eun-Ah Kim
author_facet Haining Pan
Nayantara Mudur
William Taranto
Maria Tikhanovskaya
Subhashini Venugopalan
Yasaman Bahri
Michael P. Brenner
Eun-Ah Kim
author_sort Haining Pan
collection DOAJ
description Abstract Large language models (LLMs) have demonstrated abilities to perform complex tasks in multiple domains, including mathematical and scientific reasoning. We demonstrate that with carefully designed prompts, LLMs can accurately carry out key calculations in research papers in theoretical physics. We focus on a broadly-used approximation method in quantum physics: the Hartree-Fock method, requiring an analytic multi-step calculation deriving approximate Hamiltonian and corresponding self-consistency equations. To carry out the calculations using LLMs, we design multi-step prompt templates that break down the analytic calculation into standardized steps with placeholders for problem-specific information. We evaluate GPT-4’s performance in executing the calculation for 15 papers from the past decade, demonstrating that, with the correction of intermediate steps, it can correctly derive the final Hartree-Fock Hamiltonian in 13 cases. Aggregating across all research papers, we find an average score of 87.5 (out of 100) on the execution of individual calculation steps. We further use LLMs to mitigate the two primary bottlenecks in this evaluation process: (i) extracting information from papers to fill in templates and (ii) automatic scoring of the calculation steps, demonstrating good results in both cases.
format Article
id doaj-art-c07cad183126418c9bf6406bb073ad46
institution Kabale University
issn 2399-3650
language English
publishDate 2025-01-01
publisher Nature Portfolio
record_format Article
series Communications Physics
spelling doaj-art-c07cad183126418c9bf6406bb073ad462025-02-02T12:28:05ZengNature PortfolioCommunications Physics2399-36502025-01-01811810.1038/s42005-025-01956-yQuantum many-body physics calculations with large language modelsHaining Pan0Nayantara Mudur1William Taranto2Maria Tikhanovskaya3Subhashini Venugopalan4Yasaman Bahri5Michael P. Brenner6Eun-Ah Kim7Department of Physics, Cornell UniversityGoogle ResearchDepartment of Physics, Cornell UniversityGoogle ResearchGoogle ResearchGoogle DeepMindGoogle ResearchDepartment of Physics, Cornell UniversityAbstract Large language models (LLMs) have demonstrated abilities to perform complex tasks in multiple domains, including mathematical and scientific reasoning. We demonstrate that with carefully designed prompts, LLMs can accurately carry out key calculations in research papers in theoretical physics. We focus on a broadly-used approximation method in quantum physics: the Hartree-Fock method, requiring an analytic multi-step calculation deriving approximate Hamiltonian and corresponding self-consistency equations. To carry out the calculations using LLMs, we design multi-step prompt templates that break down the analytic calculation into standardized steps with placeholders for problem-specific information. We evaluate GPT-4’s performance in executing the calculation for 15 papers from the past decade, demonstrating that, with the correction of intermediate steps, it can correctly derive the final Hartree-Fock Hamiltonian in 13 cases. Aggregating across all research papers, we find an average score of 87.5 (out of 100) on the execution of individual calculation steps. We further use LLMs to mitigate the two primary bottlenecks in this evaluation process: (i) extracting information from papers to fill in templates and (ii) automatic scoring of the calculation steps, demonstrating good results in both cases.https://doi.org/10.1038/s42005-025-01956-y
spellingShingle Haining Pan
Nayantara Mudur
William Taranto
Maria Tikhanovskaya
Subhashini Venugopalan
Yasaman Bahri
Michael P. Brenner
Eun-Ah Kim
Quantum many-body physics calculations with large language models
Communications Physics
title Quantum many-body physics calculations with large language models
title_full Quantum many-body physics calculations with large language models
title_fullStr Quantum many-body physics calculations with large language models
title_full_unstemmed Quantum many-body physics calculations with large language models
title_short Quantum many-body physics calculations with large language models
title_sort quantum many body physics calculations with large language models
url https://doi.org/10.1038/s42005-025-01956-y
work_keys_str_mv AT hainingpan quantummanybodyphysicscalculationswithlargelanguagemodels
AT nayantaramudur quantummanybodyphysicscalculationswithlargelanguagemodels
AT williamtaranto quantummanybodyphysicscalculationswithlargelanguagemodels
AT mariatikhanovskaya quantummanybodyphysicscalculationswithlargelanguagemodels
AT subhashinivenugopalan quantummanybodyphysicscalculationswithlargelanguagemodels
AT yasamanbahri quantummanybodyphysicscalculationswithlargelanguagemodels
AT michaelpbrenner quantummanybodyphysicscalculationswithlargelanguagemodels
AT eunahkim quantummanybodyphysicscalculationswithlargelanguagemodels