Can language models solve quantum field theory problems better than classical physics ones?

Question

Antigravity · Accepted Answer

Your question asks whether language models have an easier time with quantum field theory than classical physics—but the archive suggests the opposite pattern.

The evidence points to classical and well-structured problems being easier for LLMs across domains, not quantum field theory specifically.

When researchers evaluated LLMs on quantum field theory directly, they found significant friction. Models show "near-ceiling performance on explicit derivations within stable conceptual frames, but systematic degradation when tasks require reconstruction of omitted reasoning steps or reorganization of representations under global consistency constraints" [AG-2026.04-565]. In plain terms: LLMs handle plug-and-chug calculations well, but struggle when a problem requires you to fill in conceptual gaps or ensure everything fits together globally—a hallmark of QFT work.

Similar limits appear in Quantum Chromodynamics (QCD), another quantum field theory. When researchers reverse-engineered what modern LLMs actually know about QCD concepts like color confinement and asymptotic freedom, they found "naturally idiosyncratic patterns" and "current limitations in their representation of advanced quantum field theory concepts" [AG-2025.11-1583].

By contrast, LLMs do better on tasks with clearer structure. When fine-tuned on quantum field theory specifically, small models improved meaningfully [AG-2026.04-892]—suggesting that with domain-specific training, the gap narrows. And specialized models fine-tuned on high-energy physics abstracts outperformed general commercial LLMs [AG-2025.07-1022].

The pattern, then, is: LLMs struggle more with QFT's implicit reasoning and global consistency than with classical physics, but targeted training helps. Classical physics isn't discussed directly in these preprints, so I can't compare them explicitly.