AG-2025.06-1375·astro-ph.CO·cross-listed: astro-ph.IMhep-phphysics.data-an
Large Language Models -- the Future of Fundamental Physics?
Authors
- Caroline Heneka
- Florian Nieser
- Ayodele Ore
- Tilman Plehn
- Daniel Schiller
Abstract
For many fundamental physics applications, transformers, as the state of the art in learning complex correlations, benefit from pretraining on quasi-out-of-domain data. The obvious question is whether we can exploit Large Language Models, requiring proper out-of-domain transfer learning. We show how the Qwen2.5 LLM can be used to analyze and generate SKA data, specifically 3D maps of the cosmological large-scale structure for a large part of the observable Universe. We combine the LLM with connector networks and show, for cosmological parameter regression and lightcone generation, that this Lightcone LLM (L3M) with Qwen2.5 weights outperforms standard initialization and compares favorably with dedicated networks of matching size.
Submitted
17 June 202510 months ago
Version
v1
License
CC-BY-4.0
DOI
10.48550/arXiv.2506.14757
Chat with this PDF
Ask questions, probe assumptions, request a plain-English summary. Answers cite sections from the preprint itself.
Community
Questions and answers about this paper from other readers. No formal peer review — just a place to think out loud.