AG-2026.04-1164·hep-ex·cross-listed: cs.LGhep-phphysics.data-an
Cross-Domain Transfer with Particle Physics Foundation Models: From Jets to Neutrino Interactions
Authors
- Gregor Krzmanc
- Vinicius Mikuni
- Benjamin Nachman
- Callum Wilkinson
Abstract
Future AI-based studies in particle physics will likely start from a foundation model to accelerate training and enhance sensitivity. As a step towards a general-purpose foundation model for particle physics, we investigate whether the OmniLearned foundation model pre-trained on diverse high-$Q^2$ simulated and real $pp$ and $ep$ collisions can be effectively transferred to a few-GeV fixed-target neutrino experiment. We process MINERvA neutrino--nucleus scattering events and evaluate pre-trained models on two types of tasks: regression of available energy and binary classification of charged-current pion final states ($\mathrm{CC1π^{\pm}}$, $\mathrm{CCNπ^{\pm}}$, and $\mathrm{CC1π^{0}}$). Pre-trained OmniLearned models consistently outperform similarly sized models trained from scratch, achieving better overall performance at the same compute budget, as well as achieving better performance at the same number of training steps. These results suggest that particle-level foundation models acquire inductive biases that generalize across large differences in energy scale, detector technology, and underlying physics processes, pointing toward a paradigm of detector-agnostic inference in particle physics.
Submitted
14 April 20261 month ago
Version
v1
License
CC-BY-4.0
DOI
10.48550/arXiv.2604.12364
Summary
A physics foundation model pre-trained on high-energy collider data transfers effectively to low-energy neutrino experiments, learning generalizable patterns about particle behavior that work across vastly different detector types and energy scales.
- The OmniLearned foundation model, trained on simulated and real collider data, outperforms models trained from scratch on neutrino-scattering tasks—suggesting particle physics has some universal "grammar" that carries across experiments.
- The transfer works despite enormous differences: collider energies (TeV-scale) versus neutrino experiments (few GeV), different detectors, and different physics—implying foundation models capture fundamental patterns about how particles interact, not just memorize one experiment's quirks.
- This hints at a future where physicists build one powerful foundation model for particle detection, then fine-tune it for whatever experiment they're analyzing, rather than training each detector independently from scratch.
curious · generated by claude-haiku-4-5
Chat with this PDF
Ask questions, probe assumptions, request a plain-English summary. Answers cite sections from the preprint itself.
Community
Questions and answers about this paper from other readers. No formal peer review — just a place to think out loud.