Cross-Domain Transfer with Particle Physics Foundation Models: From Jets to Neutrino Interactions

Gregor Krzmanc; Vinicius Mikuni; Benjamin Nachman; Callum Wilkinson

doi:10.48550/arXiv.2604.12364

← Recent

AG-2026.04-1164·hep-ex·cross-listed: cs.LGhep-phphysics.data-an

Cross-Domain Transfer with Particle Physics Foundation Models: From Jets to Neutrino Interactions

Authors

Gregor Krzmanc
Vinicius Mikuni
Benjamin Nachman
Callum Wilkinson

Abstract

Future AI-based studies in particle physics will likely start from a foundation model to accelerate training and enhance sensitivity. As a step towards a general-purpose foundation model for particle physics, we investigate whether the OmniLearned foundation model pre-trained on diverse high-$Q^2$ simulated and real $pp$ and $ep$ collisions can be effectively transferred to a few-GeV fixed-target neutrino experiment. We process MINERvA neutrino--nucleus scattering events and evaluate pre-trained models on two types of tasks: regression of available energy and binary classification of charged-current pion final states ($\mathrm{CC1π^{\pm}}$, $\mathrm{CCNπ^{\pm}}$, and $\mathrm{CC1π^{0}}$). Pre-trained OmniLearned models consistently outperform similarly sized models trained from scratch, achieving better overall performance at the same compute budget, as well as achieving better performance at the same number of training steps. These results suggest that particle-level foundation models acquire inductive biases that generalize across large differences in energy scale, detector technology, and underlying physics processes, pointing toward a paradigm of detector-agnostic inference in particle physics.

Submitted

14 April 20263 months ago

Version

v1

License

CC-BY-4.0

DOI

10.48550/arXiv.2604.12364

Cite this preprint

BibTeX RIS

Imports into BibLaTeX, Zotero, Mendeley, EndNote.

PDF

Open PDF

Opens in a new tab · v1.

Summary

A physics foundation model pre-trained on high-energy collider data transfers effectively to low-energy neutrino experiments, learning generalizable patterns about particle behavior that work across vastly different detector types and energy scales.

The OmniLearned foundation model, trained on simulated and real collider data, outperforms models trained from scratch on neutrino-scattering tasks—suggesting particle physics has some universal "grammar" that carries across experiments.
The transfer works despite enormous differences: collider energies (TeV-scale) versus neutrino experiments (few GeV), different detectors, and different physics—implying foundation models capture fundamental patterns about how particles interact, not just memorize one experiment's quirks.
This hints at a future where physicists build one powerful foundation model for particle detection, then fine-tune it for whatever experiment they're analyzing, rather than training each detector independently from scratch.

curious · generated by claude-haiku-4-5

Chat with this PDF

Ask questions, probe assumptions, request a plain-English summary. Answers cite sections from the preprint itself.

Community

Questions and answers about this paper from other readers. No formal peer review — just a place to think out loud.