A universal vision transformer for fast calorimeter simulations

Luigi Favaro; Andrea Giammanco; Claudius Krause

doi:10.48550/arXiv.2601.05289

← Recent

AG-2026.01-1094·hep-ph·cross-listed: cs.LGhep-exphysics.ins-det

A universal vision transformer for fast calorimeter simulations

Authors

Luigi Favaro
Andrea Giammanco
Claudius Krause

Abstract

The high-dimensional complex nature of detectors makes fast calorimeter simulations a prime application for modern generative machine learning. Vision transformers (ViTs) can emulate the Geant4 response with unmatched accuracy and are not limited to regular geometries. Starting from the CaloDREAM architecture, we demonstrate the robustness and scalability of ViTs on regular and irregular geometries, and multiple detectors. Our results show that ViTs generate electromagnetic and hadronic showers statistically indistinguishable from Geant4 in multiple evaluation metrics, while maintaining the generation time in the $\mathcal{O}(10-100)$ ms on a single GPU. Furthermore, we show that pretraining on a large dataset and fine-tuning on the target geometry leads to reduced training costs and higher data efficiency, or altogether improves the fidelity of generated showers.

Submitted

7 January 20265 months ago

Version

v1

License

CC-BY-4.0

DOI

10.48550/arXiv.2601.05289

Cite this preprint

BibTeX RIS

Imports into BibLaTeX, Zotero, Mendeley, EndNote.

PDF

Open PDF

Opens in a new tab · v1.

Chat with this PDF

Ask questions, probe assumptions, request a plain-English summary. Answers cite sections from the preprint itself.

Community

Questions and answers about this paper from other readers. No formal peer review — just a place to think out loud.