AG-2026.04-1329 · hep-ph · cross-listed: hep-ex, physics.data-an
Kitchen Sink Anomaly Detection
Authors
- Ranit Das
- Marie Hein
- Gregor Kasieczka
- Michael Krämer
- Lukas Lang
- Radha Mastandrea
- Louis Moureaux
- Alexander Mück
- David Shih
Abstract
An enormous amount of R&D effort has resulted in many new resonant anomaly detection methods being proposed in recent years. However, the vast majority of previous R&D studies have suffered from two limitations: they have focused on a very small set of simulated signal benchmark models; and they have either used small sets of carefully crafted high-level jet substructure observables, which can be highly performant but are prone to model dependence, or the full collider event phase space, which is more agnostic but suffers from reduced sensitivity. In this work, we address both limitations: we formulate a number of new simulated signal benchmarks, which we make publicly available in a format fully compatible with the LHCO R&D benchmark; and we explore a high-level, yet highly agnostic, observable set consisting of Energy Flow Polynomials in addition to the usual subjettiness variables. We evaluate this "kitchen sink" observable set for both an idealized anomaly detector and the CWoLa hunting task, along with three baseline observable sets (the Baseline LHC Olympics set, subjettiness observables, and Energy Flow Polynomials). We find that our kitchen sink approach is the most sensitive to a broad range of signal types. Furthermore, we show that an attribute bagging variant, in which each ensemble member is trained on a random subset of substructure observables, yields comparable anomaly detection performance while significantly reducing training cost.
Submitted
22 April 2026
Version
v1
License
CC-BY-4.0
DOI
10.48550/arXiv.2604.20965
Summary
This paper expands the benchmark suite for resonant anomaly detection at the LHC and shows that combining Energy Flow Polynomials with subjettiness variables (a "kitchen sink" observable set) gives the best sensitivity to a broad range of signal types among the observable sets tested. An attribute bagging variant retains comparable performance at significantly lower training cost.
- New simulated signal benchmarks are introduced and released in a format fully compatible with the LHCO R&D benchmark, addressing the reliance of prior studies on a very small set of simulated signal models.
- Energy Flow Polynomials are added to the usual subjettiness variables to form a high-level yet highly agnostic observable set, aiming to avoid both the model dependence of small hand-crafted sets and the reduced sensitivity of full phase-space inputs.
- Attribute bagging, in which each ensemble member is trained on a random subset of substructure observables, yields anomaly detection performance comparable to the full kitchen sink set while significantly reducing training cost, which may point to redundancy among the observables.
specialist · generated by claude-haiku-4-5
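The attribute bagging idea described in the abstract (each ensemble member sees only a random subset of observables, and the member scores are combined) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the base learner is a plain logistic regression fit by gradient descent, chosen only to keep the example dependency-free; the data are synthetic Gaussian features; and the names `fit_attribute_bagging`, `predict_attribute_bagging`, `n_members`, and `n_sub` are hypothetical.

```python
import numpy as np


def fit_logistic(X, y, steps=200, lr=0.1):
    """Fit a logistic regression by gradient descent (stand-in base learner)."""
    Xb = np.hstack([X, np.ones((X.shape[0], 1))])  # append a bias column
    w = np.zeros(Xb.shape[1])
    for _ in range(steps):
        p = 1.0 / (1.0 + np.exp(-Xb @ w))
        w -= lr * Xb.T @ (p - y) / len(y)  # gradient of the mean log-loss
    return w


def predict_logistic(w, X):
    """Return the logistic score in [0, 1] for each row of X."""
    Xb = np.hstack([X, np.ones((X.shape[0], 1))])
    return 1.0 / (1.0 + np.exp(-Xb @ w))


def fit_attribute_bagging(X, y, n_members=10, n_sub=5, seed=0):
    """Train each ensemble member on a random subset of feature columns."""
    rng = np.random.default_rng(seed)
    members = []
    for _ in range(n_members):
        # Sample n_sub distinct observables for this member.
        cols = rng.choice(X.shape[1], size=n_sub, replace=False)
        members.append((cols, fit_logistic(X[:, cols], y)))
    return members


def predict_attribute_bagging(members, X):
    """Average the member scores to obtain the ensemble score."""
    return np.mean(
        [predict_logistic(w, X[:, cols]) for cols, w in members], axis=0
    )


# Toy data (assumption): 20 hypothetical observables, of which only the
# first 3 are shifted for the "signal-like" class.
rng = np.random.default_rng(1)
n, d = 2000, 20
y = rng.integers(0, 2, size=n).astype(float)
X = rng.normal(size=(n, d))
X[:, :3] += 1.0 * y[:, None]

members = fit_attribute_bagging(X, y, n_members=10, n_sub=5, seed=0)
scores = predict_attribute_bagging(members, X)
```

Each member here is fit on 5 of the 20 columns, so the per-member training problem is smaller than a fit on the full observable set. Whether the ensemble as a whole is cheaper depends on the base learner and the number of members, which this sketch does not attempt to model.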