AG-2025.03-1560·hep-ph
Strong CWoLa: Binary Classification Without Background Simulation
Authors
- Samuel Klein
- Matthew Leigh
- Stephen Mulligan
- Tobias Golling
Abstract
Supervised deep learning methods have been successful in high-energy physics, and the field is moving away from high-level reconstructed variables towards lower-level, higher-dimensional features. Supervised methods require labelled data, which is typically provided by a simulator. As the number of features increases, simulation accuracy decreases, so classifiers trained on lower-level features face a greater domain shift between training and testing data. This work demonstrates that the classification without labels (CWoLa) paradigm can be used to remove the need for background simulation when training supervised classifiers. This can result in classifiers with higher performance on real data than those trained on simulated data.
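As a rough illustration of the classification without labels idea mentioned in the abstract, the sketch below trains a classifier to separate two mixed samples with different signal fractions, using only the sample-of-origin label, and then checks how well its score separates true signal from true background. Everything here is an assumption made for illustration and is not taken from the preprint: the 2D Gaussian toy features, the signal fractions, the plain logistic regression, and the helper names `sample` and `auc`. In particular, treating the signal-rich sample as a stand-in for signal simulation and the background-dominated sample as a stand-in for unlabelled data is only one plausible reading of the abstract.

```python
import numpy as np

rng = np.random.default_rng(0)


def sample(n, signal_fraction):
    """Draw n toy 2D events; return features and the hidden true labels."""
    is_signal = rng.random(n) < signal_fraction
    # Toy assumption: signal is centred at (+1, +1), background at (-1, -1).
    centres = np.where(is_signal[:, None], 1.0, -1.0)
    x = centres + rng.normal(size=(n, 2))
    return x, is_signal.astype(int)


def auc(scores, labels):
    """Area under the ROC curve via the rank-sum (Mann-Whitney) statistic."""
    order = np.argsort(scores)
    ranks = np.empty(len(scores))
    ranks[order] = np.arange(1, len(scores) + 1)
    n_pos = labels.sum()
    n_neg = len(labels) - n_pos
    return (ranks[labels == 1].sum() - n_pos * (n_pos + 1) / 2) / (n_pos * n_neg)


# Sample A: signal-rich (hypothetical stand-in for signal simulation).
# Sample B: background-dominated (hypothetical stand-in for unlabelled data).
x_a, _ = sample(5000, 1.0)
x_b, _ = sample(5000, 0.05)

# The classifier only ever sees which sample an event came from,
# never the true signal/background label.
x = np.vstack([x_a, x_b])
sample_label = np.concatenate([np.ones(len(x_a)), np.zeros(len(x_b))])

# Logistic regression fitted by plain gradient descent.
w = np.zeros(2)
b = 0.0
for _ in range(500):
    p = 1.0 / (1.0 + np.exp(-(x @ w + b)))
    grad = p - sample_label
    w -= 0.1 * x.T @ grad / len(x)
    b -= 0.1 * grad.mean()

# Evaluate against the true labels of an independent test sample.
x_test, truth_test = sample(5000, 0.5)
test_auc = auc(x_test @ w + b, truth_test)
print(f"AUC on true signal vs background labels: {test_auc:.3f}")
```

In this toy setup the score learned from sample-of-origin labels also ranks true signal above true background, which is the property that lets such a classifier be trained without a simulated background sample.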
Submitted
19 March 2025
Version
v1
License
CC-BY-4.0
DOI
10.48550/arXiv.2503.14876