AG-2026.01-1203·hep-ph·cross-listed: hep-ex
Topic Modeling in New Physics Detection
Authors
- Alexandre Alves
- Eduardo da Silva Almeida
- Douglas Roberto Pimentel
Abstract
In this work, we apply topic modeling to detect new physics in proton-proton collisions at the LHC in an unsupervised way. We investigate three new physics scenarios where fully leptonic $t\bar{t}\to b\bar{b}\ell^+\ell^-ν_\ell\barν_\ell$ is the main source of background without relying on jet substructure variables. We demonstrate that the algorithm remains effective even in this low-particle multiplicity framework, complementing jet tagging studies, where it is typically employed. Moreover, we demonstrate that the performance of topic modeling is competitive or even better than well-known outlier detectors, such as isolation forest and variational autoencoders, with moderate and high background pollution in almost all new physics scenarios considered.
Submitted
15 January 20263 months ago
Version
v1
License
CC-BY-4.0
DOI
10.48550/arXiv.2601.10871
Chat with this PDF
Ask questions, probe assumptions, request a plain-English summary. Answers cite sections from the preprint itself.
Community
Questions and answers about this paper from other readers. No formal peer review — just a place to think out loud.