AG-2024.10-1556·hep-ph·cross-listed: hep-ex
Systematic Interpretability and the Likelihood for Boosted Top Quark Identification
Authors
- Andrew J. Larkoski
Abstract
Identification of boosted, hadronically-decaying top quarks is a problem of central importance for physics goals of the Large Hadron Collider. We present a theoretical analysis of top quark tagging, establishing zeroth-order, minimal assumptions that should be satisfied by any purported top-tagged jet, like existence of three hard subjets, a bottom-tagged subjet, total mass consistent with the top quark, and a pairwise subjet mass consistent with the W boson. From these minimal assumptions, we construct the optimal discrimination observable, the likelihood ratio, for the binary discrimination problem of top quark-initiated versus bottom quark-initiated jets through next-to-leading order in the strong coupling. We compare and compute corresponding signal and background efficiencies both analytically and from simulated data, validating an understanding of the relevant physics identified and exploited by the likelihood. In the process, we construct a method for systematic interpretability of the likelihood ratio for this problem, and explicitly establish a hard floor on possible discrimination power. These results can correspondingly be applied to understanding and interpreting machine learning studies of this problem.
Submitted
31 October 20241 year ago
Version
v1
License
CC-BY-4.0
DOI
10.48550/arXiv.2411.00104
Chat with this PDF
Ask questions, probe assumptions, request a plain-English summary. Answers cite sections from the preprint itself.
Community
Questions and answers about this paper from other readers. No formal peer review — just a place to think out loud.