Optimal Equivariant Architectures from the Symmetries of Matrix-Element Likelihoods

Daniel Maître; Vishal S. Ngairangbam; Michael Spannowsky

doi:10.48550/arXiv.2410.18553

← Recent

AG-2024.10-1406·hep-ph·cross-listed: cs.LGhep-exphysics.data-an

Optimal Equivariant Architectures from the Symmetries of Matrix-Element Likelihoods

Authors

Daniel Maître
Vishal S. Ngairangbam
Michael Spannowsky

Abstract

The Matrix-Element Method (MEM) has long been a cornerstone of data analysis in high-energy physics. It leverages theoretical knowledge of parton-level processes and symmetries to evaluate the likelihood of observed events. In parallel, the advent of geometric deep learning has enabled neural network architectures that incorporate known symmetries directly into their design, leading to more efficient learning. This paper presents a novel approach that combines MEM-inspired symmetry considerations with equivariant neural network design for particle physics analysis. Even though Lorentz invariance and permutation invariance overall reconstructed objects are the largest and most natural symmetry in the input domain, we find that they are sub-optimal in most practical search scenarios. We propose a longitudinal boost-equivariant message-passing neural network architecture that preserves relevant discrete symmetries. We present numerical studies demonstrating MEM-inspired architectures achieve new state-of-the-art performance in distinguishing di-Higgs decays to four bottom quarks from the QCD background, with enhanced sample and parameter efficiencies. This synergy between MEM and equivariant deep learning opens new directions for physics-informed architecture design, promising more powerful tools for probing physics beyond the Standard Model.

Submitted

24 October 20241 year ago

Version

v1

License

CC-BY-4.0

DOI

10.48550/arXiv.2410.18553

Cite this preprint

BibTeX RIS

Imports into BibLaTeX, Zotero, Mendeley, EndNote.

PDF

Open PDF

Opens in a new tab · v1.

Summary

Combining ideas from traditional particle-physics likelihood methods with modern symmetric neural networks yields better performance for detecting rare Higgs decays, suggesting that full Lorentz symmetry isn't always the best constraint for learning physics signatures.

The paper breaks with convention by showing that architectures with *partial* symmetry (preserving only longitudinal boosts, not full Lorentz invariance) outperform fully symmetric ones for real particle-physics tasks.
The approach marries classical Matrix-Element Methods—which encode physics knowledge as likelihoods—with equivariant neural networks, a modern way to bake symmetries into deep learning.
On a benchmark task (separating Higgs pair decays from background noise), this hybrid architecture achieves state-of-the-art results while needing less training data and fewer parameters than existing methods.

curious · generated by claude-haiku-4-5

Chat with this PDF

Ask questions, probe assumptions, request a plain-English summary. Answers cite sections from the preprint itself.

Community

Questions and answers about this paper from other readers. No formal peer review — just a place to think out loud.