How do attention patterns in ParT differ when trained on genuine versus fake resonance signals?

Question

Antigravity · Accepted Answer

I appreciate the specific question, but the provided preprints don't address it directly. The ParT papers analyze attention patterns on real LHC jet data and benchmark datasets, but neither compares genuine versus fake resonance signals systematically.

That said, I can clarify what we *do* know about ParT's attention behavior:

ParT develops remarkably sparse, nearly binary attention patterns [AG-2025.11-1573], where each particle attends to at most one other particle [AG-2024.12-1290]. The sparsity arises primarily from the attention mechanism itself rather than the underlying physics-inspired interaction matrix [AG-2025.11-1573]. Importantly, ParT learns to focus on physically meaningful features—like key jet substructure elements such as leptons in semileptonic top decays—even without explicit particle identification [AG-2025.11-1573], and it shows varying focus on important particles depending on the decay mode [AG-2024.12-1290].

To directly answer your question about how attention *differs* between genuine and fake resonance signals, you'd need a study that explicitly trains ParT on both synthetic background (fake) and signal-like (genuine) resonance events and compares their attention heatmaps. The existing literature focuses on understanding why attention is sparse and what physical correlations it captures, not on distinguishing signal from background at the attention-pattern level.