Antigravity Q&A
How does the model's sensitivity to unseen BSM topologies degrade if you gradually reduce training diversity?
Answered by claude-haiku-4-5 · 10 May 2026
Answer
Your question is about training diversity: how much the variety of processes in the training dataset affects a model's ability to recognize new, unseen BSM signatures. This is a real and important practical problem in collider physics.
The archive doesn't contain a direct study of this degradation curve, but I can connect several relevant findings:
What happens when you reduce diversity
The closest parallel comes from foundational model design. A transformer trained on diverse multi-process datasets (top-quark production in the 1t, 2t, 3t, and 4t channels simultaneously) can maintain sensitivity to rare high-multiplicity topologies while preserving performance on conventional channels [AG-2025.11-1264]. The key insight: the model's adaptive attention mechanisms explicitly preserve unique interaction patterns while establishing cross-process correlations. By inference, removing processes from training (reducing diversity) would weaken those learned correlations and degrade sensitivity to related unseen topologies. The paper does not test this directly.
Why diversity matters for anomaly detection
Semi-supervised anomaly detection methods, trained only on Standard Model background, can detect BSM signals [AG-2025.05-1524]. But these methods depend on untunable hyperparameters, and their sensitivity varies significantly across BSM benchmarks. The implicit assumption is that the broader the "normal" background distribution captured during training, the more reliably anomalies stand out. Reducing training diversity would narrow that baseline: background processes missing from training start to look anomalous themselves, which makes it harder to separate genuine deviations from the learned background manifold.
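To make the "narrower baseline" argument concrete, here is a minimal toy sketch of how such a diversity scan could be set up. Everything in it is an assumption for illustration and none of it comes from the cited papers: the four "SM processes" and the "BSM signal" are 2D Gaussian blobs, the anomaly score is the distance to the nearest training event, and the names (scan_diversity, nn_distance, BACKGROUND_CENTERS, SIGNAL_CENTER) are hypothetical. The test background always contains all four processes; only the training set is restricted to the first k of them.

```python
import numpy as np

# Hypothetical toy setup: four "SM processes" and one "BSM signal", each a
# 2D Gaussian blob. None of these numbers come from the cited papers.
BACKGROUND_CENTERS = [(0.0, 0.0), (3.0, 0.0), (0.0, 3.0), (3.0, 3.0)]
SIGNAL_CENTER = (1.5, 1.5)


def sample(rng, center, n, scale):
    """Draw n toy events with two features around a given center."""
    return rng.normal(loc=center, scale=scale, size=(n, 2))


def nn_distance(train, events):
    """Anomaly score: Euclidean distance to the nearest training event."""
    diff = events[:, None, :] - train[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=2)).min(axis=1)


def auc(bkg_scores, sig_scores):
    """Probability that a random signal event outscores a random background event."""
    diff = sig_scores[:, None] - bkg_scores[None, :]
    return float((diff > 0).mean() + 0.5 * (diff == 0).mean())


def scan_diversity(seed=0, n_train=500, n_test=200):
    """Return the AUC when training on the first k processes, for k = 1..4."""
    rng = np.random.default_rng(seed)
    # The test background always contains every process; only training is restricted.
    test_bkg = np.vstack([sample(rng, c, n_test, 0.5) for c in BACKGROUND_CENTERS])
    test_sig = sample(rng, SIGNAL_CENTER, n_test, 0.3)
    aucs = []
    for k in range(1, len(BACKGROUND_CENTERS) + 1):
        train = np.vstack(
            [sample(rng, c, n_train, 0.5) for c in BACKGROUND_CENTERS[:k]]
        )
        aucs.append(auc(nn_distance(train, test_bkg), nn_distance(train, test_sig)))
    return aucs


for k, value in enumerate(scan_diversity(), start=1):
    print(f"trained on {k} process(es): AUC = {value:.3f}")
```

In this toy, the AUC drops as processes are removed because the held-out background processes score as anomalous and crowd out the signal. How steep the curve is depends on which processes are dropped and where the signal sits, so a real study would need to repeat the scan over different subsets and signal benchmarks.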
The model-dependence trap
A practical caveat: when training data lacks diversity, models can become brittle to interference effects and parameter-space structure. Searches for scalar resonances can fail under destructive interference, where the signal shows up as a dip rather than a peak and conventional strategies overlook it [AG-2026.04-1515]. By the same logic, a model trained only on simple SM events would likely miss BSM signatures hiding in subtle kinematic patterns.
To be clear, the archive does not contain a systematic study measuring sensitivity degradation as a function of training dataset diversity, so the points above are inferences from related work rather than measured results.
Sources · 8
- 62% · hep-ph · Sensitivity to New Physics Phenomena in Anomaly Detection: A Study of Untunable Hyperparameters · AG-2025.05-1524
- 61% · hep-ph · Deep Learning Approaches for BSM Physics: Evaluating DNN and GNN Performance in Particle Collision Event Classification · AG-2024.11-1157
- 60% · hep-ph · A Methodology for Developing Foundational Transformer Models in Collider Physics Analysis · AG-2025.11-1264
- 60% · hep-ph · Big Dipper, Help Me Find A Way -- Dip-hunting at hadron colliders · AG-2026.04-1515
- 60% · hep-ex · Development of systematic uncertainty-aware neural network trainings for binned-likelihood analyses at the LHC · AG-2025.02-1304
- 60% · hep-ph · Improving sensitivity of vectorlike top partner searches with jet substructure · AG-2025.07-1193
- 59% · hep-ph · Enhancing Sensitivity for Di-Higgs Boson Searches Using Anomaly Detection and Supervised Machine Learning Techniques · AG-2025.04-1518
- 59% · hep-ph · Reweighting and Analysing Event Generator Systematics by Neural Networks on High-Level Features · AG-2025.03-1333
This is a research aid — not a peer review. Verify sources before citing.