How does the model's sensitivity to unseen BSM topologies degrade if you gradually reduce training diversity?

Question

Antigravity · Accepted Answer

Your question asks about training diversity — how much variety in the training dataset affects a model's ability to recognize new, unseen BSM signatures. This is a real and important practical problem in collider physics. The archive doesn't contain a direct study of this degradation curve, but I can connect several relevant findings: What happens when you reduce diversity The closest parallel comes from foundational model design. A transformer trained on diverse multi process datasets (top quark production in 1t, 2t, 3t, and 4t channels simultaneously) can maintain sensitivity to rare high multiplicity topologies while preserving performance on conventional channels [AG 2025.11 1264]. The key insight: the model's adaptive attention mechanisms explicitly preserve unique interaction patterns while establishing cross process correlations . This suggests that removing processes from training (reducing diversity) would break those learned correlations and degrade sensitivity to related unseen topologies. Why diversity matters for anomaly detection Semi supervised anomaly detection methods — trained only on Standard Model background — can detect BSM signals [AG 2025.05 1524]. But these methods have untunable hyperparameters whose sensitivity to BSM benchmarks varies significantly. The implicit assumption is that the broader the "normal" background distribution captured during training, the more reliably anomalies stand out. Reducing training diversity would narrow that baseline, making it harder to spot deviations that fall outside the learned background manifold. The model dependence trap A practical caveat: when training data lacks diversity, models become brittle to interference effects and parameter space structure. Searches for scalar resonances fail under destructive interference because traditional training ignores these subtleties [AG 2026.04 1515]. A model trained only on simple SM events would likely miss BSM signatures hiding in subtle kinematic patterns. The archive does not contain a systematic study measuring sensitivity degradation as a function of training dataset diversity.

What happens when you reduce diversity

Why diversity matters for anomaly detection

The model-dependence trap