Antigravity Q&A
Could training on adversarial examples actively hurt sensitivity to rare but physically plausible BSM signals?
Answered by claude-haiku-4-5 · 3 May 2026
Answer
This is a sharp question that cuts to a real tension in modern machine learning for physics. Let me unpack what the archive shows.
The core risk is real, but subtle.
You're asking whether hardening a model against adversarial noise might inadvertently blind it to genuine rare signals. The archive addresses this from two angles:
First, adversarial training can *improve* generalization to real data. White-box adversarial attacks—which perturb the model's weights or inputs to stress-test it—actually help supervised classifiers generalize better to unseen data by flattening sharp local minima in the loss landscape [AG-2024.11-1226]. The intuition is that a model trapped in a sharp minimum has learned brittle, simulation-specific artifacts. Adversarial robustness forces the model to find smoother solutions that transfer better. This *should* help it catch physically plausible signals outside its training distribution.
But here's the catch: that study focused on *weight-space* robustness in a supervised setting (Higgs classification). A rarer signal in a different channel or parameter space is a different beast.
The real vulnerability: anomaly detection without signal awareness.
The sharper concern emerges from work on unsupervised anomaly detection [AG-2025.05-1524]. Semi-supervised methods (autoencoders, isolation forests, etc.) are agnostic to what signal you're hunting—they just learn to flag deviations from the Standard Model background. The problem: their sensitivity to rare BSM signals depends heavily on untunable hyperparameters, and the paper found no single hyperparameter choice optimizes sensitivity across all BSM benchmarks. If you adversarially harden such a model, you risk *narrowing its anomaly detection surface* to only the background morphology—making it less likely to catch truly unexpected topologies.
The solution the archive points to: signal awareness.
The most relevant paper here is [AG-2026.03-1712]. Rather than pure anomaly detection, they train a contrastive latent space on *both* SM background *and* a diverse set of hypothesized BSM signals. The result: the model stays sensitive to unseen signals (through extrapolation to novel topologies) *while* being hardened against simulation artifacts. Crucially, they show retention of sensitivity to BSM models not in the training set. This suggests that adversarial robustness + signal awareness is synergistic, not antagonistic.
Bottom line:
Adversarial training on background alone could hurt sensitivity to rare signals if it overshoots and erases the model's ability to recognize genuinely anomalous event shapes. But adversarial training applied *within a signal-aware framework*—where you've explicitly shown the model diverse BSM target morphologies—appears to enhance both robustness and sensitivity simultaneously [AG-2024.11-1226, AG-2026.03-1712]. The risk is real mainly for pure anomaly detection; supervised or semi-supervised approaches that embed signal diversity seem to dodge it.
Sources · 8
- 70%hep-phSensitivity to New Physics Phenomena in Anomaly Detection: A Study of Untunable HyperparametersAG-2025.05-1524
- 67%hep-phEnhancing generalization in high energy physics using white-box adversarial attacksAG-2024.11-1226
- 66%hep-phDeep Learning Approaches for BSM Physics: Evaluating DNN and GNN Performance in Particle Collision Event ClassificationAG-2024.11-1157
- 66%hep-phSignal-Aware Contrastive Latent Spaces for Anomaly DetectionAG-2026.03-1712
- 65%hep-phBig Dipper, Help Me Find A Way -- Dip-hunting at hadron collidersAG-2026.04-1515
- 64%hep-phEnhancing Sensitivity for Di-Higgs Boson Searches Using Anomaly Detection and Supervised Machine Learning TechniquesAG-2025.04-1518
- 64%hep-phQuantum similarity learning for anomaly detectionAG-2024.11-1249
- 63%hep-phAnomalous electroweak physics unraveled via evidential deep learningAG-2024.12-1648
Keep exploring
- Does adversarial hardening on SM background alone narrow sensitivity compared to signal-aware contrastive training?
- How much BSM morphological diversity in training is needed before adversarial robustness stops hurting rare signal detection?
- Could anomaly detection loss landscape smoothing from adversarial training actually erase unexpected BSM topology signatures?
This is a research aid — not a peer review. Verify sources before citing.