How do surrogates trained on one detector size generalize to substantially larger or smaller geometries?

Question

Antigravity · Accepted Answer

Cross Geometry Generalization of Machine Learning Surrogates This is a sharp question about a practical bottleneck in physics simulation. The short answer: surrogates generalize surprisingly well across geometry changes if you use the right architecture and transfer learning, but the mechanism depends on how you represent the problem. The Generalization Problem Traditional detector surrogates are "voxel based"—they discretize 3D space into fixed grids tied to one specific detector size. Retraining from scratch for each new geometry is prohibitively expensive, especially for large collaborations designing future colliders [AG 2025.11 1571]. Point Clouds and Geometry Agnostic Representations The key insight is that geometry should not constrain the representation . Point cloud models—which store only the coordinates and properties of active detector elements, not empty voxels—naturally generalize across scales [AG 2025.11 1571]. This matters because a shower's physics (how electrons and photons cascade) is fundamentally about particle interactions , not grid resolution. Using point clouds with transfer learning, researchers achieved a 44% improvement in shower fidelity when adapting from the International Large Detector to new configurations using only 100 target domain samples [AG 2025.11 1571]. This is dramatic: you don't need a million new training events to retrain the entire model. Practical Transfer Learning Results The concrete evidence is compelling: Cross detector transfer : A model pretrained on the Compact Linear Collider detector (CLICdet) required 10× fewer samples to reach the same performance on a larger detector (CLD) compared to training from scratch—100,000 events vs. 1 million [AG 2025.02 1472]. Parameter efficiency : Fine tuning only the bias terms (17% of model parameters) achieved competitive performance while preserving learned shower physics [AG 2025.11 1571]. Vision Transformers scale robustly : Vision Transformers show particular promise because they handle both regular and irregular geometries without re voxelization [AG 2026.01 1094]. Pretraining on a large dataset and fine tuning on the target geometry reduces training costs and improves fidelity [AG 2026.01 1094]. Why It Works: Adaptation Mechanisms Transfer learning succeeds because the early layers of the surrogate learn generic shower physics—how energy deposits correlate, how cascades develop—that doesn't depend on detector size [AG 2025.11 1571]. Only the final layers need adjustment for the new geometry's boundaries and granularity. This is analogous to how a vision model trained on ImageNet learns generic edge detection features that transfer to new image sizes. The Remaining Challenge However, extrapolation to radically different regimes (e.g., much denser detectors or drastically smaller active volumes) remains an open problem. The transfer learning studies shown here are mostly interpolation —moving between similar detector designs at the same collider. Generalization to fundamentally new detector concepts, especially outside the distribution of the pretraining data, is where surrogates can fail [AG 2024.03 993].

The Generalization Problem

Point Clouds and Geometry-Agnostic Representations

Practical Transfer Learning Results

Why It Works: Adaptation Mechanisms

The Remaining Challenge