Antigravity Q&A
What fraction of retraining data is needed before geometry-specific fine-tuning introduces overfitting artifacts?
Answered by claude-haiku-4-5 · 10 May 2026
Answer
Your question asks about a threshold phenomenon in machine learning—specifically, when does fine-tuning on geometry-specific data start to hurt rather than help? The provided preprints don't directly address that overfitting boundary.
However, the archive *does* contain relevant empirical guidance on data efficiency in geometry transfer learning. In cross-detector fine-tuning studies, researchers found that [AG-2025.11-1571] achieved strong results with only 100 target-domain samples using parameter-efficient adaptation (updating just 17% of parameters), while [AG-2025.02-1472] showed that fine-tuning on 100,000 events from a new detector geometry matched the performance of training from scratch on 1 million events—a 10-fold data reduction. Both cases suggest that overfitting risk remains manageable when fine-tuning is done carefully.
The broader picture from [AG-2025.04-1127] is illuminating: deep learning exhibits a two-phase dynamic where rapid fitting (prone to overfitting) is followed by a slower "compression" phase that drives generalization. The implication is that overfitting artifacts may appear early in geometry-specific fine-tuning but can be mitigated by letting training progress longer, or by using parameter-efficient methods that constrain the model's adaptability.
A concrete answer to your precise question—the exact fraction of retraining data at which overfitting begins—is not available in these preprints; they report success cases rather than failure boundaries.
Sources · 8
- 53%physics.ins-detCross-Geometry Transfer Learning in Fast Electromagnetic Shower SimulationAG-2025.11-1571
- 52%cs.LGFine-Tuning Small Reasoning Models for Quantum Field TheoryAG-2026.04-892
- 50%cs.LGOpening the Black Box: predicting the trainability of deep neural networks with reconstruction entropyAG-2024.06-754
- 49%hep-exFine-tuning machine-learned particle-flow reconstruction for new detector geometries in future collidersAG-2025.02-1472
- 48%cs.LGArtificial Entanglement in the Fine-Tuning of Large Language ModelsAG-2026.01-751
- 47%cs.LGSymmetry Breaking in Transformers for Efficient and Interpretable TrainingAG-2026.01-998
- 47%hep-thA Two-Phase Perspective on Deep Learning DynamicsAG-2025.04-1127
- 47%hep-phInvestigating 1-Bit Quantization in Transformer-Based Top TaggingAG-2025.08-1162
Keep exploring
- How does parameter-efficient adaptation's 17% update compare to full fine-tuning in preventing geometry overfitting?
- Does the compression phase mechanism explain why longer training reduces overfitting despite using only 100 samples?
- What specific regularization techniques beyond parameter efficiency prevented artifacts in the 100,000-event detector geometry study?
This is a research aid — not a peer review. Verify sources before citing.