AG-2026.04-1891·hep-th·cross-listed: hep-ph
Optimal Architecture and Fundamental Bounds in Neural Network Field Theory
Authors
- Zhengkang Zhang
Abstract
Neural network field theory (NNFT) represents fields as neural networks and samples field configurations by drawing network parameters from a probability distribution. We identify a previously unexplored architectural freedom in NNFT, parameterized by $α$, that leaves the infinite-width theory invariant but dramatically affects finite-width errors in the calculation of correlation functions. For a massive scalar field, we show that $α=0$, corresponding to propagator-weighted neuron momenta and constant neuron amplitudes, is optimal: it minimizes finite-width variance and uniquely removes IR-sensitive corrections in the interacting theory. Even at $α=0$, relative errors from both bias and variance grow exponentially with distance beyond the correlation length. The bias can be removed by extrapolating to infinite width, which we demonstrate numerically, while the variance imposes a fundamental bound on the achievable signal-to-noise ratio as in lattice field theory. These results chart a path toward developing NNFT into a practical tool for the numerical study of field theories.
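The sampling idea in the abstract can be illustrated with a minimal one-dimensional toy, which is a sketch under stated assumptions and not the paper's construction. It assumes a free Euclidean scalar of mass `mass`, cosine neurons with uniform random phases, neuron momenta drawn from a density proportional to the propagator $1/(k^2+m^2)$, and a constant amplitude fixed so that the two-point function equals $e^{-m|r|}/(2m)$. All function names and the normalization are hypothetical choices made for this illustration.

```python
import math
import random


def sample_params(n_neurons, mass, rng):
    """Draw one network: momenta from p(k) = (m/pi)/(k^2 + m^2), phases uniform in [0, 2*pi)."""
    # Inverse-CDF sampling of a Cauchy distribution with scale `mass` (assumed toy choice).
    momenta = [mass * math.tan(math.pi * (rng.random() - 0.5)) for _ in range(n_neurons)]
    phases = [2.0 * math.pi * rng.random() for _ in range(n_neurons)]
    return momenta, phases


def field(x, momenta, phases, mass):
    """Evaluate the toy field at position x with a constant neuron amplitude."""
    # Amplitude chosen so that <phi(0) phi(r)> = exp(-m|r|) / (2m) for any width.
    amp = math.sqrt(2.0 / len(momenta)) * math.sqrt(1.0 / (2.0 * mass))
    return amp * sum(math.cos(k * x + b) for k, b in zip(momenta, phases))


def exact_propagator(r, mass):
    """Free 1D Euclidean propagator, used here only as a reference value."""
    return math.exp(-mass * abs(r)) / (2.0 * mass)


def estimate_two_point(r, mass, n_neurons, n_samples, seed=0):
    """Monte Carlo estimate of <phi(0) phi(r)> and its standard error."""
    rng = random.Random(seed)
    values = []
    for _ in range(n_samples):
        momenta, phases = sample_params(n_neurons, mass, rng)
        values.append(field(0.0, momenta, phases, mass) * field(r, momenta, phases, mass))
    mean = sum(values) / n_samples
    var = sum((v - mean) ** 2 for v in values) / (n_samples - 1)
    return mean, math.sqrt(var / n_samples)


if __name__ == "__main__":
    for r in (0.0, 1.0, 2.0, 4.0):
        mean, err = estimate_two_point(r, mass=1.0, n_neurons=16, n_samples=4000)
        exact = exact_propagator(r, 1.0)
        print(f"r={r}: estimate={mean:.4f} +/- {err:.4f}, exact={exact:.4f}, rel. error={err / exact:.3f}")
```

In this toy the statistical error stays roughly constant with distance while the signal decays as $e^{-m|r|}$, so the relative error grows quickly once $r$ exceeds the correlation length $1/m$. The free toy has no finite-width bias in the two-point function, so it says nothing about the bias extrapolation or the interacting theory discussed in the abstract.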
Submitted
29 April 2026
Version
v1
License
CC-BY-4.0
DOI
10.48550/arXiv.2604.27050
Summary
Neural networks can represent quantum fields through a family of architectures that agree in the infinite-width limit, but one choice sharply reduces finite-width errors in correlation functions. Fundamental noise limits remain, similar to those in traditional lattice simulations.
- The optimal architecture (α=0) draws neuron momenta with propagator weighting and uses constant neuron amplitudes. It minimizes finite-width variance and is the only choice that removes IR-sensitive corrections in the interacting theory.
- Even at α=0, relative errors from both bias and variance grow exponentially with distance beyond the correlation length, and the variance bounds the achievable signal-to-noise ratio much as in lattice field theory.
- The bias (systematic error) can be removed by extrapolating to infinite width, which the paper demonstrates numerically. The variance remains and sets the practical limit on precision.
curious · generated by claude-haiku-4-5