AG-2026.04-1540·astro-ph.CO
Propagating data-driven galaxy redshift distribution uncertainties in 3$\times$2-pt analyses
Authors
- Jaime Ruiz-Zapatero
- Qianjun Hang
- Yun-Hao Zhang
- Benjamin Joachimi
- Joe Zuntz
- Ian Harrison
- Carlos García-García
- Alex Malz
- Benjamin Stölzner
- the LSST Dark Energy Science Collaboration
Abstract
Uncertainties in the radial distribution of galaxies, $\boldsymbol{n}(\boldsymbol{z})$, are one of the major contributions to the error budget of early Stage-IV galaxy survey analyses of weak gravitational lensing, galaxy clustering and galaxy-galaxy lensing (3$\times$2-pt). Based on ensembles of simulated $\boldsymbol{n}(\boldsymbol{z})$ including stochastic and systematic variations, we study the impact of four different $\boldsymbol{n}(\boldsymbol{z})$ uncertainty models: shifts, shifts & stretches, Gaussian processes (GP) and principal component analysis (PCA). Due to the high dimensionality of the latter models, we make use of state-of-the-art gradient-based inference methods as well as approximate analytical marginalisation schemes. Our results show that Stage-IV 3$\times$2-pt analyses must go beyond simple shift & stretch models. In particular, we advocate for the adoption of PCA models even in early Stage-IV surveys. Our results show that considering a five-parameters PCA model only degrades the constraint on the $S_{\rm 8}$ parameter by $5$ per cent with respect to the case when only a shift and a stretch parameter are included, while incurring half the bias in its constituents parameters, $Ω_{\rm m}$ and $σ_{\rm 8}$. We demonstrate that all models considered can be safely marginalised analytically, with speed-ups of up to a factor of 25 depending on the dimensionality of the model. This will allow Stage-IV analyses to safely include higher-dimensional $\boldsymbol{n}(\boldsymbol{z})$ uncertainty models in their analysis at negligible additional computational cost.
Submitted
27 April 20261 month ago
Version
v1
License
CC-BY-4.0
DOI
10.48550/arXiv.2604.24425
Summary
When measuring dark energy with large galaxy surveys, uncertainties in how galaxies are distributed across cosmic distances significantly bias results; the authors show that better mathematical models of these uncertainties improve accuracy by 5% while cutting bias in half, all without slowing down the analysis.
- Galaxy redshift uncertainties are a major systematic error in Stage-IV surveys—ignoring them properly can skew cosmological parameters like the matter density and structure growth.
- Simple shift-and-stretch models of redshift errors are insufficient; principal component analysis (PCA) models capture the full complexity of real uncertainties at minimal computational cost.
- Analytical marginalisation techniques can be 25× faster than sampling methods, making sophisticated uncertainty models practical even for next-generation surveys analyzing billions of galaxies.
curious · generated by claude-haiku-4-5
Chat with this PDF
Ask questions, probe assumptions, request a plain-English summary. Answers cite sections from the preprint itself.
Community
Questions and answers about this paper from other readers. No formal peer review — just a place to think out loud.