AG-2026.05-169·nucl-th
Sequential Bayesian inference with correlated heavy-ion datasets
Authors
- Lipei Du
Abstract
Bayesian inference provides a natural framework for updating knowledge as new information becomes available, often in a sequential manner by incorporating datasets in stages or reusing previous posteriors as priors. In practice, this is commonly implemented using a factorized update in which datasets are treated as conditionally independent. When datasets are statistically correlated, however, this approximation becomes inconsistent with the joint likelihood and can lead to biased posterior estimates. In this work, we investigate this issue in a controlled setting using pseudo-data with a tunable covariance structure. We compare joint inference, factorized sequential updating, and a formulation based on the exact conditional likelihood. We show that factorized updates reproduce the joint posterior only in the limit of conditional independence, and otherwise lead to systematic deviations that grow with the correlation strength, while conditional updates remain consistent with the joint result. To interpret these deviations, we introduce an information decomposition that separates contributions into components that are new and components that are redundant across datasets. We show that correlations induce a structured, parameter-dependent redistribution of information, governed by the overlap of dataset sensitivities. The resulting mismatch between marginal and conditional information quantitatively explains the observed deviations. These results provide a practical diagnostic for assessing the consistency of sequential Bayesian inference with correlated datasets and highlight the need for a consistent treatment of correlations within a common probabilistic framework.
Submitted
18 May 2026
Version
v1
License
CC-BY-4.0
DOI
10.48550/arXiv.2605.17868
Summary
When scientific beliefs are updated with new data using Bayesian methods, treating correlated datasets as independent creates systematic errors. This is especially problematic in heavy-ion physics, where experiments share common uncertainties.
- The common practice of updating sequentially while treating datasets as conditionally independent fails when the datasets are correlated, producing biased posterior estimates whose deviation from the joint result grows with the correlation strength.
- Replacing the factorized update with the exact conditional likelihood keeps sequential results consistent with a joint analysis of all the data.
- Correlations redistribute information between datasets in a structured, parameter-dependent way governed by the overlap of dataset sensitivities. The mismatch between marginal and conditional information serves as a practical diagnostic for whether a sequential update can be trusted.
curious · generated by claude-haiku-4-5
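The comparison described in the abstract (joint inference, factorized sequential updating, and conditional updating) can be illustrated with a minimal sketch. It assumes a linear-Gaussian toy model with known covariance, which is not necessarily the setup used in the preprint: the sensitivity matrices `J1` and `J2`, the noise scale `sigma`, the unit Gaussian prior, the correlation parameter `rho`, and the function names `gaussian_update` and `compare_updates` are all illustrative assumptions.

```python
import numpy as np


def gaussian_update(mean0, cov0, J, y, noise_cov):
    """Conjugate Gaussian update for y = J @ theta + noise, noise ~ N(0, noise_cov)."""
    prec0 = np.linalg.inv(cov0)
    noise_prec = np.linalg.inv(noise_cov)
    cov = np.linalg.inv(prec0 + J.T @ noise_prec @ J)
    mean = cov @ (prec0 @ mean0 + J.T @ noise_prec @ y)
    return mean, cov


def compare_updates(rho, sigma=0.1, seed=0):
    """Compare joint, factorized, and conditional updates for two datasets
    whose noise is cross-correlated with strength rho (requires |rho| < 1)."""
    rng = np.random.default_rng(seed)
    theta_true = np.array([1.0, -0.5])          # hypothetical "true" parameters
    J1 = np.array([[1.0, 0.2], [0.5, 1.0]])     # assumed sensitivities, dataset 1
    J2 = np.array([[0.8, 0.4], [0.3, 1.2]])     # assumed sensitivities, dataset 2
    S11 = sigma**2 * np.eye(2)
    S22 = sigma**2 * np.eye(2)
    S12 = rho * sigma**2 * np.eye(2)            # tunable cross-covariance
    Sigma = np.block([[S11, S12], [S12.T, S22]])
    J = np.vstack([J1, J2])

    # Pseudo-data drawn from the full correlated model.
    y = J @ theta_true + rng.multivariate_normal(np.zeros(4), Sigma)
    y1, y2 = y[:2], y[2:]
    mean0, cov0 = np.zeros(2), np.eye(2)        # assumed prior

    # (a) Joint inference with the full covariance.
    joint = gaussian_update(mean0, cov0, J, y, Sigma)

    # Stage 1 of both sequential schemes: update on dataset 1 alone.
    stage1 = gaussian_update(mean0, cov0, J1, y1, S11)

    # (b) Factorized update: uses the marginal likelihood p(y2 | theta).
    factorized = gaussian_update(*stage1, J2, y2, S22)

    # (c) Conditional update: uses p(y2 | y1, theta), i.e.
    #     y2 - A y1 = (J2 - A J1) theta + noise, with A = S21 S11^{-1}.
    A = S12.T @ np.linalg.inv(S11)
    J_cond = J2 - A @ J1
    S_cond = S22 - A @ S12
    conditional = gaussian_update(*stage1, J_cond, y2 - A @ y1, S_cond)

    return {
        "joint": joint,
        "factorized": factorized,
        "conditional": conditional,
        # Fisher-information pieces for this toy model.
        "info_1": J1.T @ np.linalg.inv(S11) @ J1,
        "info_2_marginal": J2.T @ np.linalg.inv(S22) @ J2,
        "info_2_conditional": J_cond.T @ np.linalg.inv(S_cond) @ J_cond,
        "info_joint": J.T @ np.linalg.inv(Sigma) @ J,
    }


for rho in (0.0, 0.4, 0.8):
    out = compare_updates(rho)
    dev_fact = np.linalg.norm(out["factorized"][0] - out["joint"][0])
    dev_cond = np.linalg.norm(out["conditional"][0] - out["joint"][0])
    print(f"rho={rho:.1f}  |factorized - joint|={dev_fact:.2e}  "
          f"|conditional - joint|={dev_cond:.2e}")
```

In this toy model, `info_1 + info_2_conditional` equals `info_joint` exactly, whereas the factorized update implicitly adds `info_2_marginal` instead. The difference between the marginal and conditional information matrices is one simple way to see where the factorized posterior departs from the joint one; the information decomposition in the preprint may be defined differently.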