stat.AP
AG-2025.12-1480
stat.AP
Huanbiao Zhu, Krish Desai, Mikael Kuusela, Vinicius Mikuni, Benjamin Nachman, Larry Wasserman
Statistically correcting measured cross sections for detector effects is an important step across many applications. In particle physics, this inverse problem is known as unfolding. In cases with complex instruments, the distortions they introduce are often known only implicitly through simulations of the detector. Modern machine learning has enabled efficient simulation-based approaches for unfolding high-dimensional data. Among these, one of the first methods successfully deployed on experimental data is the OmniFold algorithm, a classifier-based Expectation-Maximization procedure. In practice, however, the forward model is only approximately specified, and the corresponding uncertainty is encoded through nuisance parameters. Building on the well-studied OmniFold algorithm, we show how to extend machine learning-based unfolding to incorporate nuisance parameters. Our new algorithm, called Profile OmniFold, is demonstrated using a Gaussian example as well as a particle physics case study using simulated data from the CMS Experiment at the Large Hadron Collider.
8 Dec 2025
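As a rough illustration of the classifier-based reweighting idea that the abstract above refers to, the sketch below fits a logistic classifier to separate simulated events from observed ones and turns its output into per-event weights via the likelihood-ratio identity p/(1-p). This is only a toy under stated assumptions: the one-dimensional Gaussian samples, the hand-written `fit_logistic_1d` helper, and all numerical settings are hypothetical, and the sketch covers neither the full Expectation-Maximization iteration nor the nuisance-parameter profiling that the paper introduces.

```python
import math
import random


def fit_logistic_1d(xs, ys, lr=1.0, steps=300):
    """Fit p(y=1 | x) = sigmoid(w*x + b) by full-batch gradient descent."""
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(steps):
        gw = gb = 0.0
        for x, y in zip(xs, ys):
            p = 1.0 / (1.0 + math.exp(-(w * x + b)))
            gw += (p - y) * x
            gb += p - y
        w -= lr * gw / n
        b -= lr * gb / n
    return w, b


rng = random.Random(0)
# Hypothetical toy samples: "simulation" and "observed data" differ by a shift.
sim = [rng.gauss(0.0, 1.0) for _ in range(2000)]
data = [rng.gauss(0.5, 1.0) for _ in range(2000)]

# Label simulation 0 and data 1, then train the classifier on the pooled sample.
w, b = fit_logistic_1d(sim + data, [0] * len(sim) + [1] * len(data))

# With equal sample sizes, p/(1-p) = exp(w*x + b) estimates the density ratio
# data/simulation, so it can serve as a per-event weight on the simulation.
weights = [math.exp(w * x + b) for x in sim]
reweighted_mean = sum(wt * x for wt, x in zip(weights, sim)) / sum(weights)
print(f"slope={w:.3f} intercept={b:.3f} reweighted mean={reweighted_mean:.3f}")
```

For this toy pair of Gaussians the exact log density ratio is linear in x, which is why a plain logistic model suffices here; realistic high-dimensional data would call for a more flexible classifier.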
AG-2025.09-036
stat.AP
Alex Leviyev, Francesco Iacovelli, Aaron Zimmerman
Bayesian inference plays a central role in scientific and engineering applications by enabling principled reasoning under uncertainty. However, sampling from generic probability distributions remains a computationally demanding task. This difficulty is compounded when the distributions are ill-conditioned, multi-modal, or supported on topologically non-Euclidean spaces. Motivated by challenges in gravitational wave parameter estimation, we propose simulating a Langevin diffusion augmented with a birth-death process. The dynamics are rescaled with a simple preconditioner, and generalized to apply to product spaces of a hypercube and a hypertorus. Our method is first-order and embarrassingly parallel with respect to model evaluations, making it well-suited for algorithmic differentiation and modern hardware accelerators. We validate the algorithm on a suite of toy problems and successfully apply it to recover the parameters of GW150914, the first observed binary black hole merger. This approach addresses key limitations of traditional sampling methods and introduces a template that can be used to design robust samplers in the future.
2 Sept 2025
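To make the Langevin component of the abstract above concrete, here is a minimal sketch of the standard unadjusted Langevin update, x <- x + eps * grad log p(x) + sqrt(2 * eps) * noise, applied independently to an ensemble of particles. It is a toy under stated assumptions: the one-dimensional Gaussian target, the `grad_log_p` and `langevin_ensemble` helpers, and the step size are all hypothetical, and the birth-death process, the preconditioner, and the hypercube/hypertorus geometry described in the abstract are deliberately left out.

```python
import math
import random


def grad_log_p(x, mu=2.0, sigma=1.0):
    """Score of a hypothetical 1-D Gaussian target N(mu, sigma^2)."""
    return -(x - mu) / sigma**2


def langevin_ensemble(n_particles=2000, n_steps=400, eps=0.05, seed=0):
    """Run independent unadjusted Langevin chains from a common start."""
    rng = random.Random(seed)
    xs = [0.0] * n_particles
    noise_scale = math.sqrt(2.0 * eps)
    for _ in range(n_steps):
        # Each particle needs only one gradient evaluation per step, and the
        # particles never interact, so this loop parallelizes trivially.
        xs = [x + eps * grad_log_p(x) + noise_scale * rng.gauss(0.0, 1.0)
              for x in xs]
    return xs


samples = langevin_ensemble()
mean = sum(samples) / len(samples)
var = sum((x - mean) ** 2 for x in samples) / len(samples)
print(f"ensemble mean={mean:.3f} variance={var:.3f}")
```

Because the update is not Metropolis-corrected, the ensemble variance is slightly inflated relative to the target (for this Gaussian target the stationary variance is 1 / (1 - eps / 2)), a bias that shrinks as the step size decreases.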
AG-2024.09-1051
stat.AP
Purvasha Chakravarti, Lucas Kania, Olaf Behnke, Mikael Kuusela, Larry Wasserman
Searches for signals of new physics in particle physics are usually done by training a supervised classifier to separate a signal model from the known Standard Model physics (also called the background model). However, even when the signal model is correct, systematic errors in the background model can influence supervised classifiers and might adversely affect the signal detection procedure. To tackle this problem, one approach is to use the (possibly misspecified) classifier only to perform a preliminary signal-enrichment step and then to carry out a signal detection test on the signal-rich sample. For this procedure to work, we need a classifier constrained to be decorrelated with one or more protected variables used for the signal-detection step. We do this by considering an optimal transport map of the classifier output that makes it independent of the protected variable(s) for the background. We then fit a semiparametric mixture model to the distribution of the protected variable after making cuts on the transformed classifier to detect the presence of a signal. We compare and contrast this decorrelation method with previous approaches, show that the decorrelation procedure is robust to moderate background misspecification, and analyze the power and validity of the signal detection test as a function of the cut on the classifier both with and without decorrelation. We conclude that decorrelation and signal enrichment help produce a stable, robust, valid, and more powerful test.
10 Sept 2024
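The decorrelation step in the abstract above can be illustrated in one dimension, where the optimal transport map onto a uniform distribution is simply the cumulative distribution function. The sketch below applies an empirical conditional CDF to a classifier score within bins of the protected variable, so that the transformed score is approximately uniform in every bin and therefore approximately independent of the protected variable. The binning, the `decorrelate` and `pearson` helpers, and the synthetic background sample are all assumptions made for illustration; the abstract does not specify how the paper estimates the map.

```python
import random


def pearson(a, b):
    """Plain Pearson correlation coefficient of two equal-length lists."""
    n = len(a)
    ma, mb = sum(a) / n, sum(b) / n
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    va = sum((x - ma) ** 2 for x in a)
    vb = sum((y - mb) ** 2 for y in b)
    return cov / (va * vb) ** 0.5


def decorrelate(scores, protected, n_bins=10):
    """Map each score to its empirical CDF value within its protected-variable bin."""
    lo, hi = min(protected), max(protected)
    width = (hi - lo) / n_bins
    bins = {}
    for i, m in enumerate(protected):
        k = min(int((m - lo) / width), n_bins - 1)
        bins.setdefault(k, []).append(i)
    out = [0.0] * len(scores)
    for idx in bins.values():
        ranked = sorted(idx, key=lambda i: scores[i])
        for r, i in enumerate(ranked):
            out[i] = (r + 0.5) / len(ranked)  # mid-rank, roughly uniform on (0, 1)
    return out


rng = random.Random(0)
# Hypothetical background sample: the score leaks the protected variable m.
m = [rng.random() for _ in range(5000)]
score = [mi + rng.gauss(0.0, 0.3) for mi in m]
flat = decorrelate(score, m)
corr_before = pearson(score, m)
corr_after = pearson(flat, m)
print(f"correlation before={corr_before:.3f} after={corr_after:.3f}")
```

A cut on the transformed score then retains roughly the same fraction of background events in every bin of the protected variable, which is the property that keeps the background shape stable after signal enrichment.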