AG-2026.04-1095·hep-ph·cross-listed: cs.LG
Lecture notes on Machine Learning applications for global fits
Authors
- Jorge Alda
Abstract
These lecture notes provide a comprehensive framework for performing global statistical fits in high-energy physics using modern Machine Learning (ML) surrogates. We begin by reviewing the statistical foundations of model building, including the likelihood function, Wilks' theorem, and profile likelihoods. Recognizing that the computational cost of evaluating model predictions often renders traditional minimization prohibitive, we introduce Boosted Decision Trees to approximate the log-likelihood function. The notes detail a robust ML workflow including efficient generation of training data with active learning and Gaussian processes, hyperparameter optimization, model compilation for speed-up, and interpretability through SHAP values to decode the influence of model parameters and interactions between parameters. We further discuss posterior distribution sampling using Markov Chain Monte Carlo (MCMC). These techniques are finally applied to the $B^\pm \to K^\pm ν\barν$ anomaly at Belle II, demonstrating how a two-stage ML model can efficiently explore the parameter space of Axion-Like Particles (ALPs) while satisfying stringent experimental constraints on decay lengths and flavor-violating couplings.
Submitted
8 April 20262 months ago
Version
v1
License
CC-BY-4.0
DOI
10.48550/arXiv.2604.07520
Summary
These lecture notes show how to use machine learning—especially decision trees—to speed up statistical fits in particle physics, applying the method to search for exotic particles called axion-like particles in rare B-meson decays.
- Machine learning surrogates can replace expensive physics simulations during parameter fitting, making searches feasible that would otherwise be computationally prohibitive.
- The workflow combines active learning (smart data selection), hyperparameter tuning, and interpretability tools (SHAP) so physicists understand which parameters matter and how they interact.
- Testing on a real Belle II anomaly shows the approach works: a two-stage ML model efficiently maps out allowed values for exotic particle properties while respecting experimental constraints.
curious · generated by claude-haiku-4-5
Chat with this PDF
Ask questions, probe assumptions, request a plain-English summary. Answers cite sections from the preprint itself.
Community
Questions and answers about this paper from other readers. No formal peer review — just a place to think out loud.