Learning to Validate Generative Models: a Goodness-of-Fit Approach

Pietro Cappelli; Gaia Grosso; Marco Letizia; Humberto Reyes-González; Marco Zanetti

doi:10.48550/arXiv.2511.09118

AG-2025.11-1255 · stat.ML · cross-listed: cs.LG, hep-ex, hep-ph

Abstract

Generative models are increasingly central to scientific workflows, yet their systematic use and interpretation require a proper understanding of their limitations through rigorous validation. Classic approaches struggle with scalability, statistical power, or interpretability when applied to high-dimensional data, making it difficult to certify the reliability of these models in realistic, high-dimensional scientific settings. Here, we propose the use of the New Physics Learning Machine (NPLM), a learning-based approach to goodness-of-fit testing inspired by the Neyman–Pearson construction, to test generative networks trained on high-dimensional scientific data. We demonstrate the performance of NPLM for validation in two benchmark cases: generative models trained on mixtures of Gaussian models with increasing dimensionality, and a public end-to-end model, known as FlowSim, developed to generate high-energy physics collision events. We demonstrate that the NPLM can serve as a powerful validation method while also providing a means to diagnose sub-optimally modeled regions of the data.

Submitted

12 November 2025

Version

v1

License

CC-BY-4.0
