math.ST — preprints · Antigravity

AG-2026.06-378

math.ST

Constraint residuals, graph posteriors, and determinant-corrected full-space targets in Bayesian inverse problems

Jonathon Cottom, Emilia Olsson

Bayesian inverse problems constrained by state equations are often sampled in a full parameter-state space by penalising the residual, rather than in a reduced space where the state is eliminated. We show that these formulations are not automatically equivalent as posterior measures. For finite-dimensional discretisations of equality-constrained inverse problems, assume the state equation $c(θ,u)=0$ has a unique solution $u=G(θ)$ and nonsingular state Jacobian $\D_u c$. The reduced posterior, its graph lift, and the zero-noise residual posterior are then distinct. A local change of variables shows that an uncorrected Gaussian residual penalty converges, after marginalisation over $u$, to the reduced density multiplied by $\abs{\det \D_u c(θ,G(θ))}^{-1}$. Thus algebraically equivalent residuals can define the same feasible set but different limiting posteriors. We derive determinant corrections for unweighted, weighted, and rescaled residual penalties that have the graph-lifted reduced posterior as their hard-constraint limit. The result separates feasibility from posterior calibration: driving the residual to zero is not sufficient for exact sampling of the graph-lifted reduced posterior unless the sampling or correction step targets the corresponding corrected density.

8 Jun 2026

2w ago

cond-mat.stat-mechmath-phmath.NA

AG-2024.07-2330

math.ST

Matrix Majorization in Large Samples with Varying Support Restrictions

Frits Verhagen, Marco Tomamichel, Erkka Haapasalo

We say that a matrix $P$ with non-negative entries majorizes another such matrix $Q$ if there is a stochastic matrix $T$ such that $Q=TP$. We study matrix majorization in large samples and in the catalytic regime in the case where the columns of the matrices need not have equal support, as has been assumed in earlier works. We focus on two cases: either there are no support restrictions (except for requiring a non-empty intersection for the supports) or the final column dominates the others. Using real-algebraic methods, we identify sufficient and almost necessary conditions for majorization in large samples or when using catalytic states under these support conditions. These conditions are given in terms of multivariate divergences that generalize the Rényi divergences. We notice that varying support conditions dramatically affect the relevant set of divergences. Our results find an application in the theory of catalytic state transformation in quantum thermodynamics.

23 Jul 2024

cs.ITmath.PRquant-ph

AG-2015.03-1073

math.ST

Consistency of community detection in networks under degree-corrected stochastic block models

Yunpeng Zhao, Elizaveta Levina, Ji Zhu

Community detection is a fundamental problem in network analysis, with applications in many diverse areas. The stochastic block model is a common tool for model-based community detection, and asymptotic tools for checking consistency of community detection under the block model have been recently developed. However, the block model is limited by its assumption that all nodes within a community are stochastically equivalent, and provides a poor fit to networks with hubs or highly varying node degrees within communities, which are common in practice. The degree-corrected stochastic block model was proposed to address this shortcoming and allows variation in node degrees within a community while preserving the overall block community structure. In this paper we establish general theory for checking consistency of community detection under the degree-corrected stochastic block model and compare several community detection criteria under both the standard and the degree-corrected models. We show which criteria are consistent under which models and constraints, as well as compare their relative performance in practice. We find that methods based on the degree-corrected block model, which includes the standard block model as a special case, are consistent under a wider class of models and that modularity-type methods require parameter constraints for consistency, whereas likelihood-based methods do not. On the other hand, in practice, the degree correction involves estimating many more parameters, and empirically we find it is only worth doing if the node degrees within communities are indeed highly variable. We illustrate the methods on simulated networks and on a network of political blogs.

17 Mar 2015

cs.SIphysics.soc-phstat.TH

AG-2015.02-1326

math.ST

Kramers-type effective Reactive Flow in Structured-noise Environments

Chun-Yang Wang

The non-Markovian features of three typical anomalous diffusing systems are studied by analytically solving the generalized Langevin equation directly driven by three kind of internal structured-noises: harmonic noise, harmonic velocity noise and harmonic acceleration noise, respectively. The time-dependent reaction rate and the transmission coefficient are calculated by using of the reactive flux method. A startling behavior of Kramers-type effective reactive flow is witnessed in the harmonic noise and harmonic acceleration noise systems.

22 Feb 2015

physics.chem-phstat.TH

AG-2014.04-520

math.ST

A Linear Iterative Unfolding Method

Andras Laszlo

A frequently faced task in experimental physics is to measure the probability distribution of some quantity. Often this quantity to be measured is smeared by a non-ideal detector response or by some physical process. The procedure of removing this smearing effect from the measured distribution is called unfolding, and is a delicate problem in signal processing, due to the well-known numerical ill behavior of this task. Various methods were invented which, given some assumptions on the initial probability distribution, try to regularize the unfolding problem. Most of these methods definitely introduce bias into the estimate of the initial probability distribution. We propose a linear iterative method, which has the advantage that no assumptions on the initial probability distribution is needed, and the only regularization parameter is the stopping order of the iteration, which can be used to choose the best compromise between the introduced bias and the propagated statistical and systematic errors. The method is consistent: "binwise" convergence to the initial probability distribution is proved in absence of measurement errors under a quite general condition on the response function. This condition holds for practical applications such as convolutions, calorimeter response functions, momentum reconstruction response functions based on tracking in magnetic field etc. In presence of measurement errors, explicit formulae for the propagation of the three important error terms is provided: bias error, statistical error, and systematic error. A trade-off between these three error terms can be used to define an optimal iteration stopping criterion, and the errors can be estimated there. We provide a numerical C library for the implementation of the method, which incorporates automatic statistical error propagation as well.

6 Apr 2014

physics.data-anstat.APstat.TH

AG-2013.11-1147

math.ST

Estimating Functions of Distributions Defined over Spaces of Unknown Size

David H. Wolpert, Simon DeDeo

We consider Bayesian estimation of information-theoretic quantities from data, using a Dirichlet prior. Acknowledging the uncertainty of the event space size $m$ and the Dirichlet prior's concentration parameter $c$, we treat both as random variables set by a hyperprior. We show that the associated hyperprior, $P(c, m)$, obeys a simple "Irrelevance of Unseen Variables" (IUV) desideratum iff $P(c, m) = P(c) P(m)$. Thus, requiring IUV greatly reduces the number of degrees of freedom of the hyperprior. Some information-theoretic quantities can be expressed multiple ways, in terms of different event spaces, e.g., mutual information. With all hyperpriors (implicitly) used in earlier work, different choices of this event space lead to different posterior expected values of these information-theoretic quantities. We show that there is no such dependence on the choice of event space for a hyperprior that obeys IUV. We also derive a result that allows us to exploit IUV to greatly simplify calculations, like the posterior expected mutual information or posterior expected multi-information. We also use computer experiments to favorably compare an IUV-based estimator of entropy to three alternative methods in common use. We end by discussing how seemingly innocuous changes to the formalization of an estimation problem can substantially affect the resultant estimates of posterior expectations.

18 Nov 2013

physics.data-anq-bio.QMstat.TH

AG-2013.10-1213

math.ST

On the graph-theoretical interpretation of Pearson correlations in a multivariate process and a novel partial correlation measure

Jakob Runge

The dependencies of the lagged (Pearson) correlation function on the coefficients of multivariate autoregressive models are interpreted in the framework of time series graphs. Time series graphs are related to the concept of Granger causality and encode the conditional independence structure of a multivariate process. The authors show that the complex dependencies of the Pearson correlation coefficient complicate an interpretation and propose a novel partial correlation measure with a straightforward graph-theoretical interpretation. The novel measure has the additional advantage that its sampling distribution is not affected by serial dependencies like that of the Pearson correlation coefficient. In an application to climatological time series the potential of the novel measure is demonstrated.

18 Oct 2013

physics.data-anstat.APstat.TH

AG-2013.10-1327

math.ST

Statistical Curse of the Second Half Rank, Eulerian numbers and Stirling numbers

Stephane Ouvry

I describe the occurence of Eulerian numbers and Stirling numbers of the second kind in the combinatorics of the Statistical Curse of the Second Half Rank problem.

14 Oct 2013

cond-mat.stat-mechmath.HOstat.TH

AG-2013.09-1842

math.ST

Measurement of statistical evidence on an absolute scale following thermodynamic principles

V. J. Vieland, J. Das, S. E. Hodge, S. -C. Seok

Statistical analysis is used throughout biomedical research and elsewhere to assess strength of evidence. We have previously argued that typical outcome statistics (including p-values and maximum likelihood ratios) have poor measure-theoretic properties: they can erroneously indicate decreasing evidence as data supporting an hypothesis accumulate; and they are not amenable to calibration, necessary for meaningful comparison of evidence across different study designs, data types, and levels of analysis. We have also previously proposed that thermodynamic theory, which allowed for the first time derivation of an absolute measurement scale for temperature (T), could be used to derive an absolute scale for evidence (E). Here we present a novel thermodynamically-based framework in which measurement of E on an absolute scale, for which "one degree" always means the same thing, becomes possible for the first time. The new framework invites us to think about statistical analyses in terms of the flow of (evidential) information, placing this work in the context of a growing literature on connections among physics, information theory, and statistics.

30 Sept 2013

cs.ITmath.ITphysics.data-an+2

AG-2013.09-1538

math.ST

Theoretical foundations and mathematical formalism of the power-law tailed statistical distributions

G. Kaniadakis

We present the main features of the mathematical theory generated by the κ-deformed exponential function exp_κ(x)=(\sqrt{1+κ^2 x^2}+κx)^{1/κ}, with 0<κ<1, developed in the last twelve years, which turns out to be a continuous one parameter deformation of the ordinary mathematics generated by the Euler exponential function. The κ-mathematics has its roots in special relativity and furnishes the theoretical foundations of the κ-statistical mechanics predicting power law tailed statistical distributions which have been observed experimentally in many physical, natural and artificial systems. After introducing the κ-algebra we present the associated κ-differential and κ-integral calculus. Then we obtain the corresponding κ-exponential and κ-logarithm functions and give the κ-version of the main functions of the ordinary mathematics.

25 Sept 2013

hep-thmath-phmath.MP+2

AG-2013.08-3727

math.ST

Examples of Application of Nonparametric Information Geometry to Statistical Physics

Giovanni Pistone

We review a nonparametric version of Amari's Information Geometry in which the set of positive probability densities on a given sample space is endowed with an atlas of charts to form a differentiable manifold modeled on Orlicz Banach spaces. This nonparametric setting is used to discuss the setting of typical problems in Machine Learning and Statistical Physics, such as relaxed optimization, Kullback-Leibler divergence, Boltzmann entropy, Boltzmann equation

24 Aug 2013

math-phmath.MPstat.TH

AG-2013.07-1900

math.ST

A Fractional Generalization of the Poisson Processes and Some of its Properties

Nicy Sebastian, Rudolf Gorenflo

We have provided a fractional generalization of the Poisson renewal processes by replacing the first time derivative in the relaxation equation of the survival probability by a fractional derivative of order $α~(0 < α\leq 1)$. A generalized Laplacian model associated with the Mittag-Leffler distribution is examined. We also discuss some properties of this new model and its relevance to time series. Distribution of gliding sums, regression behaviors and sample path properties are studied. Finally we introduce the $q$-Mittag-Leffler process associated with the $q$-Mittag-Leffler distribution.

31 Jul 2013

math-phmath.MPstat.AP+1

AG-2013.07-977

math.ST

Bayesian estimate of the degree of a polynomial given a noisy data sample

Giovanni Mana, Paolo Alberto Giuliano Albo, Simona Lago

A widely used method to create a continuous representation of a discrete data-set is regression analysis. When the regression model is not based on a mathematical description of the physics underlying the data, heuristic techniques play a crucial role and the model choice can have a significant impact on the result. In this paper, the problem of identifying the most appropriate model is formulated and solved in terms of Bayesian selection. Besides, probability calculus is the best way to choose among different alternatives. The results obtained are applied to the case of both univariate and bivariate polynomials used as trial solutions of systems of thermodynamic partial differential equations.

17 Jul 2013

math.PRphysics.data-anstat.TH

AG-2013.04-2271

math.ST

Asymptotic Results on Adaptive False Discovery Rate Controlling Procedures Based on Kernel Estimators

Pierre Neuvial

The False Discovery Rate (FDR) is a commonly used type I error rate in multiple testing problems. It is defined as the expected False Discovery Proportion (FDP), that is, the expected fraction of false positives among rejected hypotheses. When the hypotheses are independent, the Benjamini-Hochberg procedure achieves FDR control at any pre-specified level. By construction, FDR control offers no guarantee in terms of power, or type II error. A number of alternative procedures have been developed, including plug-in procedures that aim at gaining power by incorporating an estimate of the proportion of true null hypotheses. In this paper, we study the asymptotic behavior of a class of plug-in procedures based on kernel estimators of the density of the $p$-values, as the number $m$ of tested hypotheses grows to infinity. In a setting where the hypotheses tested are independent, we prove that these procedures are asymptotically more powerful in two respects: (i) a tighter asymptotic FDR control for any target FDR level and (ii) a broader range of target levels yielding positive asymptotic power. We also show that this increased asymptotic power comes at the price of slower, non-parametric convergence rates for the FDP. These rates are of the form $m^{-k/(2k+1)}$, where $k$ is determined by the regularity of the density of the $p$-value distribution, or, equivalently, of the test statistics distribution. These results are applied to one- and two-sided tests statistics for Gaussian and Laplace location models, and for the Student model.

20 Apr 2013

physics.data-anq-bio.QMstat.AP+2

AG-2013.03-2769

math.ST

Needlet-Whittle Estimates on the Unit Sphere

Claudio Durastanti, Xiaohong Lan, Domenico Marinucci

We study the asymptotic behaviour of needlets-based approximate maximum likelihood estimators for the spectral parameters of Gaussian and isotropic spherical random fields. We prove consistency and asymptotic Gaussianity, in the high-frequency limit, thus generalizing earlier results by Durastanti et al. (2011) based upon standard Fourier analysis on the sphere. The asymptotic results are then illustrated by an extensive Monte Carlo study.

1 Mar 2013

astro-ph.IMmath.PRstat.TH

AG-2013.01-1201

math.ST

Maximum Fidelity

Ali Kinkhabwala

The most fundamental problem in statistics is the inference of an unknown probability distribution from a finite number of samples. For a specific observed data set, answers to the following questions would be desirable: (1) Estimation: Which candidate distribution provides the best fit to the observed data?, (2) Goodness-of-fit: How concordant is this distribution with the observed data?, and (3) Uncertainty: How concordant are other candidate distributions with the observed data? A simple unified approach for univariate data that addresses these traditionally distinct statistical notions is presented called "maximum fidelity". Maximum fidelity is a strict frequentist approach that is fundamentally based on model concordance with the observed data. The fidelity statistic is a general information measure based on the coordinate-independent cumulative distribution and critical yet previously neglected symmetry considerations. An approximation for the null distribution of the fidelity allows its direct conversion to absolute model concordance (p value). Fidelity maximization allows identification of the most concordant model distribution, generating a method for parameter estimation, with neighboring, less concordant distributions providing the "uncertainty" in this estimate. Maximum fidelity provides an optimal approach for parameter estimation (superior to maximum likelihood) and a generally optimal approach for goodness-of-fit assessment of arbitrary models applied to univariate data. Extensions to binary data, binned data, multidimensional data, and classical parametric and nonparametric statistical tests are described. Maximum fidelity provides a philosophically consistent, robust, and seemingly optimal foundation for statistical inference. All findings are presented in an elementary way to be immediately accessible to all researchers utilizing statistical analysis.

22 Jan 2013

astro-ph.IMhep-exstat.TH

AG-2012.10-035

math.ST

Modeling stationary data by a class of generalised Ornstein-Uhlenbeck processes

Argimiro Arratia, Alejandra Cabaña, Enrique M. Cabaña

An Ornstein-Uhlenbeck (OU) process can be considered as a continuous time interpolation of the discrete time AR$(1)$ process. Departing from this fact, we analyse in this work the effect of iterating OU treated as a linear operator that maps a Wiener process onto Ornstein-Uhlenbeck process, so as to build a family of higher order Ornstein-Uhlenbeck processes, OU$(p)$, in a similar spirit as the higher order autoregressive processes AR$(p)$. We show that for $p \ge 2$ we obtain in general a process with covariances different than those of an AR$(p)$, and that for various continuous time processes, sampled from real data at equally spaced time instants, the OU$(p)$ model outperforms the appropriate AR$(p)$ model. Technically our composition of the OU operator is easy to manipulate and its parameters can be computed efficiently because, as we show, the iteration of OU operators leads to a process that can be expressed as a linear combination of basic OU processes. Using this expression we obtain a closed formula for the covariance of the iterated OU process, and consequently estimate the parameters of an OU$(p)$ process by maximum likelihood or, as an alternative, by matching correlations, the latter being a procedure resembling the method of moments.

1 Oct 2012

math-phmath.MPstat.TH

AG-2012.08-2322

math.ST

Supersymmetry approach to Wishart correlation matrices: Exact results

Christian Recher, Mario Kieburg, Thomas Guhr, Martin R. Zirnbauer

We calculate the `one-point function', meaning the marginal probability density function for any single eigenvalue, of real and complex Wishart correlation matrices. No explicit expression had been obtained for the real case so far. We succeed in doing so by using supersymmetry techniques to express the one-point function of real Wishart correlation matrices as a twofold integral. The result can be viewed as a resummation of a series of Jack polynomials in a non-trivial case. We illustrate our formula by numerical simulations. We also rederive a known expression for the one-point function of complex Wishart correlation matrices.

31 Aug 2012

math-phmath.MPstat.TH

AG-2012.07-1046

math.ST

Golden Ratio estimate of success probability based on one and only sample

Sun Ping

This paper proposes iterative Bayesian method to estimate success probability based on unique sample. The procedure is replacing the distribution characteristic of prior with Bayes estimate on the every iteration until they coincide. Iterative Bayes estimate is generally independent of hyperparameters. For binomial, Poisson, exponential and normal model, iterative limit is shown to be MLE in case the expectation of conjugate prior is replaced respectively. Particularly, suppose success appears in one and only trial, while the mode of triangle prior is replaced iterative Bayesian method gives $1/ϕ\approx 0.618$ ($ϕ$ is Golden Ratio) as the estimate of success probability $p$, this result reveals the truth of Golden Ratio from the point of statistics. Furthermore, under triangle prior the unique sample $X$ from binomial model $B(n,p)$ is considered. Existence and uniqueness of iterative Bayes estimator $\hat{p}_{IB}$ for parameter $p$ is given.

22 Jul 2012

quant-phstat.TH

AG-2012.07-2522

math.ST

Majority Dynamics and Aggregation of Information in Social Networks

Elchanan Mossel, Joe Neeman, Omer Tamuz

Consider n individuals who, by popular vote, choose among q >= 2 alternatives, one of which is "better" than the others. Assume that each individual votes independently at random, and that the probability of voting for the better alternative is larger than the probability of voting for any other. It follows from the law of large numbers that a plurality vote among the n individuals would result in the correct outcome, with probability approaching one exponentially quickly as n tends to infinity. Our interest in this paper is in a variant of the process above where, after forming their initial opinions, the voters update their decisions based on some interaction with their neighbors in a social network. Our main example is "majority dynamics", in which each voter adopts the most popular opinion among its friends. The interaction repeats for some number of rounds and is then followed by a population-wide plurality vote. The question we tackle is that of "efficient aggregation of information": in which cases is the better alternative chosen with probability approaching one as n tends to infinity? Conversely, for which sequences of growing graphs does aggregation fail, so that the wrong alternative gets chosen with probability bounded away from zero? We construct a family of examples in which interaction prevents efficient aggregation of information, and give a condition on the social network which ensures that aggregation occurs. For the case of majority dynamics we also investigate the question of unanimity in the limit. In particular, if the voters' social network is an expander graph, we show that if the initial population is sufficiently biased towards a particular alternative then that alternative will eventually become the unanimous preference of the entire population.

4 Jul 2012

cs.SIphysics.soc-phstat.TH

AG-2012.06-386

math.ST

Dynamic Bayesian Combination of Multiple Imperfect Classifiers

Edwin Simpson, Stephen Roberts, Ioannis Psorakis, Arfon Smith

Classifier combination methods need to make best use of the outputs of multiple, imperfect classifiers to enable higher accuracy classifications. In many situations, such as when human decisions need to be combined, the base decisions can vary enormously in reliability. A Bayesian approach to such uncertain combination allows us to infer the differences in performance between individuals and to incorporate any available prior knowledge about their abilities when training data is sparse. In this paper we explore Bayesian classifier combination, using the computationally efficient framework of variational Bayesian inference. We apply the approach to real data from a large citizen science project, Galaxy Zoo Supernovae, and show that our method far outperforms other established approaches to imperfect decision combination. We go on to analyse the putative community structure of the decision makers, based on their inferred decision making strategies, and show that natural groupings are formed. Finally we present a dynamic Bayesian classifier combination approach and investigate the changes in base classifier performance over time.

8 Jun 2012

astro-ph.IMstat.TH

AG-2012.05-3586

math.ST

Arbitrary Truncated Levy Flight: Asymmetrical Truncation and High-Order Correlations

Dmitry V. Vinogradov

The generalized correlation approach, which has been successfully used in statistical radio physics to describe non-Gaussian random processes, is proposed to describe stochastic financial processes. The generalized correlation approach has been used to describe a non-Gaussian random walk with independent, identically distributed increments in the general case, and high-order correlations have been investigated. The cumulants of an asymmetrically truncated Levy distribution have been found. The behaviors of asymmetrically truncated Levy flight, as a particular case of a random walk, are considered. It is shown that, in the Levy regime, high-order correlations between values of asymmetrically truncated Levy flight exist. The source of high-order correlations is the non-Gaussianity of the increments: the increment skewness generates threefold correlation, and the increment kurtosis generates fourfold correlation.

15 May 2012

cond-mat.stat-mechphysics.data-anq-fin.ST+1

AG-2012.03-2434

math.ST

High Dimensional Structure Learning of Ising Models on Sparse Random Graphs

Animashree Anandkumar, Vincent Tan, Alan Willsky

We consider the problem of learning the structure of ferromagnetic Ising models Markov on sparse Erdos-Renyi random graph. We propose simple local algorithms and analyze their performance in the regime of correlation decay. We prove that an algorithm based on a set of conditional mutual information tests is consistent for structure learning throughout the regime of correlation decay. This algorithm requires the number of samples to scale as ω(\log n), and has a computational complexity of O(n^4). A simpler algorithm based on correlation thresholding outputs a graph with a constant edit distance to the original graph when there is correlation decay, and the number of samples required is Ω(\log n). Under a more stringent condition, correlation thresholding is consistent for structure estimation. We finally prove a lower bound that Ω(c\log n) samples are also needed for consistent reconstruction of random graphs by any algorithm with positive probability, where c is the average degree. Thus, we establish that consistent structure estimation is possible with almost order-optimal sample complexity throughout the regime of correlation decay.

4 Mar 2012

cond-mat.stat-mechstat.TH

AG-2012.01-2223

math.ST

Fluctuation geometry: A counterpart approach of inference geometry

L Velazquez

Starting from an axiomatic perspective, \emph{fluctuation geometry} is developed as a counterpart approach of inference geometry. This approach is inspired on the existence of a notable analogy between the general theorems of \emph{inference theory} and the the \emph{general fluctuation theorems} associated with a parametric family of distribution functions $dp(I|θ)=ρ(I|θ)dI$, which describes the behavior of a set of \emph{continuous stochastic variables} driven by a set of control parameters $θ$. In this approach, statistical properties are rephrased as purely geometric notions derived from the \emph{Riemannian structure} on the manifold $\mathcal{M}_θ$ of stochastic variables $I$. Consequently, this theory arises as an alternative framework for applying the powerful methods of differential geometry for the statistical analysis. Fluctuation geometry has direct implications on statistics and physics. This geometric approach inspires a Riemannian reformulation of Einstein fluctuation theory as well as a geometric redefinition of the information entropy for a continuous distribution.

15 Jan 2012

cond-mat.stat-mechmath-phmath.MP+1

AG-2012.01-2451

math.ST

Estimating the bias of a noisy coin

Christopher Ferrie, Robin Blume-Kohout

Optimal estimation of a coin's bias using noisy data is surprisingly different from the same problem with noiseless data. We study this problem using entropy risk to quantify estimators' accuracy. We generalize the "add Beta" estimators that work well for noiseless coins, and we find that these hedged maximum-likelihood (HML) estimators achieve a worst-case risk of O(N^{-1/2}) on noisy coins, in contrast to O(1/N) in the noiseless case. We demonstrate that this increased risk is unavoidable and intrinsic to noisy coins, by constructing minimax estimators (numerically). However, minimax estimators introduce extreme bias in return for slight improvements in the worst-case risk. So we introduce a pointwise lower bound on the minimum achievable risk as an alternative to the minimax criterion, and use this bound to show that HML estimators are pretty good. We conclude with a survey of scientific applications of the noisy coin model in social science, physical science, and quantum information science.

6 Jan 2012

quant-phstat.TH

AG-2011.07-3140

math.ST

A Bayesian Approach to Detection of Small Low Emission Sources

Xiaolei Xun, Bani Mallick, Raymond J. Carroll, Peter Kuchment

The article addresses the problem of detecting presence and location of a small low emission source inside of an object, when the background noise dominates. This problem arises, for instance, in some homeland security applications. The goal is to reach the signal-to-noise ratio (SNR) levels on the order of $10^{-3}$. A Bayesian approach to this problem is implemented in 2D. The method allows inference not only about the existence of the source, but also about its location. We derive Bayes factors for model selection and estimation of location based on Markov Chain Monte Carlo (MCMC) simulation. A simulation study shows that with sufficiently high total emission level, our method can effectively locate the source.

15 Jul 2011

nucl-thstat.TH

AG-2011.06-2191

math.ST

Philosophy and the practice of Bayesian statistics

Andrew Gelman, Cosma Rohilla Shalizi

A substantial school in the philosophy of science identifies Bayesian inference with inductive inference and even rationality as such, and seems to be strengthened by the rise and practical success of Bayesian statistics. We argue that the most successful forms of Bayesian statistics do not actually support that particular philosophy but rather accord much better with sophisticated forms of hypothetico-deductivism. We examine the actual role played by prior distributions in Bayesian models, and the crucial aspects of model checking and model revision, which fall outside the scope of Bayesian confirmation theory. We draw on the literature on the consistency of Bayesian updating and also on our experience of applied work in social science. Clarity about these matters should benefit not just philosophy of science, but also statistical practice. At best, the inductivist view has encouraged researchers to fit and compare models without checking them; at worst, theorists have actively discouraged practitioners from performing model checking because it does not fit into their framework.

28 Jun 2011

physics.data-anstat.TH

AG-2011.05-2533

math.ST

Inferring an optimal Fisher measure

S. P. Flego, A. Plastino, A. R. Plastino

It is well known that a suggestive relation exists that links Schrödinger's equation (SE) to the information-optimizing principle based on Fisher's information measure (FIM). We explore here an approach that will allow one to infer the optimal FIM compatible with a given amount of prior information without explicitly solving first the associated SE. This technique is based on the virial theorem and it provides analytic solutions for the physically relevant FIM, that which is minimal subject to the constraints posed by the prior information.

4 May 2011

quant-phstat.TH

AG-2011.05-1786

math.ST

On the Convergence of the Ensemble Kalman Filter

Jan Mandel, Loren Cobb, Jonathan D. Beezley

Convergence of the ensemble Kalman filter in the limit for large ensembles to the Kalman filter is proved. In each step of the filter, convergence of the ensemble sample covariance follows from a weak law of large numbers for exchangeable random variables, the continuous mapping theorem gives convergence in probability of the ensemble members, and $L^p$ bounds on the ensemble then give $L^p$ convergence.

3 May 2011

math.PRphysics.ao-phstat.TH

AG-2011.03-2086

math.ST

Critical moment definition and estimation, for finite size observation of log-exponential-power law random variables

Florian Angeletti, Eric Bertin, Patrice Abry

This contribution aims at studying the behaviour of the classical sample moment estimator, $S(n,q)= \sum_{k=1}^n X_k^{q}/n $, as a function of the number of available samples $n$, in the case where the random variables $X$ are positive, have finite moments at all orders and are naturally of the form $X= \exp Y$ with the tail of $Y$ behaving like $e^{-y^ρ}$. This class of laws encompasses and generalizes the classical example of the log-normal law. This form is motivated by a number of applications stemming from modern statistical physics or multifractal analysis. Borrowing heuristic and analytical results from the analysis of the Random Energy Model in statistical physics, a critical moment $q_c(n)$ is defined as the largest statistical order $q$ up to which the sample mean estimator $S(n,q)$ correctly accounts for the ensemble average $\E X^q$, for a given $n$. A practical estimator for the critical moment $q_c(n)$ is then proposed. Its statistical performance are studied analytically and illustrated numerically in the case of \emph{i.i.d.} samples. A simple modification is proposed to explicitly account for correlation amongst the observed samples. Estimation performance are then carefully evaluated by means of Monte-Carlo simulations in the practical case of correlated time series.

25 Mar 2011

cond-mat.stat-mechmath.PRstat.TH

AG-2011.03-2439

math.ST

Statistics of statisticians: Critical mass of statistics and operational research groups in the UK

Ralph Kenna, Bertrand Berche

Using a recently developed model, inspired by mean field theory in statistical physics, and data from the UK's Research Assessment Exercise, we analyse the relationship between the quality of statistics and operational research groups and the quantity researchers in them. Similar to other academic disciplines, we provide evidence for a linear dependency of quality on quantity up to an upper critical mass, which is interpreted as the average maximum number of colleagues with whom a researcher can communicate meaningfully within a research group. The model also predicts a lower critical mass, which research groups should strive to achieve to avoid extinction. For statistics and operational research, the lower critical mass is estimated to be 9 $\pm$ 3. The upper critical mass, beyond which research quality does not significantly depend on group size, is about twice this value.

6 Mar 2011

cs.DLphysics.soc-phstat.TH

AG-2011.02-2112

math.ST

Robust nonparametric detection of objects in noisy images

Mikhail A. Langovoy, Olaf Wittich

We propose a novel statistical hypothesis testing method for detection of objects in noisy images. The method uses results from percolation theory and random graph theory. We present an algorithm that allows to detect objects of unknown shapes in the presence of nonparametric noise of unknown level and of unknown distribution. No boundary shape constraints are imposed on the object, only a weak bulk condition for the object's interior is required. The algorithm has linear complexity and exponential accuracy and is appropriate for real-time systems. In this paper, we develop further the mathematical formalism of our method and explore important connections to the mathematical theory of percolation and statistical physics. We prove results on consistency and algorithmic complexity of our testing procedure. In addition, we address not only an asymptotic behavior of the method, but also a finite sample performance of our test.

23 Feb 2011

math-phmath.MPmath.PR+3

AG-2011.02-2130

math.ST

Statistical analysis of the Hirsch Index

Luca Pratelli, Alberto Baccini, Lucio Barabesi, Marzia Marcheselli

The Hirsch index (commonly referred to as h-index) is a bibliometric indicator which is widely recognized as effective for measuring the scientific production of a scholar since it summarizes size and impact of the research output. In a formal setting, the h-index is actually an empirical functional of the distribution of the citation counts received by the scholar. Under this approach, the asymptotic theory for the empirical h-index has been recently exploited when the citation counts follow a continuous distribution and, in particular, variance estimation has been considered for the Pareto-type and the Weibull-type distribution families. However, in bibliometric applications, citation counts display a distribution supported by the integers. Thus, we provide general properties for the empirical h-index under the small- and large-sample settings. In addition, we also introduce consistent nonparametric variance estimation, which allows for the implemention of large-sample set estimation for the theoretical h-index.

14 Feb 2011

cs.DLphysics.soc-phstat.TH

AG-2011.01-349

math.ST

Determination of Different Biological Factors on the Base of Dried Blood Spot Technology

V. K. Bozhenko, A. O. Ivanov, A. S. Mishchenko, A. A. Tuzhilin, A. M. Shishkin

It is well-known that distinct biological indices (analytes) have distinct variability. We try to use some mathematical algorithms to pick out a set of blood parameters which give an opportunity to retrieve the initial volume of the blood spotted, and use it to calculate exact concentrations of analyts interesting to a physician. For our analysis we used the database of biochemical blood parameters obtained in Russian Scientific Center of Roentgen-Radiology during 1995-2000, which includes more than 30000 of patients.

13 Jan 2011

physics.med-phstat.TH

AG-2010.12-2405

math.ST

The Geometry of Nonparametric Filament Estimation

Christopher R. Genovese, Marco Perone-Pacifico, Isabella Verdinelli, Larry Wasserman

We consider the problem of estimating filamentary structure from planar point process data. We make some connections with computational geometry and we develop nonparametric methods for estimating the filaments. We show that, under weak conditions, the filaments have a simple geometric representation as the medial axis of the data distribution's support. Our methods convert an estimator of the support's boundary into an estimator of the filaments. We also find the rates of convergence of our estimators.

12 Dec 2010

astro-ph.IMstat.TH

AG-2010.12-289

math.ST

A measure of statistical complexity based on predictive information

Samer A. Abdallah, Mark D. Plumbley

We introduce an information theoretic measure of statistical structure, called 'binding information', for sets of random variables, and compare it with several previously proposed measures including excess entropy, Bialek et al.'s predictive information, and the multi-information. We derive some of the properties of the binding information, particularly in relation to the multi-information, and show that, for finite sets of binary random variables, the processes which maximises binding information are the 'parity' processes. Finally we discuss some of the implications this has for the use of the binding information as a measure of complexity.

8 Dec 2010

cs.ITmath.ITphysics.data-an+1

AG-2010.11-1888

math.ST

Learning Networks of Stochastic Differential Equations

José Bento, Morteza Ibrahimi, Andrea Montanari

We consider linear models for stochastic dynamics. To any such model can be associated a network (namely a directed graph) describing which degrees of freedom interact under the dynamics. We tackle the problem of learning such a network from observation of the system trajectory over a time interval $T$. We analyze the $\ell_1$-regularized least squares algorithm and, in the setting in which the underlying network is sparse, we prove performance guarantees that are \emph{uniform in the sampling rate} as long as this is sufficiently high. This result substantiates the notion of a well defined `time complexity' for the network inference problem.

1 Nov 2010

cond-mat.stat-mechcs.ITcs.LG+2

AG-2010.09-2390

math.ST

Adaptive Nonparametric Regression on Spin Fiber Bundles

Claudio Durastanti, Daryl Geller, Domenico Marinucci

The construction of adaptive nonparametric procedures by means of wavelet thresholding techniques is now a classical topic in modern mathematical statistics. In this paper, we extend this framework to the analysis of nonparametric regression on sections of spin fiber bundles defined on the sphere. This can be viewed as a regression problem where the function to be estimated takes as its values algebraic curves (for instance, ellipses) rather than scalars, as usual. The problem is motivated by many important astrophysical applications, concerning for instance the analysis of the weak gravitational lensing effect, i.e. the distortion effect of gravity on the images of distant galaxies. We propose a thresholding procedure based upon the (mixed) spin needlets construction recently advocated by Geller and Marinucci (2008,2010) and Geller et al. (2008,2009), and we investigate their rates of convergence and their adaptive properties over spin Besov balls.

22 Sept 2010

astro-ph.COstat.TH

AG-2009.08-471

math.ST

One and two side generalisations of the log-Normal distribution by means of a new product definition

Silvio M. Duarte Queiros

In this manuscript we introduce a generalisation of the log-Normal distribution that is inspired by a modification of the Kaypten multiplicative process using the $q$-product of Borges [Physica A \textbf{340}, 95 (2004)]. Depending on the value of q the distribution increases the tail for small (when $q<1$) or large (when $q>1$) values of the variable upon analysis. The usual log-Normal distribution is retrieved when $q=1$. The main statistical features of this distribution are presented as well as a related random number generators and tables of quantiles of the Kolmogorov-Smirnov. Lastly, we illustrate the application of this distribution studying the adjustment of a set of variables of biological and financial origin.

29 Aug 2009

physics.data-anstat.APstat.TH

AG-2009.07-247

math.ST

Spin Needlets Spectral Estimation

Daryl Geller, Xiaohong Lan, Domenico Marinucci

We consider the statistical analysis of random sections of a spin fibre bundle over the sphere. These may be thought of as random fields that at each point p in $S^2$ take as a value a curve (e.g. an ellipse) living in the tangent plane at that point $T_{p}S^2$, rather than a number as in ordinary situations. The analysis of such fields is strongly motivated by applications, for instance polarization experiments in Cosmology. To investigate such fields, spin needlets were recently introduced by Geller and Marinucci (2008) and Geller et al. (2008). We consider the use of spin needlets for spin angular power spectrum estimation, in the presence of noise and missing observations, and we provide Central Limit Theorem results, in the high frequency sense; we discuss also tests for bias and asymmetries with an asymptotic justification.

20 Jul 2009

astro-ph.COmath.PRstat.TH

AG-2009.06-758

math.ST

The ensemble of random Markov matrices

Martin Horvat

The ensemble of random Markov matrices is introduced as a set of Markov or stochastic matrices with the maximal Shannon entropy. The statistical properties of the stationary distribution pi, the average entropy growth rate $h$ and the second largest eigenvalue nu across the ensemble are studied. It is shown and heuristically proven that the entropy growth-rate and second largest eigenvalue of Markov matrices scale in average with dimension of matrices d as h ~ log(O(d)) and nu ~ d^(-1/2), respectively, yielding the asymptotic relation h tau_c ~ 1/2 between entropy h and correlation decay time tau_c = -1/log|nu| . Additionally, the correlation between h and and tau_c is analysed and is decreasing with increasing dimension d.

18 Jun 2009

nlin.CDstat.TH

AG-2009.06-1194

math.ST

Observed Universality of Phase Transitions in High-Dimensional Geometry, with Implications for Modern Data Analysis and Signal Processing

David L. Donoho, Jared Tanner

We review connections between phase transitions in high-dimensional combinatorial geometry and phase transitions occurring in modern high-dimensional data analysis and signal processing. In data analysis, such transitions arise as abrupt breakdown of linear model selection, robust data fitting or compressed sensing reconstructions, when the complexity of the model or the number of outliers increases beyond a threshold. In combinatorial geometry these transitions appear as abrupt changes in the properties of face counts of convex polytopes when the dimensions are varied. The thresholds in these very different problems appear in the same critical locations after appropriate calibration of variables. These thresholds are important in each subject area: for linear modelling, they place hard limits on the degree to which the now-ubiquitous high-throughput data analysis can be successful; for robustness, they place hard limits on the degree to which standard robust fitting methods can tolerate outliers before breaking down; for compressed sensing, they define the sharp boundary of the undersampling/sparsity tradeoff in undersampling theorems. Existing derivations of phase transitions in combinatorial geometry assume the underlying matrices have independent and identically distributed (iid) Gaussian elements. In applications, however, it often seems that Gaussianity is not required. We conducted an extensive computational experiment and formal inferential analysis to test the hypothesis that these phase transitions are {\it universal} across a range of underlying matrix ensembles. The experimental results are consistent with an asymptotic large-$n$ universality across matrix ensembles; finite-sample universality can be rejected.

14 Jun 2009

cs.ITmath.ITphysics.data-an+2

AG-2009.04-168

math.ST

On The Dependence Structure of Wavelet Coefficients for Spherical Random Fields

Xiaohong Lan, Domenico Marinucci

We consider the correlation structure of the random coefficients for a wide class of wavelet systems on the sphere (Mexican needlets) which were recently introduced in the literature by Geller and Mayeli (2007). We provide necessary and sufficient conditions for these coefficients to be asymptotic uncorrelated in the real and in the frequency domain. Here, the asymptotic theory is developed in the high resolution sense. Statistical applications are also discussed, in particular with reference to the analysis of cosmological data.

24 Apr 2009

astro-phmath.PRstat.ME+1

AG-2008.07-070

math.ST

Adaptive density estimation for directional data using needlets

P. Baldi, G. Kerkyacharian, D. Marinucci, D. Picard

This paper is concerned with density estimation of directional data on the sphere. We introduce a procedure based on thresholding on a new type of spherical wavelets called {\it needlets}. We establish a minimax result and prove its optimality. We are motivated by astrophysical applications, in particular in connection with the analysis of ultra high energy cosmic rays.

31 Jul 2008

astro-phstat.TH

AG-2007.08-102

math.ST

Minimax and adaptive estimation of the Wigner function in quantum homodyne tomography with noisy data

Cristina Butucea, Madalin Guţa, Luis Artiles

We estimate the quantum state of a light beam from results of quantum homodyne measurements performed on identically prepared quantum systems. The state is represented through the Wigner function, a generalized probability density on $\mathbb{R}^2$ which may take negative values and must respect intrinsic positivity constraints imposed by quantum physics. The effect of the losses due to detection inefficiencies, which are always present in a real experiment, is the addition to the tomographic data of independent Gaussian noise. We construct a kernel estimator for the Wigner function, prove that it is minimax efficient for the pointwise risk over a class of infinitely differentiable functions, and implement it for numerical results. We construct adaptive estimators, that is, which do not depend on the smoothness parameters, and prove that in some setups they attain the minimax rates for the corresponding smoothness class.

14 Aug 2007

math.PRquant-phstat.TH

AG-2006.11-106

math.ST

Equi-energy sampler with applications in statistical inference and statistical mechanics

S. C. Kou, Qing Zhou, Wing Hung Wong

We introduce a new sampling algorithm, the equi-energy sampler, for efficient statistical sampling and estimation. Complementary to the widely used temperature-domain methods, the equi-energy sampler, utilizing the temperature--energy duality, targets the energy directly. The focus on the energy function not only facilitates efficient sampling, but also provides a powerful means for statistical estimation, for example, the calculation of the density of states and microcanonical averages in statistical mechanics. The equi-energy sampler is applied to a variety of problems, including exponential regression in statistics, motif sampling in computational biology and protein folding in biophysics.

8 Nov 2006

astro-phphysics.comp-phstat.TH

AG-2006.05-033

math.ST

The Hyperanalytic Wavelet Transform

S. C. Olhede, G. Metikas

In this paper novel classes of 2-D vector-valued spatial domain wavelets are defined, and their properties given. The wavelets are 2-D generalizations of 1-D analytic wavelets, developed from the Generalized Cauchy-Riemann equations and represented as quaternionic functions. Higher dimensionality complicates the issue of analyticity, more than one `analytic' extension of a real function is possible, and an `analytic' analysis wavelet will not necessarily construct `analytic' decomposition coefficients. The decomposition of locally unidirectional and/or separable variation is investigated in detail, and two distinct families of hyperanalytic wavelet coefficients are introduced, the monogenic and the hypercomplex wavelet coefficients. The recasting of the analysis in a different frame of reference and its effect on the constructed coefficients is investigated, important issues for sampled transform coefficients. The magnitudes of the coefficients are shown to exhibit stability with respect to shifts in phase. Hyperanalytic 2-D wavelet coefficients enable the retrieval of a phase-and-magnitude description of an image in phase space, similarly to the description of a 1-D signal with the use of 1-D analytic wavelets, especially appropriate for oscillatory signals. Existing 2-D directional wavelet decompositions are related to the newly developed framework, and new classes of mother wavelets are introduced.

23 May 2006

quant-phstat.TH