4.4 Article

EXTREME DECONVOLUTION: INFERRING COMPLETE DISTRIBUTION FUNCTIONS FROM NOISY, HETEROGENEOUS AND INCOMPLETE OBSERVATIONS

Journal

ANNALS OF APPLIED STATISTICS
Volume 5, Issue 2B, Pages 1657-1677

Publisher

INST MATHEMATICAL STATISTICS-IMS
DOI: 10.1214/10-AOAS439

Keywords

Bayesian inference; density estimation; Expectation-Maximization; missing data; multivariate estimation; noise

Funding

  1. NASA [NNX08AJ48G]
  2. NSF [AST-0908357]
  3. Direct For Mathematical & Physical Scien
  4. Division Of Astronomical Sciences [0908357] Funding Source: National Science Foundation
  5. NASA [100766, NNX08AJ48G] Funding Source: Federal RePORTER

Ask authors/readers for more resources

We generalize the well-known mixtures of Gaussians approach to density estimation and the accompanying Expectation-Maximization technique for finding the maximum likelihood parameters of the mixture to the case where each data point carries an individual d-dimensional uncertainty covariance and has unique missing data properties. This algorithm reconstructs the error-deconvolved or underlying distribution function common to all samples, even when the individual data points are samples from different distributions, obtained by convolving the underlying distribution with the heteroskedastic uncertainty distribution of the data point and projecting out the missing data directions. We show how this basic algorithm can be extended with conjugate priors on all of the model parameters and a split-and-merge procedure designed to avoid local maxima of the likelihood. We demonstrate the full method by applying it to the problem of inferring the three-dimensional velocity distribution of stars near the Sun from noisy two-dimensional, transverse velocity measurements from the Hipparcos satellite.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available