4.6 Article

TESTING GOODNESS-OF-FIT AND CONDITIONAL INDEPENDENCE WITH APPROXIMATE CO-SUFFICIENT SAMPLING

Journal

ANNALS OF STATISTICS
Volume 50, Issue 5, Pages 2514-2544

Publisher

INST MATHEMATICAL STATISTICS-IMS
DOI: 10.1214/22-AOS2187

Keywords

Goodness-of-fit test; approximate sufficiency; co-sufficiency; conditional randomization test; model-X; conditional independence testing; high-dimensional inference

Funding

  1. National Science Foundation [DMS-1654076]
  2. Office of Naval Research [N00014-20-1-2337]

Ask authors/readers for more resources

Goodness-of-fit testing is widely used in statistics and has various applications in model selection, confidence interval construction, conditional independence testing, and multiple testing. Current GoF tests for composite null hypotheses are often restricted in the choice of test statistic, except for co-sufficient sampling (CSS) testing which requires a compact sufficient statistic for the null model. In this paper, we propose approximate CSS (aCSS) testing that generalizes CSS testing to almost any parametric model with an asymptotically efficient estimator, overcoming the limitations of CSS testing.
Goodness-of-fit (GoF) testing is ubiquitous in statistics, with direct ties to model selection, confidence interval construction, conditional independence testing, and multiple testing, just to name a few applications. While testing the GoF of a simple (point) null hypothesis provides an analyst great flexibility in the choice of test statistic while still ensuring validity, most GoF tests for composite null hypotheses are far more constrained, as the test statistic must have a tractable distribution over the entire null model space. A notable exception is co-sufficient sampling (CSS): resampling the data conditional on a sufficient statistic for the null model guarantees valid GoF testing using any test statistic the analyst chooses. But CSS testing requires the null model to have a compact (in an information-theoretic sense) sufficient statistic, which only holds for a very limited class of models; even for a null model as simple as logistic regression, CSS testing is powerless. In this paper, we leverage the concept of approximate sufficiency to generalize CSS testing to essentially any parametric model with an asymptotically efficient estimator; we call our extension approximate CSS (aCSS) testing. We quantify the finite-sample Type I error inflation of aCSS testing and show that it is vanishing under standard maximum likelihood asymptotics, for any choice of test statistic. We apply our proposed procedure both theoretically and in simulation to a number of models of interest to demonstrate its finite-sample Type I error and power.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available