4.1 Article

Testing the equality of a large number of populations

Journal

TEST
Volume 31, Issue 1, Pages 1-21

Publisher

SPRINGER
DOI: 10.1007/s11749-021-00769-9

Keywords

k-Sample problem; Finite dimensional data; Hilbert data

Funding

  1. Spanish Ministry of Economy, Industry and Competitiveness, the State Agency of Investigation, the European Regional Development Fund [MTM2017-89422-P]
  2. Junta de Andalucia [P18-FR-2369]
  3. SiDOR research group through the Grant Competitive Reference Group, 2016-2019 - Conselleria de Cultura, Educacion e Ordenacion Universitaria, Xunta de Galicia [ED431C 2016/040]
  4. Universidad de Jaen [EI_SEJ5_2019]

Ask authors/readers for more resources

This paper examines the problem of testing for the equality of distributions of k independent samples with finite but arbitrary dimension, providing the asymptotic distribution of two test statistics under the null hypothesis and under alternatives. Both test statistics are shown to be asymptotically free distributed under the null hypothesis. The finite sample performance of the tests based on the asymptotic null distribution is studied through simulation, with an application to a real data set included.
Given k independent samples with finite but arbitrary dimension, this paper deals with the problem of testing for the equality of their distributions that can be continuous, discrete or mixed. In contrast to the classical setting where k is assumed to be fixed and the sample size from each population increases without bound, here k is assumed to be large and the size of each sample is either bounded or small in comparison with k. The asymptotic distribution of two test statistics is stated under the null hypothesis of the equality of the k distributions as well as under alternatives, which let us to study the asymptotic power of the resulting tests. Specifically, it is shown that both test statistics are asymptotically free distributed under the null hypothesis. The finite sample performance of the tests based on the asymptotic null distribution is studied via simulation. An application of the proposal to a real data set is included. The use of the proposed procedure for infinite dimensional data, as well as other possible extensions, are discussed.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.1
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available