4.4 Article

Addressing overdispersion and zero-inflation for clustered count data via new multilevel heterogenous hurdle models

Journal

JOURNAL OF APPLIED STATISTICS
Volume 50, Issue 2, Pages 408-433

Publisher

TAYLOR & FRANCIS LTD
DOI: 10.1080/02664763.2022.2096875

Keywords

Multilevel modeling; count data; overdispersion; zero-inflation; Poisson-Lindley distribution; Poisson-Ailamujia distribution

Ask authors/readers for more resources

Unobserved heterogeneity and excessive number of zeros are important challenges in count modeling. This study aims to merge different solutions to address these challenges, using recently developed heterogeneous distributions. The performance of the new models is evaluated through simulation studies and real-world examples.
Unobserved heterogeneity causing overdispersion and the excessive number of zeros take a prominent place in the methodological development on count modeling. An insight into the mechanisms that induce heterogeneity is required for better understanding of the phenomenon of overdispersion. When the heterogeneity is sourced by the stochastic component of the model, the use of a heterogenous Poisson distribution for this part encounters as an elegant solution. Hierarchical design of the study is also responsible for the heterogeneity as the unobservable effects at various levels also contribute to the overdispersion. Zero-inflation, heterogeneity and multilevel nature in the count data present special challenges in their own respect, however the presence of all in one study adds more challenges to the modeling strategies. This study therefore is designed to merge the attractive features of the separate strand of the solutions in order to face such a comprehensive challenge. This study differs from the previous attempts by the choice of two recently developed heterogeneous distributions, namely Poisson-Lindley (PL) and Poisson-Ailamujia (PA) for the truncated part. Using generalized linear mixed modeling settings, predictive performances of the multilevel PL and PA models and their hurdle counterparts were assessed within a comprehensive simulation study in terms of bias, precision and accuracy measures. Multilevel models were applied to two separate real world examples for the assessment of practical implications of the new models proposed in this study.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available