4.6 Article

Inference in skew generalized t-link models for clustered binary outcome via a parameter-expanded EM algorithm

Journal

PLOS ONE
Volume 16, Issue 4, Pages -

Publisher

PUBLIC LIBRARY SCIENCE
DOI: 10.1371/journal.pone.0249604

Keywords

-

Funding

  1. Centre d'Excellence Africain en Sciences Mathematiques et Applications (CEA-SMA)
  2. African German Network of Excellence in Science (AGNES), through the AGNES mobility grant for young scientists from sub Saharan Africa

Ask authors/readers for more resources

The study introduced a GLMM method based on skew generalized t distributions that outperforms traditional approaches in estimating population parameters and predicting random effects; Utilizing the Expectation-Maximization algorithm for model fitting and parameter expansion; In an application with respiratory infection data, the superiority of the method was demonstrated as the most adequate model.
Binary Generalized Linear Mixed Model (GLMM) is the most common method used by researchers to analyze clustered binary data in biological and social sciences. The traditional approach to GLMMs causes substantial bias in estimates due to steady shape of logistic and normal distribution assumptions thereby resulting into wrong and misleading decisions. This study brings forward an approach governed by skew generalized t distributions that belong to a class of potentially skewed and heavy tailed distributions. Interestingly, both the traditional logistic and probit mixed models, as well as other available methods can be utilized within the skew generalized t-link model (SGTLM) frame. We have taken advantage of the Expectation-Maximization algorithm accelerated via parameter-expansion for model fitting. We evaluated the performance of this approach to GLMMs through a simulation experiment by varying sample size and data distribution. Our findings indicated that the proposed methodology outperforms competing approaches in estimating population parameters and predicting random effects, when the traditional link and normality assumptions are violated. In addition, empirical standard errors and information criteria proved useful for detecting spurious skewness and avoiding complex models for probit data. An application with respiratory infection data points out to the superiority of the SGTLM which turns to be the most adequate model. In future, studies should focus on integrating the demonstrated flexibility in other generalized linear mixed models to enhance robust modeling.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available