4.7 Article

Heterogeneous Overdispersed Count Data Regressions via Double-Penalized Estimations

期刊

MATHEMATICS
卷 10, 期 10, 页码 -

出版社

MDPI
DOI: 10.3390/math10101700

关键词

negative binomial regressions; heterogeneous count data regression; estimation of dispersion parameter; oracle inequalities

资金

  1. National Natural Science Foundation of China [12101056]

向作者/读者索取更多资源

This paper investigates the variable selection and dispersion estimation for heterogeneous NBR models, which models the dispersion parameter as a function. The proposed double regression and double Li-penalty are used, and the oracle inequalities for the lasso estimators are proven. The consistency and convergence rate of the estimators are theoretical guarantees for further statistical inference.
Recently, the high-dimensional negative binomial regression (NBR) for count data has been widely used in many scientific fields. However, most studies assumed the dispersion parameter as a constant, which may not be satisfied in practice. This paper studies the variable selection and dispersion estimation for the heterogeneous NBR models, which model the dispersion parameter as a function. Specifically, we proposed a double regression and applied a double Li-penalty to both regressions. Under the restricted eigenvalue conditions, we prove the oracle inequalities for the lasso estimators of two partial regression coefficients for the first time, using concentration inequalities of empirical processes. Furthermore, derived from the oracle inequalities, the consistency and convergence rate for the estimators are the theoretical guarantees for further statistical inference. Finally, both simulations and a real data analysis demonstrate that the new methods are effective.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据