Statistics & Probability

Article Statistics & Probability

Jackknife empirical likelihood confidence intervals for the categorical Gini correlation

Sameera Hewage, Yongli Sang

Summary: This paper introduces a new method for measuring dependence, the categorical Gini correlation rho(g), and proposes a Jackknife empirical likelihood approach for constructing confidence intervals. Simulation studies and real data applications demonstrate competitive performance of the proposed method in terms of coverage accuracy and interval length.

JOURNAL OF STATISTICAL PLANNING AND INFERENCE (2024)

Article Statistics & Probability

A pair of novel priors for improving and extending the conditional MLE

Takemi Yanagimoto, Yoichi Miyata

Summary: A Bayesian estimator is proposed to improve the conditional maximum likelihood estimation by introducing a pair of priors. The conditional maximum likelihood estimation is explained using the posterior mode under a prior, and a promising estimator is defined using the posterior mean under a corresponding prior. The advantages of this approach include two different optimality properties of the induced estimator, the ease of various extensions, and the possible treatments for a finite sample size. The existing approaches are discussed and critiqued.

JOURNAL OF STATISTICAL PLANNING AND INFERENCE (2024)

Article Statistics & Probability

Subgroup analysis for the functional linear model

Yifan Sun, Ziyi Liu, Wu Wang

Summary: This paper extends the classical functional linear regression model to allow for heterogeneous coefficient functions among different subgroups of subjects. A penalization-based approach is proposed to simultaneously determine the number and structure of subgroups and coefficient functions within each subgroup. The paper provides an effective computational algorithm and establishes the oracle properties and estimation consistency of the model. Extensive numerical simulations demonstrate its superiority compared to competing methods, and an analysis of an air quality dataset leads to interesting findings and improved predictions.

JOURNAL OF STATISTICAL PLANNING AND INFERENCE (2024)

Article Statistics & Probability

A multidimensional objective prior distribution from a scoring rule

Isadora Antoniano-Villalobos, Cristiano Villa, Stephen G. Walker

Summary: Constructing objective priors for multidimensional parameter spaces is challenging, and a common approach assumes independence and uses standard objective methods to obtain marginal distributions. In this paper, a novel objective prior is proposed by extending the objective method for one-dimensional case, allowing for a dependence structure in multidimensional parameter spaces.

JOURNAL OF STATISTICAL PLANNING AND INFERENCE (2024)

Article Statistics & Probability

A comparison of likelihood-based methods for size-biased sampling

Victoria L. Leaver, Robert G. Clark, Pavel N. Krivitsky, Carole L. Birrell

Summary: This article compares three likelihood approaches to estimation under informative sampling and examines their efficiency and asymptotic variance. The study shows that sample likelihood estimation approaches the efficiency of full maximum likelihood estimation when the sample size tends to infinity and the sampling fraction tends to zero. However, when the sample size tends to infinity and the sampling fraction is not negligible, maximum likelihood estimation is more efficient due to considering the possibility of duplicate samples. Pseudo-likelihood estimation can perform poorly in certain cases. For a special case where the superpopulation is exponential and the selection is probability proportional to size, the anticipated variance of pseudo-likelihood estimation is infinite.

JOURNAL OF STATISTICAL PLANNING AND INFERENCE (2024)

Article Statistics & Probability

Time changes and stationarity issues for extended scalar autoregressive models

V. Girardin, R. Senoussi

Summary: This paper investigates different issues related to stationarity reduction in autoregressive models, including both continuous and discrete time cases. Necessary and sufficient conditions for autoregressive models to be weakly stationary are explored, with explicit formulas for the time changes. Furthermore, the issue of stationarity reduction for discrete sequences sampled from continuous time autoregressive processes is also considered.

JOURNAL OF STATISTICAL PLANNING AND INFERENCE (2024)

Article Statistics & Probability

Regression models for circular data based on nonnegative trigonometric sums

Juan Jose Fernandez-Duran, Maria Mercedes Gregorio-Dominguez

Summary: This paper presents the application of nonnegative trigonometric sums (NNTS) models in circular data analysis. Regression models for circular-dependent variables are constructed by fitting great circles on the parameter hypersphere, enabling the identification of different regions along the circle. The transformation of the original circular variable into a linear variable allows for the application of common linear regression methods in circular data analysis.

JOURNAL OF STATISTICAL PLANNING AND INFERENCE (2024)

Article Statistics & Probability

Construction of optimal supersaturated designs by the expansive replacement method

Hui Li, Liuqing Yang, Kashinath Chatterjee, Min-Qian Liu

Summary: Supersaturated design (SSD) plays a crucial role in factor screening, and E(f(NOD)) criterion is one of the most widely used criteria for evaluating multi-level and mixed-level SSDs. This paper provides methods to construct multi-level E(f(NOD)) optimal SSDs with general run sizes, which can also be extended to mixed-level SSDs. The main idea of these methods is to combine two processed generalized Hadamard matrices with the expansive replacement method. These proposed methods are easy to implement, and the non-orthogonality between any two columns of the resulting SSDs is well controlled by that of the source designs.

JOURNAL OF STATISTICAL PLANNING AND INFERENCE (2024)

Article Statistics & Probability

Maximum likelihood estimation of the log-concave component in a semi-parametric mixture with a standard normal density

Fadoua Balabdaoui, Harald Besdziek

Summary: The two-component mixture model with known background density, unknown signal density, and unknown mixing proportion has been studied in this paper. The log-concave MLE of the signal density is computed using the estimator of Patra & Sen (2016), and its consistency and convergence are shown. The performance of this method is evaluated through a simulation study.

JOURNAL OF STATISTICAL PLANNING AND INFERENCE (2024)

Article Computer Science, Interdisciplinary Applications

Multi-block alternating direction method of multipliers for ultrahigh dimensional quantile fused regression

Xiaofei Wu, Hao Ming, Zhimin Zhang, Zhenyu Cui

Summary: This paper proposes a model that combines quantile regression and fused LASSO penalty, and introduces an iterative algorithm based on ADMM to solve high-dimensional datasets. The paper proves the global convergence and comparable convergence rates of the algorithm, and analyzes the theoretical properties of the model. Numerical experimental results support the superior performance of the model.

COMPUTATIONAL STATISTICS & DATA ANALYSIS (2024)

Article Computer Science, Interdisciplinary Applications

Simultaneous confidence region of an embedded one-dimensional curve in multi-dimensional space

Hiroya Yamazoe, Kanta Naito

Summary: This paper focuses on the simultaneous confidence region of a one-dimensional curve embedded in multi-dimensional space. An estimator of the curve is obtained through local linear regression on each variable in multi-dimensional data. A method to construct a simultaneous confidence region based on this estimator is proposed, and theoretical results for the estimator and the region are developed. The effectiveness of the region is demonstrated through simulation studies and applications to artificial and real datasets.

COMPUTATIONAL STATISTICS & DATA ANALYSIS (2024)

Article Computer Science, Interdisciplinary Applications

Efficient and robust optimal design for quantile regression based on linear programming

Cheng Peng, Drew P. Kouri, Stan Uryasev

Summary: This paper introduces a novel optimal experimental design method for quantifying the distribution tails of uncertain system responses. The method minimizes the variance or conditional value-at-risk of the upper bound of the predicted quantile, and estimates the data uncertainty using quantile regression. The optimal design problems are solved as linear programming problems, making the proposed methods efficient even for large datasets.

COMPUTATIONAL STATISTICS & DATA ANALYSIS (2024)

Article Statistics & Probability

Almost sure polynomial stability and stabilization of stochastic differential systems with impulsive effects

Shuning Liu, Guangying Lv

Summary: This paper investigates the stability of stochastic differential equations with impulsive effects and provides sufficient conditions for obtaining stability. The research shows that impulse can stabilize stochastic differential equations.

STATISTICS & PROBABILITY LETTERS (2024)

Article Geosciences, Multidisciplinary

Data-driven modeling of wildfire spread with stochastic cellular automata and latent spatio-temporal dynamics

Nicholas Grieshop, Christopher K. Wikle

Summary: We propose a Bayesian stochastic cellular automata modeling approach to model the spread of wildfires with uncertainty quantification. The model considers a dynamic neighborhood structure and captures additional spatial information, allowing for accurate prediction of fire states.

SPATIAL STATISTICS (2024)

Article Statistics & Probability

Testing homogeneity in high dimensional data through random projections

Tao Qiu, Qintong Zhang, Yuanyuan Fang, Wangli Xu

Summary: This article introduces a method for testing the homogeneity of two random vectors. The method involves selecting two subspaces and projecting them onto one-dimensional spaces, using the Cramer-von Mises distance to construct the test statistic. The performance is enhanced by repeating this procedure and the effectiveness is demonstrated through numerical simulations.

JOURNAL OF MULTIVARIATE ANALYSIS (2024)

Article Computer Science, Interdisciplinary Applications

Nonparametric augmented probability weighting with sparsity

Xin He, Xiaojun Mao, Zhonglei Wang

Summary: This paper proposes a nonparametric imputation method with sparsity to estimate the finite population mean, using an efficient kernel method and sparse learning for estimation. An augmented inverse probability weighting framework is adopted to achieve a central limit theorem for the proposed estimator under regularity conditions.

COMPUTATIONAL STATISTICS & DATA ANALYSIS (2024)

Article Computer Science, Interdisciplinary Applications

Conditional-mean multiplicative operator models for count time series

Christian H. Weiss, Fukang Zhu

Summary: This study introduces a multiplicative error model (CMEMs) for discrete-valued count time series, which is closely related to the integer-valued generalized autoregressive conditional heteroscedasticity (INGARCH) models. It derives the stochastic properties and estimation approaches of different types of INGARCH-CMEMs, and demonstrates their performance and application through simulations and real-world data examples.

COMPUTATIONAL STATISTICS & DATA ANALYSIS (2024)

Article Statistics & Probability

Robust inference for subgroup analysis with general transformation models

Miao Han, Yuanyuan Lin, Wenxin Liu, Zhanfeng Wang

Summary: The article proposes a method based on maximum rank correlation and concave fusion to automatically determine the number of subgroups, identify subgroup structure, and estimate subgroup-specific covariate effects. The method can be used without prior grouping information and is applicable to handling censored data.

JOURNAL OF STATISTICAL PLANNING AND INFERENCE (2024)

Article Statistics & Probability

Concentration inequality and the weak law of large numbers for the sum of partly negatively dependent 9-subgaussian random variables

Yuta Tanoue

Summary: In this study, we examine the properties of 9-subgaussian random variables, including inequalities, concentration inequalities, and the law of large numbers. We also discuss the characteristics of m-acceptable 9-subgaussian random variables.

STATISTICS & PROBABILITY LETTERS (2024)

Article Statistics & Probability

Matrix-valued isotropic covariance functions with local extrema

Alfredo Alegria, Xavier Emery

Summary: This study contributes to covariance modeling by proposing new parametric families of isotropic matrix-valued functions that exhibit non-monotonic behaviors, such as hole effects and cross-dimples. The benefit of these models is demonstrated on a bivariate dataset of airborne particulate matter concentrations.

JOURNAL OF MULTIVARIATE ANALYSIS (2024)