4.7 Article

Sparse Markov Models for High-dimensional Inference

期刊

出版社

MICROTOME PUBL

关键词

Markov Chains; High-dimensional inference; Mixture Transition Distribution

向作者/读者索取更多资源

Finite-order Markov models are rarely applied in empirical work when the order is large relative to the sample size due to the exponential growth in the number of parameters and required sample size, as well as the difficulty in interpretation. This paper proposes a subclass of Markov models called Mixture of Transition Distribution models, which can effectively recover the lags and estimate the transition probabilities of high-dimensional MTD models when the set of relevant lags is sparse. The estimated model also allows straightforward interpretation. The key innovation is a recursive procedure for a priori selection of the relevant lags.
Finite-order Markov models are well-studied models for dependent finite alphabet data. Despite their generality, application in empirical work is rare when the order d is large relative to the sample size n (e.g., d = O(n)). Practitioners rarely use higher-order Markov models because (1) the number of parameters grows exponentially with the order, (2) the sample size n required to estimate each parameter grows exponentially with the order, and (3) the interpretation is often difficult. Here, we consider a subclass of Markov models called Mixture of Transition Distribution (MTD) models, proving that when the set of relevant lags is sparse (i.e., O(log(n))), we can consistently and efficiently recover the lags and estimate the transition probabilities of high-dimensional (d = O(n)) MTD models. Moreover, the estimated model allows straightforward interpretation. The key innovation is a recursive procedure for a priori selection of the relevant lags of the model. We prove a new structural result for the MTD and an improved martingale concentration inequality to prove our results. Using simulations, we show that our method performs well compared to other relevant methods. We also illustrate the usefulness of our method on weather data where the proposed method correctly recovers the long-range dependence.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据