Article

Sparse Markov Models for High-dimensional Inference

JOURNAL OF MACHINE LEARNING RESEARCH (2023)

Journal

JOURNAL OF MACHINE LEARNING RESEARCH

Volume 24, Issue -, Pages -

Publisher

MICROTOME PUBL

Keywords

Markov Chains; High-dimensional inference; Mixture Transition Distribution

Categories

Automation & Control Systems; Computer Science, Artificial Intelligence

Summary

Finite-order Markov models are rarely applied in empirical work when the order is large relative to the sample size, because the number of parameters and the required sample size grow exponentially with the order and the fitted model is hard to interpret. This paper studies a subclass of Markov models called Mixture of Transition Distribution (MTD) models and shows that, when the set of relevant lags is sparse, the lags can be recovered and the transition probabilities estimated consistently and efficiently, even for high-dimensional MTD models. The estimated model also allows straightforward interpretation. The key innovation is a recursive procedure for a priori selection of the relevant lags.

Abstract

Finite-order Markov models are well-studied models for dependent finite-alphabet data. Despite their generality, application in empirical work is rare when the order d is large relative to the sample size n (e.g., d = O(n)). Practitioners rarely use higher-order Markov models because (1) the number of parameters grows exponentially with the order, (2) the sample size n required to estimate each parameter grows exponentially with the order, and (3) the interpretation is often difficult. Here, we consider a subclass of Markov models called Mixture of Transition Distribution (MTD) models, proving that when the set of relevant lags is sparse (i.e., of size O(log(n))), we can consistently and efficiently recover the lags and estimate the transition probabilities of high-dimensional (d = O(n)) MTD models. Moreover, the estimated model allows straightforward interpretation. The key innovation is a recursive procedure for a priori selection of the relevant lags of the model. To obtain these results, we prove a new structural result for the MTD and an improved martingale concentration inequality. Using simulations, we show that our method performs well compared to other relevant methods. We also illustrate the usefulness of our method on weather data, where the proposed method correctly recovers the long-range dependence.
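To make the parameter-count argument concrete, the following is a minimal Python sketch, not the paper's estimator or its lag-selection procedure. It assumes the standard MTD form in which the next symbol is drawn by first picking one relevant lag j with probability lambda_j and then sampling from a transition matrix p_j applied to the symbol at that lag; the optional independent component is omitted for simplicity. The function names (`full_markov_param_count`, `mtd_param_count`, `simulate_mtd`), the chosen lags, and all numeric values are hypothetical illustrations.

```python
import random


def full_markov_param_count(alphabet_size, order):
    """Free parameters of an unrestricted Markov chain of the given order."""
    # One distribution (alphabet_size - 1 free values) per possible past.
    return alphabet_size ** order * (alphabet_size - 1)


def mtd_param_count(alphabet_size, num_relevant_lags):
    """Free parameters of an MTD model with one matrix per relevant lag.

    Assumption: no independent component, so the lag weights sum to one.
    """
    weights = num_relevant_lags - 1
    matrices = num_relevant_lags * alphabet_size * (alphabet_size - 1)
    return weights + matrices


def simulate_mtd(n, order, lag_weights, lag_matrices, alphabet_size, seed=0):
    """Simulate n symbols from an MTD chain.

    lag_weights:  dict mapping each relevant lag j to its weight lambda_j.
    lag_matrices: dict mapping each relevant lag j to a row-stochastic
                  matrix p_j, indexed as p_j[past_symbol][next_symbol].
    """
    rng = random.Random(seed)
    lags = list(lag_weights)
    weights = [lag_weights[j] for j in lags]
    # Arbitrary initial segment of length `order`.
    xs = [rng.randrange(alphabet_size) for _ in range(order)]
    for t in range(order, n):
        j = rng.choices(lags, weights=weights)[0]  # pick a relevant lag
        row = lag_matrices[j][xs[t - j]]           # condition on that lag only
        xs.append(rng.choices(range(alphabet_size), weights=row)[0])
    return xs


# Hypothetical example: binary alphabet, order 30, relevant lags {1, 30}.
lag_weights = {1: 0.6, 30: 0.4}
lag_matrices = {
    1: [[0.9, 0.1], [0.2, 0.8]],
    30: [[0.3, 0.7], [0.7, 0.3]],
}
sample = simulate_mtd(1000, 30, lag_weights, lag_matrices, alphabet_size=2)

print(full_markov_param_count(2, 30))  # 1073741824
print(mtd_param_count(2, 2))           # 5
```

Under these assumptions the unrestricted order-30 binary chain has 2^30 free parameters, while the sparse MTD with two relevant lags has five, which is the gap that makes estimation feasible when the set of relevant lags is small.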
