Article

Categorical Nature of Major Factor Selection via Information Theoretic Measurements

Journal

ENTROPY
Volume 23, Issue 12, Pages -

Publisher

MDPI
DOI: 10.3390/e23121684

Keywords

CEDA; conditional entropy; conditional mutual information; heterogeneity; information gain

This research selects collections of major factors embedded within response-versus-covariate dynamics using information theoretic measurements computed under the Categorical Exploratory Data Analysis (CEDA) paradigm, and explores their relevance to Wiener-Granger causality. The selection task identifies one chief collection and several secondary collections, each checked for reliability through algorithmic computations.
Without assuming any functional or distributional structure, we select collections of major factors embedded within response-versus-covariate (Re-Co) dynamics via two selection criteria, [C1: confirmable] and [C2: irreplaceable], which are based on information theoretic measurements. The two criteria are constructed within the computing paradigm called Categorical Exploratory Data Analysis (CEDA) and are linked to Wiener-Granger causality. All the information theoretic measurements, including conditional mutual information and entropy, are evaluated through the contingency table platform, which rests primarily on the categorical nature of all involved features of any data type, quantitative or qualitative. Our selection task identifies one chief collection, together with several secondary collections of major factors of various orders underlying the targeted Re-Co dynamics. Each selected collection is checked with algorithmically computed reliability against the finite sample phenomenon, as is each individual major factor within it. The development of our selection protocol is illustrated in detail through two experimental examples: a simple one and a complex one. We then apply this protocol to two data sets pertaining to the related but distinct pitching dynamics of two pitch types, slider and fastball, using multi-season data from one specific Major League Baseball (MLB) pitcher.
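To make the contingency table platform concrete, the following is a minimal sketch of how conditional entropy and mutual information can be computed from a table of counts. It is not the authors' CEDA implementation. The function names, the row/column convention (rows are categories of a covariate, columns are categories of the response), and the example counts are all assumptions chosen for illustration.

```python
from math import log


def entropy(counts):
    """Shannon entropy (in nats) of a distribution given as a list of counts."""
    total = sum(counts)
    return -sum((c / total) * log(c / total) for c in counts if c > 0)


def conditional_entropy(table):
    """H(response | covariate) from a contingency table.

    Assumed layout: each row holds the response counts for one
    covariate category. The result is the row-weighted average of
    the per-row entropies.
    """
    grand_total = sum(sum(row) for row in table)
    h = 0.0
    for row in table:
        row_total = sum(row)
        if row_total > 0:  # skip empty covariate categories
            h += (row_total / grand_total) * entropy(row)
    return h


def mutual_information(table):
    """I(response; covariate) = H(response) - H(response | covariate)."""
    column_totals = [sum(col) for col in zip(*table)]
    return entropy(column_totals) - conditional_entropy(table)


# Hypothetical 2x2 table of counts, for illustration only.
example_table = [[30, 10], [10, 30]]
gain = mutual_information(example_table)
```

A covariate whose table yields a large drop from the marginal entropy of the response to the conditional entropy is a candidate major factor in this information-gain sense. How the paper's criteria [C1] and [C2] threshold or compare such quantities is not specified in the abstract, so the sketch stops at the raw measurements.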
