4.3 Article

A regression approach to the two-dataset problem

Journal

STATISTICS
Volume 56, Issue 6, Pages 1225-1241

Publisher

TAYLOR & FRANCIS LTD
DOI: 10.1080/02331888.2022.2134385

Keywords

Data collection process; random coefficients model; Bayesian model averaging

Funding

  1. Kwansei Gakuin University

Ask authors/readers for more resources

This paper addresses the issue of two-dataset problem, where data are collected from potentially different populations while sharing common aspects. It proposes statistical regression models to capture the measurement differences and introduces two prediction errors for evaluating the underlying data collection process. Two real datasets are used to illustrate the proposed method.
This paper considers the two-dataset problem, where data are collected from two potentially different populations sharing common aspects. This problem arises when data are collected by two different types of researchers or from two different sources. We may reach invalid conclusions without using knowledge about the data collection process. To address this problem, this paper develops statistical regression models focusing on the difference in measurement and proposes two prediction errors that help to evaluate the underlying data collection process. As a consequence, it is possible to discuss the heterogeneity/similarity of the set of predictors in terms of prediction. Two real datasets are selected to illustrate our method.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.3
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available