4.6 Article

MARGINAL SINGULARITY AND THE BENEFITS OF LABELS IN COVARIATE-SHIFT

期刊

ANNALS OF STATISTICS
卷 49, 期 6, 页码 3299-3323

出版社

INST MATHEMATICAL STATISTICS-IMS
DOI: 10.1214/21-AOS2084

关键词

Transfer learning; domain adaptation; covariate shift; nonparametric classification

向作者/读者索取更多资源

Transfer Learning tackles situations in which little or no labeled data is available for a target prediction problem, but much labeled data is accessible from some related but different data distribution. The study focuses on the fundamental limits of transfer, including sample sizes from different distributions and differences in data distributions, aiming to address practical questions about optimal sampling and performance limits.
Transfer Learning addresses common situations in Machine Leaning where little or no labeled data is available for a target prediction problem- corresponding to a distribution Q, but much labeled data is available from some related but different data distribution P. This work is concerned with the fundamental limits of transfer, that is, the limits in target performance in terms of (1) sample sizes from P and Q, and (2) differences in data distributions P, Q. In particular, we aim to address practical questions such as how much target data from Q is sufficient given a certain amount of related data from P, and how to optimally sample such target data for labeling. We present new minimax results for transfer in nonparametric classification (i.e., for situations where little is known about the target classifier), under the common assumption that the marginal distributions of covariates differ between P and Q (often termed covariate-shift). Our results are first to concisely capture the relative benefits of source and target labeled data in these settings through information-theoretic limits. Namely, we show that the benefits of target labels are tightly controlled by a transfer-exponent gamma that encodes how singular Q is locally with respect to P, and interestingly paints a more favorable picture of transfer than what might be believed from insights from previous work. In fact, while previous work rely largely on refinements of traditional metrics and divergences between distributions, and often only yield a coarse view of when transfer is possible or not, our analysis-in terms of gamma-reveals a continuum of new regimes ranging from easy to hard transfer. We then address the practical question of how to efficiently sample target data to label, by showing that a recently proposed semi-supervised procedure-based on k-NN classification, can be refined to adapt to unknown gamma and, therefore, requests target labels only when beneficial, while achieving nearly minimax-optimal transfer rates without knowledge of distributional parameters. Of independent interest, we obtain new minimax-optimality results for vanilla k-NN classification in regimes with nonuniform marginals.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据