4.7 Article

Cross-Domain Learning from Multiple Sources: A Consensus Regularization Perspective

期刊

出版社

IEEE COMPUTER SOC
DOI: 10.1109/TKDE.2009.205

关键词

Classification; multiple source domains; cross-domain learning; consensus regularization

资金

  1. National Science Foundation of China [60675010, 60933004, 60975039]
  2. 863 National High-Tech Program [2007AA01Z132]
  3. National Basic Research Priorities Programme [2007CB311004]
  4. National Science and Technology Support Plan [2006BAC08B06]
  5. US National Science Foundation (NSF) [CNS 0831186]
  6. Rutgers Seed Funding for Collaborative Computing Research

向作者/读者索取更多资源

Classification across different domains studies how to adapt a learning model from one domain to another domain which shares similar data characteristics. While there are a number of existing works along this line, many of them are only focused on learning from a single source domain to a target domain. In particular, a remaining challenge is how to apply the knowledge learned from multiple source domains to a target domain. Indeed, data from multiple source domains can be semantically related, but have different data distributions. It is not clear how to exploit the distribution differences among multiple source domains to boost the learning performance in a target domain. To that end, in this paper, we propose a consensus regularization framework for learning from multiple source domains to a target domain. In this framework, a local classifier is trained by considering both local data available in one source domain and the prediction consensus with the classifiers learned from other source domains. Moreover, we provide a theoretical analysis as well as an empirical study of the proposed consensus regularization framework. The experimental results on text categorization and image classification problems show the effectiveness of this consensus regularization learning method. Finally, to deal with the situation that the multiple source domains are geographically distributed, we also develop the distributed version of the proposed algorithm, which avoids the need to upload all the data to a centralized location and helps to mitigate privacy concerns.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据