3.8 Proceedings Paper

Online Adaptive Asymmetric Active Learning for Budgeted Imbalanced Data

出版社

ASSOC COMPUTING MACHINERY
DOI: 10.1145/3219819.3219948

关键词

-

资金

  1. National Natural Science Foundation of China (NSFC) [61502177, 61602185]
  2. Recruitment Program for Young Professionals
  3. Guangdong Provincial Scientific and Technological funds [2017B090901008, 2017A010101011, 2017B090910005]
  4. Fundamental Research Funds for the Central Universities [D2172500, D2172480]
  5. Pearl River S&T Nova Program of Guangzhou [201806010-081]
  6. CCF-Tencent Open Research Fund [RAGR20170105]

向作者/读者索取更多资源

This paper investigates Online Active Learning (OAL) for imbalanced unlabeled datastream, where only a budget of labels can be queried to optimize some cost-sensitive performance measure. OAL can solve many real-world problems, such as anomaly detection in healthcare, finance and network security. In these problems, there are two key challenges: the query budget is often limited; the ratio between two classes is highly imbalanced. To address these challenges, existing work of OAL adopts either asymmetric losses or queries (an isolated asymmetric strategy) to tackle the imbalance, and uses first-order methods to optimize the cost-sensitive measure. However, they may incur two deficiencies: (1) the poor ability in handling imbalanced data due to the isolated asymmetric strategy; (2) relative slow convergence rate due to the first-order optimization. In this paper, we propose a novel Online Adaptive Asymmetric Active (OA3) learning algorithm, which is based on a new asymmetric strategy (merging both the asymmetric losses and queries strategies), and second-order optimization. We theoretically analyze its bounds, and also empirically evaluate it on four real-world online anomaly detection tasks. Promising results confirm the effectiveness and robustness of the proposed algorithm in various application domains.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

3.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据