3.8 Proceedings Paper

Online Adaptive Asymmetric Active Learning for Budgeted Imbalanced Data

Publisher

ASSOC COMPUTING MACHINERY
DOI: 10.1145/3219819.3219948

Keywords

-

Funding

  1. National Natural Science Foundation of China (NSFC) [61502177, 61602185]
  2. Recruitment Program for Young Professionals
  3. Guangdong Provincial Scientific and Technological funds [2017B090901008, 2017A010101011, 2017B090910005]
  4. Fundamental Research Funds for the Central Universities [D2172500, D2172480]
  5. Pearl River S&T Nova Program of Guangzhou [201806010-081]
  6. CCF-Tencent Open Research Fund [RAGR20170105]

Ask authors/readers for more resources

This paper investigates Online Active Learning (OAL) for imbalanced unlabeled datastream, where only a budget of labels can be queried to optimize some cost-sensitive performance measure. OAL can solve many real-world problems, such as anomaly detection in healthcare, finance and network security. In these problems, there are two key challenges: the query budget is often limited; the ratio between two classes is highly imbalanced. To address these challenges, existing work of OAL adopts either asymmetric losses or queries (an isolated asymmetric strategy) to tackle the imbalance, and uses first-order methods to optimize the cost-sensitive measure. However, they may incur two deficiencies: (1) the poor ability in handling imbalanced data due to the isolated asymmetric strategy; (2) relative slow convergence rate due to the first-order optimization. In this paper, we propose a novel Online Adaptive Asymmetric Active (OA3) learning algorithm, which is based on a new asymmetric strategy (merging both the asymmetric losses and queries strategies), and second-order optimization. We theoretically analyze its bounds, and also empirically evaluate it on four real-world online anomaly detection tasks. Promising results confirm the effectiveness and robustness of the proposed algorithm in various application domains.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

3.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available