4.7 Article

AFGSL: Automatic Feature Generation based on Graph Structure Learning

Journal

KNOWLEDGE-BASED SYSTEMS
Volume 238, Issue -, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.knosys.2021.107835

Keywords

Automatic feature generation; Categorical features; Graph Structure Learning; Reinforcement learning

Funding

  1. National Key R&D Program of China [2019YFC1711000]
  2. Collaborative Innovation Center of Novel Software Technology and Industrialization

Ask authors/readers for more resources

Feature engineering relies on domain knowledge and human intervention. To automate this process, this paper proposes a novel Automatic Feature Generation model based on Graph Structure Learning (AFGSL). AFGSL utilizes the adjacency matrix to model feature relationships and employs Q-learning to train the stacking interaction layers, achieving better performance.
Feature engineering relies on domain knowledge and human intervention. To automate the process of feature engineering, automated feature construction methods use deep neural networks to capture feature interactions and attention coefficients to quantify the relationship between features. However, these methods ignore the influence of insignificant features that introduce noise and degrade the performance of the model. In this paper, we study the problem of feature interactions from the perspective of graph and propose a novel Automatic Feature Generation model based on Graph Structure Learning, called AFGSL. In this model, the adjacency matrix reflects the relationships between features, so that it can be used to filter out insignificant features. Furthermore, Q-learning is used to train the policy of stacking interaction layers, which enables it to make full use of both local and global information in the process of feature generation. The results of experiments on four real-world datasets show that AFGSL outperforms the state-of-the-art methods. (c) 2021 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available