4.5 Article

Adapting Feature Selection Algorithms for the Classification of Chinese Texts

相关参考文献

注意:仅列出部分参考文献,下载原文获取全部文献信息。
Article Computer Science, Information Systems

A joint multiobjective optimization of feature selection and classifier design for high-dimensional data classification

Lixia Bai et al.

Summary: Feature selection has been extensively studied in data mining and machine learning. Meta-heuristic algorithms are commonly used to solve feature selection problems, however, they suffer from issues such as large search space and long computation time. This article proposes a joint multiobjective optimization method, called JMO-FSCD, for feature selection and classifier design. The proposed approach uses a neural network as a classifier and introduces a non-iterative algorithm for training the classifier. Experimental results demonstrate the superior performance of JMO-FSCD compared to six state-of-the-art feature selection algorithms.

INFORMATION SCIENCES (2023)

Article Environmental Sciences

A Deep Learning Model of Spatial Distance and Named Entity Recognition (SD-NER) for Flood Mark Text Classification

Robert Szczepanek

Summary: Information on historical flood levels can be communicated verbally, in documents, or in the form of flood marks. The aim of the presented work is to create a new model for classifying Internet sources using advanced text analysis, neural networks, and spatial analysis. The proposed model achieved a high F1 score for the binary classification task, improving the results by utilizing spatial information about toponyms.
Article Humanities, Multidisciplinary

Emotion classification for short texts: an improved multi-label method

Xuan Liu et al.

Summary: The computational identification and categorization of opinions in text is crucial for providing better understanding and services to online users. However, the current multi-label automatic classification is still inadequate. This study proposes a modified MLkNN classifier that considers both in-sentence and adjacent sentence features, resulting in improved accuracy and speed in emotion classification for short texts on Twitter.

HUMANITIES & SOCIAL SCIENCES COMMUNICATIONS (2023)

Article Social Sciences, Interdisciplinary

Developing Multi-Labelled Corpus of Twitter Short Texts: A Semi-Automatic Method

Xuan Liu et al.

Summary: Facing the growing need to extract textual features of online texts for better communication in the Digital Media Age, sentiment classification, by developing corpora with annotation of emotions, is considered the key method to catch emotions of online communication. However, the manual annotation process is labor-intensive and costly, resulting in the lack of corpora for emotional words. Therefore, there is an urgent need for improvement in the methods of automatic emotion tagging with multiple emotion labels to construct new semantic corpora.

SYSTEMS (2023)

Article Computer Science, Information Systems

Two-stage three-way enhanced technique for ensemble learning in inclusive policy text classification

Decui Liang et al.

Summary: This study proposes a two-stage three-way enhanced technique to automatically classify policy text paragraphs into predefined categories. Experimental results show that the proposed method effectively supports the design of policy recommended platforms and serves SMEs.

INFORMATION SCIENCES (2021)

Article Computer Science, Artificial Intelligence

Feature Selection for Classification using Principal Component Analysis and Information Gain

Erick Odhiambo Omuya et al.

Summary: This study investigates the application of feature selection and classification in various fields, addressing the challenges of high dimensionality in datasets and the negative impact of irrelevant and redundant attributes on classification algorithms. To improve classification performance, a hybrid filter model based on principal component analysis and information gain is proposed and applied to machine learning techniques, demonstrating enhanced accuracy, precision, and recall.

EXPERT SYSTEMS WITH APPLICATIONS (2021)

Article Computer Science, Information Systems

Approximating XGBoost with an interpretable decision tree

Omer Sagi et al.

Summary: The increasing use of machine-learning models in critical domains has highlighted the importance of interpretable machine-learning models. Decision forests, especially Gradient Boosting Decision Trees (GBDT), are considered state-of-the-art in many classification challenges. This paper introduces a novel method for transforming any decision forest into an interpretable decision tree, providing transparency without compromising predictive performance like XGBoost.

INFORMATION SCIENCES (2021)

Proceedings Paper Computer Science, Theory & Methods

Chinese Texts Classification System

Meng Zhu et al.

2019 IEEE 2ND INTERNATIONAL CONFERENCE ON INFORMATION AND COMPUTER TECHNOLOGIES (ICICT) (2019)

Article Computer Science, Information Systems

Composite Feature Extraction and Selection for Text Classification

Chuan Wan et al.

IEEE ACCESS (2019)

Article Computer Science, Artificial Intelligence

Opinion mining using ensemble text hidden Markov models for text classification

Mangi Kang et al.

EXPERT SYSTEMS WITH APPLICATIONS (2018)

Article Computer Science, Artificial Intelligence

An End-to-End Trainable Neural Network for Image-Based Sequence Recognition and Its Application to Scene Text Recognition

Baoguang Shi et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2017)

Article Engineering, Mechanical

A Data-Driven Text Mining and Semantic Network Analysis for Design Information Retrieval

Feng Shi et al.

JOURNAL OF MECHANICAL DESIGN (2017)

Article Computer Science, Artificial Intelligence

Ensemble of keyword extraction methods and classifiers in text classification

Aytug Onan et al.

EXPERT SYSTEMS WITH APPLICATIONS (2016)

Article Computer Science, Artificial Intelligence

Category Specific Dictionary Learning for Attribute Specific Feature Selection

Wei Wang et al.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2016)

Review Computer Science, Artificial Intelligence

A Review on Multi-Label Learning Algorithms

Min-Ling Zhang et al.

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING (2014)

Article Computer Science, Artificial Intelligence

Comparison of term frequency and document frequency based feature selection metrics in text categorization

Nouman Azam et al.

EXPERT SYSTEMS WITH APPLICATIONS (2012)

Article Engineering, Mechanical

Mutual information algorithms

Ai-Hua Jiang et al.

MECHANICAL SYSTEMS AND SIGNAL PROCESSING (2010)