☆ 4.3 Article

Validation of Text Data Preprocessing Using a Neural Network Model

MATHEMATICAL PROBLEMS IN ENGINEERING (2020)

Journal

MATHEMATICAL PROBLEMS IN ENGINEERING

Volume 2020, Issue -, Pages -

Publisher

HINDAWI LTD

DOI: 10.1155/2020/1958149

Keywords

Funding

National Research Foundation of Korea (NRF) - Korean government (MSIP) [2019R1H1A1079885]
National Research Foundation of Korea [2019R1H1A1079885] Funding Source: Korea Institute of Science & Technology Information (KISTI), National Science & Technology Information Service (NTIS)

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

Many artificial intelligence studies focus on designing new neural network models or optimizing hyperparameters to improve model accuracy. To develop a reliable model, appropriate data are required, and data preprocessing is an essential part of acquiring the data. Although various studies regard data preprocessing as part of the data exploration process, those studies lack awareness about the need for separate technologies and solutions for preprocessing. Therefore, this study evaluated combinations of preprocessing types in a text-processing neural network model. Better performance was observed when two preprocessing types were used than when three or more preprocessing types were used for data purification. More specifically, using lemmatization and punctuation splitting together, lemmatization and lowering together, and lowering and punctuation splitting together showed positive effects on accuracy. This study is significant because the results allow better decisions to be made about the selection of the preprocessing types in various research fields, including neural network research.

Validation of Text Data Preprocessing Using a Neural Network Model

Journal

MATHEMATICAL PROBLEMS IN ENGINEERING

Publisher

HINDAWI LTD

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Validation of Text Data Preprocessing Using a Neural Network Model

Journal

MATHEMATICAL PROBLEMS IN ENGINEERING

Publisher

HINDAWI LTD

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper