期刊
MATHEMATICS
卷 10, 期 1, 页码 -出版社
MDPI
DOI: 10.3390/math10010029
关键词
electronic nose (EN); data transformation; data reduction; manifold learning; mean-centered unitary group-scaling (MCUGS); uniform manifold approximation and projection (UMAP); extreme learning machine (ELM); odor recognition
类别
资金
- Fondo de Ciencia, Tecnologia e Innovacion (FCTeI) del Sistema General de Regalias (SGR) from Colombia
- Administrative Department of Science, Technology, and Innovation-Colciencias [779]
- Spanish Agencia Estatal de Investigacion (AEI)-Ministerio de Economia, Industria y Competitividad (MINECO)
- Fondo Europeo de Desarrollo Regional (FEDER) [DPI2017-82930-C2-1-R, PGC2018-097257-B-C33]
- Generalitat de Catalunya [2017-SGR-388, 2017-SGR-1278]
The study introduced a machine learning methodology to improve signal processing and develop classification methodologies for EN applications. The approach enhanced sensor classification accuracy through data preprocessing and machine learning algorithms.
The classification and use of robust methodologies in sensor array applications of electronic noses (ENs) remain an open problem. Among the several steps used in the developed methodologies, data preprocessing improves the classification accuracy of this type of sensor. Data preprocessing methods, such as data transformation and data reduction, enable the treatment of data with anomalies, such as outliers and features, that do not provide quality information; in addition, they reduce the dimensionality of the data, thereby facilitating the tasks of a machine learning classifier. To help solve this problem, in this study, a machine learning methodology is introduced to improve signal processing and develop methodologies for classification when an EN is used. The proposed methodology involves a normalization stage to scale the data from the sensors, using both the well-known min-max approach and the more recent mean-centered unitary group scaling (MCUGS). Next, a manifold learning algorithm for data reduction is applied using uniform manifold approximation and projection (UMAP). The dimensionality of the data at the input of the classification machine is reduced, and an extreme learning machine (ELM) is used as a machine learning classifier algorithm. To validate the EN classification methodology, three datasets of ENs were used. The first dataset was composed of 3600 measurements of 6 volatile organic compounds performed by employing 16 metal-oxide gas sensors. The second dataset was composed of 235 measurements of 3 different qualities of wine, namely, high, average, and low, as evaluated by using an EN sensor array composed of 6 different sensors. The third dataset was composed of 309 measurements of 3 different gases obtained by using an EN sensor array of 2 sensors. A 5-fold cross-validation approach was used to evaluate the proposed methodology. A test set consisting of 25% of the data was used to validate the methodology with unseen data. The results showed a fully correct average classification accuracy of 1 when the MCUGS, UMAP, and ELM methods were used. Finally, the effect of changing the number of target dimensions on the reduction of the number of data was determined based on the highest average classification accuracy.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据