4.7 Article

Reducing Uncertainty and Increasing Confidence in Unsupervised Learning

Related references

Note: Only part of the references are listed.
Article Engineering, Mechanical

Dictionary learning-based damage detection under varying environmental conditions using only vibration responses of numerical model and real intact State: Verification on an experimental offshore jacket model

Zohreh Mousavi et al.

Summary: This study proposes a novel vibration-based method for damage detection of real systems using Dictionary Learning (DL) based on a FE model and real intact state under different uncertainties. The method is validated using real measurements from an experimental offshore jacket model, and the results show higher accuracy in the case of changing working load.

MECHANICAL SYSTEMS AND SIGNAL PROCESSING (2023)

Article Computer Science, Artificial Intelligence

Unsupervised domain adaptation based on the predictive uncertainty of models

JoonHo Lee et al.

Summary: Unsupervised domain adaptation (UDA) aims to improve prediction performance in the target domain by minimizing the divergence between the source and target domains. This paper proposes a novel UDA method, called Model Uncertainty-based UDA (MUDA), which learns domain-invariant features to minimize domain divergence. MUDA utilizes a Bayesian framework and Monte Carlo dropout sampling to evaluate model uncertainty. Experimental results on image recognition tasks demonstrate the superiority of MUDA over existing state-of-the-art methods. MUDA is also extended to multi-source domain adaptation problems.

NEUROCOMPUTING (2023)

Article Mathematics

Machine-Learning Methods on Noisy and Sparse Data

Konstantinos Poulinakis et al.

Summary: This study compares machine-learning methods and cubic splines on their ability to handle sparse and noisy training data. The results show that cubic splines provide more precise interpolation than deep neural networks and multivariate adaptive regression splines with very sparse data. However, machine-learning models show robustness to noise and can outperform splines after reaching a threshold of training data. The study aims to provide a general framework for interpolating one-dimensional signals, often obtained from complex scientific simulations or laboratory experiments.

MATHEMATICS (2023)

Article Computer Science, Artificial Intelligence

Feature Alignment by Uncertainty and Self-Training for Source-Free Domain

JoonHo Lee et al.

Summary: In this paper, a novel source-free unsupervised domain adaptation (UDA) method is proposed, which only uses a pre-trained source model and unlabeled target images for training. The method captures aleatoric uncertainty through data augmentation and trains the feature generator with two consistency objectives to improve the robustness of the adapted model to image perturbations. Experimental results on popular UDA benchmark datasets demonstrate that the proposed method is comparable or even superior to vanilla UDA methods.

NEURAL NETWORKS (2023)

Article Chemistry, Multidisciplinary

Evaluating Human versus Machine Learning Performance in a LegalTech Problem

Tamas Orosz et al.

Summary: In recent years, many machine learning-based document processing applications have been developed, which can reduce costs and reshape company structures. These applications can replace trainees, allowing experts to focus on higher-value tasks and foster innovation. However, the development cost of these methods is often high and not straightforward. This paper presents a survey that compares a machine learning-based legal text labeler with individuals possessing legal domain knowledge. The results show the effectiveness and accuracy of the machine learning system and highlight the potential for increased discoverability and value enrichment.

APPLIED SCIENCES-BASEL (2022)

Article Engineering, Electrical & Electronic

A quantitative discriminant method of elbow point for the optimal number of clusters in clustering algorithm

Congming Shi et al.

Summary: Clustering, a traditional machine learning method, often relies on a predetermined exact number of clusters which may not be practical in real-world scenarios where the number of clusters is unpredictable. A new elbow point discriminant method is proposed to estimate the optimal cluster number using statistical metrics. Experimental results demonstrate that this method outperforms the widely used Silhouette method in determining the optimal cluster number.

EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING (2021)

Article Chemistry, Analytical

A Convolutional Autoencoder Topology for Classification in High-Dimensional Noisy Image Datasets

Emmanuel Pintelas et al.

Summary: The study introduces a novel approach using a convolutional autoencoder topological model to address the issue of noise and redundant information affecting deep learning models, leading to a significant performance improvement by compressing and filtering initial high-dimensional input images.

SENSORS (2021)

Article Biochemical Research Methods

HGC: fast hierarchical clustering for large-scale single-cell data

Ziheng Zou et al.

Summary: HGC is a fast Hierarchical Graph-based Clustering tool that addresses the issues of fixed number of clusters and lack of hierarchical information in single-cell data clustering. Experiments demonstrate that HGC enables multiresolution exploration of biological hierarchy, achieves state-of-the-art accuracy on benchmark data, and is capable of scaling to large datasets.

BIOINFORMATICS (2021)

Article Physics, Multidisciplinary

Silhouette Analysis for Performance Evaluation in Machine Learning with Applications to Clustering

Meshal Shutaywi et al.

Summary: Grouping objects based on similarities is crucial in machine learning, with k-means and kernel k-means being popular clustering methods. This study extends previous work by introducing a weighted majority voting method based on NMI, and proposing an unsupervised weighting function based on the Silhouette index to improve clustering without the need for a training set.

ENTROPY (2021)

Article Mathematics, Interdisciplinary Applications

Clustering Large Datasets by MergingK-Means Solutions

Volodymyr Melnykov et al.

JOURNAL OF CLASSIFICATION (2020)

Article Computer Science, Artificial Intelligence

The importance of interpretability and visualization in machine learning for applications in medicine and health care

Alfredo Vellido

NEURAL COMPUTING & APPLICATIONS (2020)

Review Computer Science, Information Systems

The k-means Algorithm: A Comprehensive Survey and Performance Evaluation

Mohiuddin Ahmed et al.

ELECTRONICS (2020)

Proceedings Paper Computer Science, Hardware & Architecture

Communication-Efficient Jaccard similarity for High-Performance Distributed Genome Comparisons

Maciej Besta et al.

2020 IEEE 34TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM IPDPS 2020 (2020)

Review Mathematics, Interdisciplinary Applications

Machine-Learning Methods for Computational Science and Engineering

Michael Frank et al.

COMPUTATION (2020)

Article Multidisciplinary Sciences

Unsupervised learning by competing hidden units

Dmitry Krotov et al.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2019)

Article Biochemical Research Methods

Deep convolutional neural network for classification of sleep stages from single-channel EEG signals

Z. Mousavi et al.

JOURNAL OF NEUROSCIENCE METHODS (2019)

Article Computer Science, Artificial Intelligence

How much can k-means be improved by using better initialization and repeats?

Pasi Franti et al.

PATTERN RECOGNITION (2019)

Proceedings Paper Computer Science, Artificial Intelligence

Count-Min: Optimal Estimation and Tight Error Bounds using Empirical Error Distributions

Daniel Ting

KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING (2018)

Review Computer Science, Artificial Intelligence

Algorithms for hierarchical clustering: an overview

Fionn Murtagh et al.

WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY (2012)

Article Biochemical Research Methods

MULTI-K: accurate classification of microarray subtypes using ensemble k-means clustering

Eun-Youn Kim et al.

BMC BIOINFORMATICS (2009)

Article Computer Science, Software Engineering

How fast is the k-means method?

S Har-Peled et al.

ALGORITHMICA (2005)