4.5 Article

Cross-Study Replicability in Cluster Analysis

Related references

Note: Only part of the references are listed.
Review Computer Science, Artificial Intelligence

Validation of cluster analysis results on validation data: A systematic framework

Theresa Ullmann et al.

Summary: Cluster analysis is a popular data analytic technique for class discovery, with different methods for assessing the quality of clustering results. While there is extensive work on traditional validation techniques, more attention needs to be given to validating clustering results using a separate validation dataset. This article provides a systematic review of existing literature on this topic and outlines a formal framework for validating clustering results on validation data.

WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY (2022)

Review Medicine, General & Internal

Breast Cancer Treatment A Review

Adrienne G. Waks et al.

JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION (2019)

Article Multidisciplinary Sciences

Definitions, methods, and applications in interpretable machine learning

W. James Murdoch et al.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2019)

Article Computer Science, Artificial Intelligence

K-means properties on six clustering benchmark datasets

Pasi Franti et al.

APPLIED INTELLIGENCE (2018)

Article Mathematics, Interdisciplinary Applications

Bayesian Cluster Analysis: Point Estimation and Credible Balls (with Discussion)

Sara Wade et al.

BAYESIAN ANALYSIS (2018)

Article Statistics & Probability

Estimation Stability With Cross-Validation (ESCV)

Chinghway Lim et al.

JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS (2016)

Article Statistics & Probability

BAYESIAN NONPARAMETRIC CROSS-STUDY VALIDATION OF PREDICTION METHODS

Lorenzo Trippa et al.

ANNALS OF APPLIED STATISTICS (2015)

Article Biochemical Research Methods

On the selection of appropriate distances for gene expression data clustering

Pablo A. Jaskowiak et al.

BMC BIOINFORMATICS (2014)

Article Statistics & Probability

Stability

Bin Yu

BERNOULLI (2013)

Article Computer Science, Interdisciplinary Applications

Selection of the number of clusters via the bootstrap method

Yixin Fang et al.

COMPUTATIONAL STATISTICS & DATA ANALYSIS (2012)

Article Oncology

A Three-Gene Model to Robustly Identify Breast Cancer Molecular Subtypes

Benjamin Haibe-Kains et al.

JNCI-JOURNAL OF THE NATIONAL CANCER INSTITUTE (2012)

Article Statistics & Probability

Random partition models with regression on covariates

Peter Muellner et al.

JOURNAL OF STATISTICAL PLANNING AND INFERENCE (2010)

Article Computer Science, Interdisciplinary Applications

clValid: An R package for cluster validation

Guy Brock et al.

JOURNAL OF STATISTICAL SOFTWARE (2008)

Article Statistics & Probability

Statistical Significance of Clustering for High-Dimension, Low-Sample Size Data

Yufeng Liu et al.

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION (2008)

Article Computer Science, Interdisciplinary Applications

Cluster-wise assessment of cluster stability

Christian Hennig

COMPUTATIONAL STATISTICS & DATA ANALYSIS (2007)

Article Multidisciplinary Sciences

Clustering by passing messages between data points

Brendan J. Frey et al.

SCIENCE (2007)

Article Biochemical Research Methods

Model order selection for bio-molecular data clustering

Alberto Bertoni et al.

BMC BIOINFORMATICS (2007)

Proceedings Paper Computer Science, Artificial Intelligence

Stability of k-means clustering

Shai Ben-David et al.

LEARNING THEORY, PROCEEDINGS (2007)

Article Mathematical & Computational Biology

Are clusters found in one dataset present in another dataset?

Amy V. Kapp et al.

BIOSTATISTICS (2007)

Article Computer Science, Artificial Intelligence

Fast agglomerative clustering using a k-nearest neighbor graph

Pasi Franti et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2006)

Article Mathematics, Interdisciplinary Applications

On similarity indices and correction for chance agreement

Ahmed N. Albatineh et al.

JOURNAL OF CLASSIFICATION (2006)

Article Statistics & Probability

Cluster validation by prediction strength

R Tibshirani et al.

JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS (2005)

Article Statistics & Probability

Problems in gene clustering based on gene expression data

J Bryan

JOURNAL OF MULTIVARIATE ANALYSIS (2004)

Article Computer Science, Artificial Intelligence

Stability-based validation of clustering solutions

T Lange et al.

NEURAL COMPUTATION (2004)

Article Biochemical Research Methods

Cluster stability scores for microarray data in cancer studies

M Smolkin et al.

BMC BIOINFORMATICS (2003)

Article Computer Science, Artificial Intelligence

Resampling method for unsupervised estimation of cluster validity

E Levine et al.

NEURAL COMPUTATION (2001)