4.4 Article

Asymptotic properties of hierarchical clustering in high-dimensional settings

Related references

Note: Only part of the references are listed.
Article Statistics & Probability

Clustering by principal component analysis with Gaussian kernel in high-dimension, low-sample-size settings

Yugo Nakayama et al.

Summary: This paper examines clustering based on kernel principal component analysis (KPCA) for high-dimension, low-sample-size (HDLSS) data. The study provides theoretical reasons for the effectiveness of the Gaussian kernel in clustering high-dimensional data, and explores the choice of scale parameter for optimal KPCA performance with the Gaussian kernel. Finally, the clustering performance is tested using microarray data sets.

JOURNAL OF MULTIVARIATE ANALYSIS (2021)

Article Statistics & Probability

Asymptotic properties of distance-weighted discrimination and its bias correction for high-dimension, low-sample-size data

Kento Egashira et al.

Summary: The DWD is sensitive to imbalanced sample sizes in high-dimensional, low-sample-size settings. The proposed BC-DWD corrects for this bias and shows consistency in misclassification rates. Optimal weights are also proposed for the WDWD to improve its performance.

JAPANESE JOURNAL OF STATISTICS AND DATA SCIENCE (2021)

Article Statistics & Probability

Geometric consistency of principal component scores for high-dimensional mixture models and its application

Kazuyoshi Yata et al.

SCANDINAVIAN JOURNAL OF STATISTICS (2020)

Article Statistics & Probability

Support vector machine and its bias correction in high-dimension, low-sample-size settings

Yugo Nakayama et al.

JOURNAL OF STATISTICAL PLANNING AND INFERENCE (2017)

Article Biology

Statistical Significance for Hierarchical Clustering

Patrick K. Kimes et al.

BIOMETRICS (2017)

Article Statistics & Probability

Statistical Significance of Clustering Using Soft Thresholding

Hanwen Huang et al.

JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS (2015)

Article Statistics & Probability

A distance-based, misclassification rate adjusted classifier for multiclass, high-dimensional data

Makoto Aoshima et al.

ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS (2014)

Article Statistics & Probability

Asymptotics of hierarchical clustering for growing dimension

Petro Borysov et al.

JOURNAL OF MULTIVARIATE ANALYSIS (2014)

Article Statistics & Probability

CLUSTERING HIGH DIMENSION, LOW SAMPLE SIZE DATA USING THE MAXIMAL DATA PILING DISTANCE

Jeongyoun Ahn et al.

STATISTICA SINICA (2012)

Article Statistics & Probability

Two-Stage Procedures for High-Dimensional Data

Makoto Aoshima et al.

SEQUENTIAL ANALYSIS-DESIGN METHODS AND APPLICATIONS (2011)

Article Statistics & Probability

Statistical Significance of Clustering for High-Dimension, Low-Sample Size Data

Yufeng Liu et al.

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION (2008)

Article Multidisciplinary Sciences

Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses

A Bhattacharjee et al.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2001)

Article Multidisciplinary Sciences

Molecular portraits of human breast tumours

CM Perou et al.

NATURE (2000)

Article Genetics & Heredity

Systematic variation in gene expression patterns in human cancer cell lines

DT Ross et al.

NATURE GENETICS (2000)