☆ 4.2 Article

Latent class analysis variable selection

ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS (2010)

Journal

ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS

Volume 62, Issue 1, Pages 11-35

Publisher

SPRINGER HEIDELBERG

DOI: 10.1007/s10463-009-0258-9

Keywords

Bayes factor; BIC; Categorical data; Feature selection; Model-based clustering; Single nucleotide polymorphism (SNP)

Funding

NIH [8 R01 EB002137-02]
NICHD [1 R01HD O54511]
NSF [IIS0534094, ATM0724721]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

We propose a method for selecting variables in latent class analysis, which is the most common model-based clustering method for discrete data. The method assesses a variable's usefulness for clustering by comparing two models, given the clustering variables already selected. In one model the variable contributes information about cluster allocation beyond that contained in the already selected variables, and in the other model it does not. A headlong search algorithm is used to explore the model space and select clustering variables. In simulated datasets we found that the method selected the correct clustering variables, and also led to improvements in classification performance and in accuracy of the choice of the number of classes. In two real datasets, our method discovered the same group structure with fewer variables. In a dataset from the International HapMap Project consisting of 639 single nucleotide polymorphisms (SNPs) from 210 members of different groups, our method discovered the same group structure with a much smaller number of SNPs.

Latent class analysis variable selection

Journal

ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS

Publisher

SPRINGER HEIDELBERG

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Latent class analysis variable selection

Journal

ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS

Publisher

SPRINGER HEIDELBERG

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper