☆ 4.5 Article

Estimating dataset size requirements for classifying DNA microarray data

JOURNAL OF COMPUTATIONAL BIOLOGY (2003)

Journal

JOURNAL OF COMPUTATIONAL BIOLOGY

Volume 10, Issue 2, Pages 119-142

Publisher

MARY ANN LIEBERT, INC

DOI: 10.1089/106652703321825928

Keywords

gene expression profiling; molecular pattern recognition; DNA microarrays; microarray analysis; sample size estimation

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

A statistical methodology for estimating dataset size requirements for classifying microarray data using learning curves is introduced. The goal is to use existing classification results to estimate dataset size requirements for future classification experiments and to evaluate the gain in accuracy and significance of classifiers built with additional data. The method is based on fitting inverse power-law models to construct empirical learning curves. It also includes a permutation test procedure to assess the statistical significance of classification performance for a given dataset size. This procedure is applied to several molecular classification problems representing a broad spectrum of levels of complexity.

Estimating dataset size requirements for classifying DNA microarray data

Journal

JOURNAL OF COMPUTATIONAL BIOLOGY

Publisher

MARY ANN LIEBERT, INC

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Estimating dataset size requirements for classifying DNA microarray data

Journal

JOURNAL OF COMPUTATIONAL BIOLOGY

Publisher

MARY ANN LIEBERT, INC

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper