Gaining Confidence on Molecular Classification through Consensus Modeling and Validation
Authors:
Weida Tong Dr. a;
Hong Fang b;
Qian Xie b;
Huixiao Hong b;
Leming Shi a;
Roger Perkins b;
Uwe Scherf c;
Federico Goodsaid d;
Felix Frueh d
| Affiliations: | a Center for Toxicoinformatics, National Center for Toxicological Research (NCTR), U.S. Food and Drug Administration, Jefferson, AR, USA |
| b Division of Bioinformatics, Z-Tech Inc., Jefferson, AR, USA | |
| c Center for Device and Radiological Health, U.S. Food and Drug Administration, Rockville, Maryland, USA | |
| d Center for Drug Evaluation and Research, U.S. Food and Drug Administration, Rockville, Maryland, USA |
DOI:
10.1080/15376520600558259
Publication Frequency:
9 issues per year
Published in:
Toxicology Mechanisms and Methods,
Volume
16,
Issue
2 &
3
December
2006
, pages 59
- 68
Subjects:
Pharmaceutical Science;
Toxicology;
Formats available:
HTML
(English)
:
PDF
(English)
Previously published as:
Toxic Substances Journal
(0199-3178)
until 1995
Previously published as:
Toxicology Methods
(1051-7235,
1091-7667)
until 2002
Also incorporating: Toxic Substance Mechanisms(on-Line)
View Article:
View Article (PDF)
View Article (HTML)
Abstract
Current advances in genomics, proteomics, and metabonomics would result in a constellation of benefits in human health. Classification applying supervised learning methods to omics data as one of the molecular classification approaches has enjoyed its growing role in clinical application. However, the utility of a molecular classifier will not be fully appreciated unless its quality is carefully validated. A clinical omics data is usually noisy with the number of independent variables far more than the number of subjects and, possibly, with a skewed subject distribution. Given that, the consensus approach holds an advantage over a single classifier. Thus, the focus of this review is mainly placed on how validating a molecular classifier using Decision Forest (DF), a robust consensus approach. We recommended that a molecular classifier has to be assessed with respect to overall prediction accuracy, prediction confidence and chance correlation, which can be readily achieved in DF. The commonalities and differences between external validation and cross-validation are also discussed for perspective use of these methods to validate a DF classifier. In addition, the advantages of using consensus approaches for identification of potential biomarkers are also rationalized. Although specific DF examples are used in this review, the provided rationales and recommendations should be equally applicable to other consensus methods.
|
| Keywords: Molecular Classification; Consensus Modeling; Validation; Decision Forest (DF); Cross-Validation |
| view references (54) |


Download Citation
CiteULike
Del.icio.us
BibSonomy
Connotea