MUSC Faculty Journal Articles

A Multivariate Prediction Model for Microarray Cross-Hybridization

Yian A. Chen, Medical University of South Carolina
Cheng-Chung Chou, National Taiwan University
Xinghua Lu, Medical University of South Carolina
Elizabeth H. Slate, Medical University of South Carolina
Konan Peck, Academia Sinica
Wenying Xu, Chinese Academy of Sciences
Eberhand O. Voit, Georgia Tech
Jonas S. Almeida, University of Texas MD Anderson Cancer Center

Document Type

Article

Publication Date

3-1-2006

Abstract

Background: Expression microarray analysis is one of the most popular molecular diagnostic techniques in the post-genomic era. However, this technique faces the fundamental problem of potential cross-hybridization. This is a pervasive problem for both oligonucleotide and cDNA microarrays; it is considered particularly problematic for the latter. No comprehensive multivariate predictive modeling has been performed to understand how multiple variables contribute to (cross-) hybridization. Results: We propose a systematic search strategy using multiple multivariate models [multiple linear regressions, regression trees, and artificial neural network analyses (ANNs)] to select an effective set of predictors for hybridization. We validate this approach on a set of DNA microarrays with cytochrome p450 family genes. The performance of our multiple multivariate models is compared with that of a recently proposed third-order polynomial regression method that uses percent identity as the sole predictor. All multivariate models agree that the 'most contiguous base pairs between probe and target sequences,' rather than percent identity, is the best univariate predictor. The predictive power is improved by inclusion of additional nonlinear effects, in particular target GC content, when regression trees or ANNs are used. Conclusion: A systematic multivariate approach is provided to assess the importance of multiple sequence features for hybridization and of relationships among these features. This approach can easily be applied to larger datasets. This will allow future developments of generalized hybridization models that will be able to correct for false-positive cross-hybridization signals in expression experiments.

Comments

Article written by researchers from the Department of Biostatistics, Bioinformatics, and Epidemiology, Medical University of South Carolina;Center for Genomic Medicine, National Taiwan University;Institute of Biomedical Sciences, Academia Sinica;Key Laboratory of Molecular and Developmental Biology, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences;Department of Biomedical Engineering, Georgia Tech;and Department of Biostatistics and Applied Mathematics, University of Texas, MD Anderson Cancer Center. Published in BMC Bioinformatics, March 1, 2006, volume 7, number 101, pages 1-12 (not for citation purposes). Includes abstract, references, table, and diagrams.

Recommended Citation

Chen, Yian A.; Chou, Cheng-Chung; Lu, Xinghua; Slate, Elizabeth H.; Peck, Konan; Xu, Wenying; Voit, Eberhand O.; and Almeida, Jonas S., "A Multivariate Prediction Model for Microarray Cross-Hybridization" (2006). MUSC Faculty Journal Articles. 17.
https://medica-musc.researchcommons.org/facarticles/17

Download

COinS

MUSC Faculty Journal Articles

A Multivariate Prediction Model for Microarray Cross-Hybridization

Document Type

Publication Date

Abstract

Comments

Recommended Citation

Browse

Search

Author Corner

MUSC Faculty Journal Articles

A Multivariate Prediction Model for Microarray Cross-Hybridization

Authors

Document Type

Publication Date

Abstract

Comments

Recommended Citation

Share

Browse

Search

Author Corner