Artificial Intelligence in Medicine
Volume 48, Issue 2 , Pages 75-82 , February 2010

A GMM-IG framework for selecting genes as expression panel biomarkers

  • Mingyi Wang

      Affiliations

    • School of Informatics, Indiana University, 535 W. Michigan Street, Indianapolis, IN 46202, USA
  • ,
  • Jake Y. Chen

      Affiliations

    • School of Informatics, Indiana University, 535 W. Michigan Street, Indianapolis, IN 46202, USA
    • Department of Computer and Information Science, Purdue University School of Science, Indianapolis, IN 46202, USA
    • Indiana Center for Systems Biology and Personalized Medicine, 719 N. Indiana Ave, WK Suite #190, Indianapolis, IN 46202, USA
    • Corresponding Author InformationCorresponding author at: Department of Computer and Information Science, Purdue University School of Science, Indianapolis, IN 46202, USA. Tel.: +1 317 278 7604; fax: +1 317 278 9201.

Received 28 August 2008 ,Revised 29 June 2009 ,Accepted 2 July 2009.

References 

  1. Beer DG, Kardia SL, Huang CC, Giordano TJ, Levin AM, Misek DE, et al. Gene-expression profiles predict survival of patients with lung adenocarcinoma. Nat Med. 2002;8:816–824
  2. Bhattacharjee A, Richards WG, Staunton J, Li C, Monti S, Vasa P, et al. Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses. Proc Natl Acad Sci USA. 2001;98:13790–13795
  3. Garber ME, Troyanskaya OG, Schluens K, Petersen S, Thaesler Z, Pacyna-Gengelbach M, et al. Diversity of gene expression in adenocarcinoma of the lung. Proc Natl Acad Sci USA. 2001;98:13784–13789
  4. Chen HY, Yu SL, Chen CH, Chang GC, Chen CY, Yuan A, et al. A five-gene signature and clinical outcome in non-small-cell lung cancer. N Engl J Med. 2007;356:11–20
  5. Golub TR, Slonim DK, Tamayo P, Huard C, Gaasenbeek M, Mesirov JP, et al. Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science. 1999;286:531–537
  6. van’t Veer LJ, Dai H, van de Vijver MJ, He YD, Hart AA, Mao M, et al. Gene expression profiling predicts clinical outcome of breast cancer. Nature. 2002;415:530–536
  7. Kosorok MR, Ma S. Marginal asymptotics for the “large P, small N” paradigm: With applications to microarray data. Ann Stat. 2007;35:1456–1486
  8. Ein-Dor L, Kela I, Getz G, Givol D, Domany E. Outcome signature genes in breast cancer: is there a unique set?. Bioinformatics. 2005;21:171–178
  9. Wei T, Liao B, Ackermann BL, Jolly RA, Eckstein JA, Kulkarni NH, et al. Data-driven analysis approach for biomarker discovery using molecular-profiling technologies. Biomarkers. 2005;10:153–172
  10. Fishel I, Kaufman A, Ruppin E. Meta-analysis of gene expression data: a predictor-based approach. Bioinformatics. 2007;23:1599–1606
  11. Rhodes DR, Barrette TR, Rubin MA, Ghosh D, Chinnaiyan AM. Meta-analysis of microarrays: interstudy validation of gene expression profiles reveals pathway dysregulation in prostate cancer. Cancer Res. 2002;62:4427–4433
  12. Jiang H, Deng Y, Chen HS, Tao L, Sha Q, Chen J, et al. Joint analysis of two microarray gene-expression data sets to select lung adenocarcinoma marker genes. BMC Bioinformatics. 2004;5:81
  13. Warnat P, Eils R, Brors B. Cross-platform analysis of cancer microarray data improves gene expression based classification of phenotypes. BMC Bioinformatics. 2005;6:265
  14. Ng SK, McLachlan GJ, Wang K, Ben-Tovim Jones L, Ng SW. A mixture model with random-effects components for clustering correlated gene-expression profiles. Bioinformatics. 2006;22:1745–1752
  15. McLachlan GJ, Bean RW, Jones LB. A simple implementation of a normal mixture approach to differential gene expression in multiclass microarrays. Bioinformatics. 2006;22:1608–1615
  16. Xing EP, Jordan MI, Karp RM. Feature selection for high-dimensional genomic microarray data. In:  Brodley CE,  Danyluk AP editor. Proceedings of the 18th international conference on machine learning. MA, USA: Morgan Kaufmann, Williamstown; 2001;p. 601–608
  17. Dempster AP, Laird NM, Rubin DB. Maximum likelihood from incomplete data via the EM algorithm. J R Stat Soc. 1977;39:1–38
  18. Cover TM, Thomas JA, John Wiley & Sons . Elements of information theory. New York: Wiley; 1991;
  19. Hong F, Breitling R. A comparison of meta-analysis methods for detecting differentially expressed genes in microarray experiments. Bioinformatics. 2008;24:374–382
  20. Shi L, Reid LH, Jones WD, Shippy R, Warrington JA, Baker SC, et al. The MicroArray Quality Control (MAQC) project shows inter- and intraplatform reproducibility of gene expression measurements. Nat Biotechnol. 2006;24:1151–1161
  21. Irizarry RA, Warren D, Spencer F, Kim IF, Biswal S, Frank BC, et al. Multiple-laboratory comparison of microarray platforms. Nat Methods. 2005;2:345–350
  22. Cortes C, Vapnik V. Support-vector networks. Mach Learn. 1995;20:273–297
  23. Duda RO, Hart PE. Pattern classification and scene analysis. New York: Wiley; 1973;
  24. Breiman L, Friedman J, Stone CJ, Olshen RA. Classification and regression trees. Boca Raton, Florida: Chapman & Hall/CRC; 1998;
  25. Pan F, Kamath K, Zhang K, Pulapura S, Achar A, Nunez-Iglesias J, et al. Integrative Array Analyzer: a software package for analysis of cross-platform and cross-species microarray data. Bioinformatics. 2006;22:1665–1667
  26. Heighway J, Knapp T, Boyce L, Brennand S, Field JK, Betticher DC, et al. Expression profiling of primary non-small cell lung cancer for target identification. Oncogene. 2002;21:7749–7763
  27. Tang DG, Chen YQ, Newman PJ, Shi L, Gao X, Diglio CA, et al. Identification of PECAM-1 in solid tumor cells and its potential involvement in tumor cell adhesion to endothelium. J Biol Chem. 1993;268:22883–22894
  28. Taguchi A, Blood DC, del Toro G, Canet A, Lee DC, Qu W, et al. Blockade of RAGE-amphoterin signalling suppresses tumour growth and metastases. Nature. 2000;405:354–360

PII: S0933-3657(09)00097-9

doi: 10.1016/j.artmed.2009.07.006

Artificial Intelligence in Medicine
Volume 48, Issue 2 , Pages 75-82 , February 2010