Artificial Intelligence in Medicine
Volume 41, Issue 1 , Pages 57-67 , September 2007

Extension of mixture-of-experts networks for binary classification of hierarchical data

  • Shu-Kay Ng

      Affiliations

    • Department of Mathematics, University of Queensland, Brisbane, Qld 4072, Australia
    • Corresponding Author InformationCorresponding author. Tel.: +61 7 33656139; fax: +61 7 33651477.
  • ,
  • Geoffrey J. McLachlan

      Affiliations

    • Department of Mathematics, University of Queensland, Brisbane, Qld 4072, Australia
    • Institute for Molecular Bioscience, University of Queensland, Brisbane, Qld 4072, Australia

Received 27 January 2007 ,Revised 30 May 2007 ,Accepted 2 June 2007.

References 

  1. Ng S-K, McLachlan GJ, Yau KKW , Lee AH. Modelling the distribution of ischaemic stroke-specific survival time using an EM-based mixture approach with random effects adjustment. Stat Med. 2004;23:2729–2744
  2. Bakker B, Heskes T. Task clustering and gating for Bayesian multitask learning. J Mach Learn Res. 2003;4:83–99
  3. Goldstein H. Multilevel statistical models. 2nd ed. London: Arnold; 1995;
  4. Jacobs RA, Jordan MI, Nowlan SJ, Hinton GE. Adaptive mixtures of local experts. Neural Comput. 1991;3:79–87
  5. Jordan MI, Jacobs RA. Hierarchical mixtures of experts and the EM algorithm. Neural Comput. 1994;6:181–214
  6. Jacobs RA, Peng F, Tanner MA. A Bayesian approach to model selection in hierarchical mixtures-of-experts architectures. Neural Networks. 1997;10:231–241
  7. Ng S-K, McLachlan GJ, Lee AH. An incremental EM-based learning approach for on-line prediction of hospital resource utilization. Artif Intell Med. 2006;36:257–267
  8. Rosen O, Tanner MA. Mixtures of proportional hazards regression models. Stat Med. 1999;18:1119–1131
  9. Dempster AP, Laird NM, Rubin DB. Maximum likelihood from incomplete data via the EM algorithm (with discussion). J Roy Stat Soc B. 1977;39:1–38
  10. Jordan MI, Xu L. Convergence results for the EM approach to mixtures of experts architectures. Neural Networks. 1995;8:1409–1431
  11. Ng S-K, McLachlan GJ. Using the EM algorithm to train neural networks: misconceptions and a new algorithm for multiclass classification. IEEE Trans Neural Networks. 2004;15:738–749
  12. McGilchrist CA. Estimation in generalized mixed models. J Roy Stat Soc B. 1994;56:61–69
  13. Yau KKW, Lee AH, Ng S-K. Finite mixture regression model with random effects: application to neonatal hospital length of stay. Comput Stat Data Anal. 2003;41:359–366
  14. Meng XL, Rubin DB. Maximum likelihood estimation via the ECM algorithm: a general framework. Biometrika. 1993;80:267–278
  15. Jiang W, Tanner MA. On the identifiability of mixtures-of-experts. Neural Networks. 1999;12:1253–1258
  16. McGilchrist CA, Yau KKW. The derivation of BLUP, ML, REML estimation methods for generalized linear mixed models. Commun Stat-Theor Meth. 1995;24:2963–2980
  17. Ng S-K, Krishnan T, McLachlan GJ. The EM algorithm. In Gentle J, Hardle W, Mori Y, editors. Handbook of computational statistics, vol. 1, Chapter II.5. New York: Springer-Verlag; 2004. p. 137–68.
  18. Blake CL, Merz CJ. UCI Repository of machine learning databases. Irvine, CA: University of California, Department of Information and Computer Science; 1998. Available: http://www.ics.uci.edu/∼mlearn/MLRepository.html (accessed: 1 June, 2007).
  19. Schwarz G. Estimating the dimension of a model. Ann Stat. 1978;6:461–464
  20. Kalbfleisch JD, Prentice RL. The statistical analysis of failure time data. 2nd ed. New Jersey: Wiley; 2002;
  21. Ambroise C, McLachlan GJ. Selection bias in gene extraction on the basis of microarray gene-expression data. Proc Natl Acad Sci USA. 2002;99:6562–6566
  22. Lenk PJ, DeSarbo WS. Bayesian inference for finite mixture of generalized linear models with random effects. Psychometrika. 2000;65:93–119
  23. Vermunt JK, Magidson J. Latent class models for classification. Comput Stat Data Anal. 2003;41:531–537
  24. Fruhwirth-Schnatter S, Tuchler R, Otter T. Bayesian analysis of the heterogeneity model. J Bus Econ Stat. 2004;22:2–15
  25. Xu L, Jordan MI, Hinton GE. An alternative model for mixtures of experts. In:  Cowan JD,  Tesauro G,  Alspector J editor. Adv Neural Inf Proc Systems 7. Cambridge: MIT Press; 1995;p. 633–640
  26. McLachlan GJ, Krishnan T. The EM algorithm and extensions. New York: Wiley; 1997;

PII: S0933-3657(07)00068-1

doi: 10.1016/j.artmed.2007.06.001

Artificial Intelligence in Medicine
Volume 41, Issue 1 , Pages 57-67 , September 2007