Artificial Intelligence in Medicine
Volume 28, Issue 1 , Pages 27-57 , May 2003

Active subgroup mining: a case study in coronary heart disease risk group detection

  • Dragan Gamberger

      Affiliations

    • Rudjer Bošković Institute, Zagreb, Croatia
  • ,
  • Nada Lavrač

      Affiliations

    • Corresponding Author InformationCorresponding author. Tel.: +386-61-177-3272; fax: +386-61-125-1038.
    • Jožef Stefan Institute, Ljubljana, Slovenia
  • ,
  • Goran Krstačić

      Affiliations

    • Institute for Cardiovascular Prevention and Rehabilitation, Zagreb, Croatia

Received 9 May 2002 ,Revised 11 January 2003 ,Accepted 15 January 2003.

References 

  1. Breiman L, Friedman J, Stone CJ, Olshen RA. Classification and regression trees. Boca Raton: CRC Press; 1984.
  2. Casteli WP. Epidemiology of coronary heart disease: the Framingham study. Am J Med. 1984;76(2A):4–12
  3. Casteli WP. The triglycerides issue: a view from Framingham. Am Heart J. 1986;112:432–437
  4. Chapman P, et al. CRISP-DM 1.0 Step-by-step data mining guide, http://www.crisp-dm.org.
  5. Clark P, Niblett T. The CN2 induction algorithm. Mach. Learn. 1989;3(4):261–283
  6. Cohen WW. Fast effective rule induction. In: Proceedings of the 12th International Conference on Machine Learning. Los Altos (CA): Morgan Kaufmann; 1995. p. 115–23.
  7. Cohen WW, Singer Y. A simple, fast, and effective rule learner. In: Proceedings of Annual Conference of American Association for Artificial Intelligence. American Association for Artificial Intelligence; 1995. p. 335–42.
  8. Compton P, Jansen R. Knowledge in context: a strategy for expert system maintenance. In: Proceedings of 2nd Australian Joint Artificial Intelligence Conference. Berlin: Springer LNAI 406; 1988. p. 292–306.
  9. Compton P, Horn R, Quinlan R, Lazarus L. Maintaining an expert system. In: Quinlan R, editor. Applications of expert systems. Reading: Addison-Wesley; 1989. p. 366–85.
  10. B. Dahlof , Pennert K, Hansson L. Reversal of left ventricular hypertrophy in hypertensive patients: a metaanalysis of 109 treatment studies. Am. J. Hypertens. 1992;5:95–110
  11. Davis R. Knowledge acquisition in rule-based systems: knowledge representation as a basis for system construction and maintainance. In: Shine ME, Coombs MJ, editors. Designing for human–computer communication. New York: Academic Press; 1978. p. 87–137.
  12. Davis R. TEIRESIAS: experiments in communicating with a knowledge-based systems. In: Waterman DA, Hayes-Roth F, editors. Pattern directed inference systems. New York: Academic Press; 1983. p. 99–134.
  13. De Raedt L, Dehaspe L. Clausal discovery. Mach. Learn. 1997;26:99–146
  14. Džeroski S, Lavrač N, editors. Relational data mining. Berlin: Springer; 2001.
  15. Fayyad UM, Irani KB. On the handling of continuous-valued attributes in decision tree generation. Mach. Learn. 1992;8:87–102
  16. Fayyad UM, Piatetsky-Shapiro G, Smyth P. From data mining to knowledge discovery: an overview. In: Fayyad UM, Piatetski-Shapiro G, Smyth P, Uthurusamy R, editors. Advances in knowledge discovery and data mining. AAAI Press; 1996. p. 1–34.
  17. Freund Y, Shapire RE. Experiments with a new boosting algorithm. In: Proceedings of Thirteenth International Machine Learning Conference ICML’96. Los Altos (CA): Morgan Kaufmann; 1996. p. 148–56.
  18. Gamberger D, Lavrač N, Grošelj C. Experiments with noise filtering in a medical domain. In: Proceedings of International Conference on Machine Learning ICML-99. Los Altos (CA): Morgan Kaufmann; 1999. p. 143–51.
  19. Gamberger D, Lavrač N. Confirmation rule sets. In: Proceedings of 4th European Conference on Principles of Data Mining and Knowledge Discovery (PKDD2000). Berlin: Springer; 2000. p. 34–43.
  20. Gamberger D, Lavrač N, Krstačić G, Šmuc T. Inconsistency tests for patient records in a coronary heart disease database. In: Proceedings of International Symposium on Medical Data Analysis (ISMDA 2000). Berlin: Springer; 2000. p. 183–9.
  21. Gamberger D, Lavrač N. Expert-guided subgroup discovery: methodology and application. J. Artif. Intell. Res. 2002;17:501–527
  22. Goldman L, Garber AM, Grover SA, Hlatky MA. Cost-effectiveness of assessments and management of risk factors. J. Am. Coll. Cardiol. 1996;27:1020–1030
  23. Hsu D, Soderland S, Etzioni O. A redundant covering algorithm applied to text classification. In: Proceedings of the AAAI Workshop on Learning from Text Categorization. AAAI Press; 1998.
  24. Jovanoski V, Lavrač N. Classification rule learning with APRIORI-C. In: Progress in Artificial Intelligence: Proceedings of the Tenth Portuguese Conference on Artificial Intelligence. Berlin: Springer; 2001. p. 44–51.
  25. Klösgen W. Explora: a multipattern and multistrategy discovery assistant. In: Fayyad UM, Piatetski-Shapiro G, Smyth P, Uthurusamy R, editors. Advances in knowledge discovery and data mining. Cambridge: MIT Press; 1996. p. 249–71.
  26. Kononenko I. Inductive and Bayesian learning in medical diagnosis. Appl. Artif. Intell. 1993;7:317–337
  27. Lavrač N, Džeroski S. Inductive logic programming: techniques and applications. Chichester: Ellis Horwood; 1994.
  28. Lavrač N, Flach P, Kavšek B, Todorovski L. Adapting classification rule induction to subgroup discovery. In: Proceedings of the IEEE International Conference on Data Mining. IEEE Computer Society; 2002. p. 266–73.
  29. Lavrač N, Železný F, Flach P. RSD: relational subgroup discovery through first-order feature construction. In: Proceedings of the Twelfth International Conferences on Inductive Logic Programming. Berlin: Springer LNAI 2583; 2003. p. 152–69.
  30. Lee Y, Buchanan BG, Aronis JM. Knowledge-based learning in exploratory science: learning rules to predict rodent carcinogenicity. Mach. Learn. 1998;30:217–240
  31. Mannila H, Toivonen H. On an algorithm for finding all interesting sentences. In: Trappl R, editor. Proceedings of the Cybernetics and Systems’96, 1996. p. 973–8.
  32. Maron D, Ridker PM, Pearson AT. Risk factors and the prevention of coronary heart disease. In: Wayne AR, Schlant RC, Fuster V, editors. HURST’S: the heart. New York: McGraw-Hill; 1998. p. 1175–95.
  33. Michalski RS, Mozetič I, Hong J, Lavrač N. The multi-purpose incremental learning system AQ15 and its testing application on three medical domains. In: Proceedings of the Fifth National Conference on Artificial Intelligence. Los Altos (CA): Morgan Kaufmann; 1986. p. 1041–5.
  34. Motoda H, editor. Active mining: new directions of data mining. In: Frontiers in artificial intelligence and applications, vol. 79. IOS Press; 2002.
  35. Motoda H. Active mining: a spiral model for knowledge discovery. In: Proceedings of the Invited talk at IEEE International Conference on Data Mining, ICDM-2002, Maebashi, Japan, 2002.
  36. Quinlan JR. Induction of decision trees. Mach. Learn. 1986;1:81–106
  37. Schapire RE, Singer Y. Improved boosting algorithms using confidence-rated predictions. In: Proceedings of the Eleventh Conference on Computational Learning Theory. ACM Press; 1998. p. 80–91.
  38. Silberschatz A, Tuzhilin A. On subjective measures of interestingness in knowledge discovery. In: Proceedings of the First International Conference on Knowledge Discovery and Data Mining. AAAI Press; 1995. p. 275–81.
  39. Todorovski L, Flach P, Lavrač N. Predictive performance of weighted relative accuracy. In: Zighed D, Komorowski J, Zytkow J, editors. Proceedings of the Fourth European Conference on Principles of Data Mining and Knowledge Discovery. Berlin: Springer; 2000. p. 255–64.
  40. Wilson PWF. Established risk factors and coronary artery disease: the Framingham Study. Am. J. Hypertension. 1984;7:7S–12S
  41. Wrobel S, Džeroski S. The ILP description learning problem: towards a general model-level definition of data mining in ILP. In: Morik K, Herrmann J, editors. Proceedings of the Fachgruppentreffen Maschinelles Lernen. University of Dortmund; 1995.
  42. Wrobel S, Wettschereck D, Verkamo AI, Siebes A, Mannila H, Kwakkel F, Klösgen W. User interactivity in very large scale data mining. In: Proceedings of the German Workshop on Machine Learning. Technical Report. University of Chemnitz; 1996. p. 125–30.
  43. Wrobel S. An algorithm for multi-relational discovery of subgroups. In: Proceedings of the First European Symposium on Principles of Data Mining and Knowledge Discovery. Berlin: Springer; 1997. p. 78–87.
  44. Wrobel S. Inductive Logic Programming for Knowledge Discovery in Databases. In: Džeroski S, Lavrač N, editors. Relational Data Mining. Berlin: Springer; 2001. p. 74–101.

PII: S0933-3657(03)00034-4

doi: 10.1016/S0933-3657(03)00034-4

Artificial Intelligence in Medicine
Volume 28, Issue 1 , Pages 27-57 , May 2003