Artificial Intelligence in Medicine
Volume 41, Issue 3 , Pages 177-196, November 2007

Evaluation of rule interestingness measures in medical knowledge discovery in databases

  • Miho Ohsaki

      Affiliations

    • Faculty of Engineering, Doshisha University, 1-3 Tataramiyakodani, Kyotanabe-shi, Kyoto 610-0321, Japan
    • Corresponding Author InformationCorresponding author. Tel.: +81 774 65 6468; fax: +81 774 65 6468.
  • ,
  • Hidenao Abe

      Affiliations

    • Department of Medical Informatics, Shimane University, 89-1 Enya-cho, Izumo-shi, Shimane 693-8501, Japan
  • ,
  • Shusaku Tsumoto

      Affiliations

    • Department of Medical Informatics, Shimane University, 89-1 Enya-cho, Izumo-shi, Shimane 693-8501, Japan
  • ,
  • Hideto Yokoi

      Affiliations

    • Department of Medical Informatics, Kagawa University Hospital, 1750-1 Ikenobe, Miki-cho, Kita-gun, Kagawa 761-0793, Japan
  • ,
  • Takahira Yamaguchi

      Affiliations

    • Faculty of Science and Technology, Keio University, 3-14-1 Hiyoshi, Kohoku-ku, Yokohama-shi, Kangawa 223-8522, Japan

Received 18 October 2006; received in revised form 24 July 2007; accepted 24 July 2007.

Summary 

Objective

We discuss the usefulness of rule interestingness measures for medical KDD through experiments using clinical datasets, and, based on the outcomes of these experiments, also consider how to utilize these measures in postprocessing.

Methods and materials

We first conducted an experiment to compare the evaluation results derived from a total of 40 various interestingness measures with those supplied by a medical expert for rules discovered in a clinical dataset on meningitis. We calculated and compared the performance of each interestingness measure to estimate a medical expert’s interest using f-measure and correlation coefficient. We then conducted a similar experiment for hepatitis.

Results and conclusion

The comprehensive results of experiments on meningitis and hepatitis indicate that the interestingness measures, accuracy, chi-square measure for one quadrant, relative risk, uncovered negative, and peculiarity, have a stable, reasonable performance in estimating real human interest in the medical domain. The results also indicate that the performance of interestingness measures is influenced by the certainty of a hypothesis made by the medical expert, and that the combinational use of interestingness measures will contribute to support medical experts to generate and confirm their hypotheses through human–system interaction.

PACS: 07.05.Kf, 07.05.Mh

Keywords: Data mining, Knowledge discovery in databases, Interestingness, Postprocessing, Clinical data

To access this article, please choose from the options below

Login to an existing account or Register a new account.

  • Purchase this article for 31.50 USD (You must login/register to purchase this article)

    Online access for 24 hours. The PDF version can be downloaded as your permanent record.

  • Subscribe to this title

    Get unlimited online access to this article and all other articles in this title 24/7 for one year.

  • Claim access now

    For current subscribers with Society Membership or Account Number.

  • Visit SciVerse ScienceDirect to see if you have access via your institution.
 

PII: S0933-3657(07)00092-9

doi:10.1016/j.artmed.2007.07.005

Artificial Intelligence in Medicine
Volume 41, Issue 3 , Pages 177-196, November 2007