Volume 41, Issue 3 , Pages 177-196, November 2007
Evaluation of rule interestingness measures in medical knowledge discovery in databases
Summary
Objective
We discuss the usefulness of rule interestingness measures for medical KDD through experiments using clinical datasets, and, based on the outcomes of these experiments, also consider how to utilize these measures in postprocessing.
Methods and materials
We first conducted an experiment to compare the evaluation results derived from a total of 40 various interestingness measures with those supplied by a medical expert for rules discovered in a clinical dataset on meningitis. We calculated and compared the performance of each interestingness measure to estimate a medical expert’s interest using f-measure and correlation coefficient. We then conducted a similar experiment for hepatitis.
Results and conclusion
The comprehensive results of experiments on meningitis and hepatitis indicate that the interestingness measures, accuracy, chi-square measure for one quadrant, relative risk, uncovered negative, and peculiarity, have a stable, reasonable performance in estimating real human interest in the medical domain. The results also indicate that the performance of interestingness measures is influenced by the certainty of a hypothesis made by the medical expert, and that the combinational use of interestingness measures will contribute to support medical experts to generate and confirm their hypotheses through human–system interaction.
Keywords: Data mining, Knowledge discovery in databases, Interestingness, Postprocessing, Clinical data
To access this article, please choose from the options below
PII: S0933-3657(07)00092-9
doi:10.1016/j.artmed.2007.07.005
© 2007 Elsevier B.V. All rights reserved.
Volume 41, Issue 3 , Pages 177-196, November 2007
