Artificial Intelligence in Medicine
Volume 48, Issue 2 , Pages 139-152, February 2010

Development of traditional Chinese medicine clinical data warehouse for medical knowledge discovery and decision support

  • Xuezhong Zhou

      Affiliations

    • School of Computer and Information Technology, Beijing Jiaotong University, Beijing 100044, China
  • ,
  • Shibo Chen

      Affiliations

    • TCM Institute of Basic Clinic Medicine, China Academy of Chinese Medicine Sciences, Beijing 100700, China
  • ,
  • Baoyan Liu

      Affiliations

    • China Academy of Chinese Medicine Sciences, Beijing 100700, China
    • Corresponding Author InformationCorresponding author. Tel.: +86 10 64014411x2213; fax: +86 10 64007743.
  • ,
  • Runsun Zhang

      Affiliations

    • Guanganmen Hospital, China Academy of Chinese Medicine Sciences, Beijing 100053, China
  • ,
  • Yinghui Wang

      Affiliations

    • Guanganmen Hospital, China Academy of Chinese Medicine Sciences, Beijing 100053, China
  • ,
  • Ping Li

      Affiliations

    • Guanganmen Hospital, China Academy of Chinese Medicine Sciences, Beijing 100053, China
  • ,
  • Yufeng Guo

      Affiliations

    • Guanganmen Hospital, China Academy of Chinese Medicine Sciences, Beijing 100053, China
  • ,
  • Hua Zhang

      Affiliations

    • Beijing University of Chinese Medicine, Beijing 100029, China
  • ,
  • Zhuye Gao

      Affiliations

    • Beijing University of Chinese Medicine, Beijing 100029, China
  • ,
  • Xiufeng Yan

      Affiliations

    • Guanganmen Hospital, China Academy of Chinese Medicine Sciences, Beijing 100053, China

Received 16 August 2008; received in revised form 22 July 2009; accepted 23 July 2009.

Abstract 

Objective

Traditional Chinese medicine (TCM) is a scientific discipline, which develops the related theories from the long-term clinical practices. The large-scale clinical data are the core empirical knowledge source for TCM research. This paper introduces a clinical data warehouse (CDW) system, which incorporates the structured electronic medical record (SEMR) data for medical knowledge discovery and TCM clinical decision support (CDS).

Materials and methods

We have developed the clinical reference information model (RIM) and physical data model to manage the various information entities and their relationships in TCM clinical data. An extraction-transformation-loading (ETL) tool is implemented to integrate and normalize the clinical data from different operational data sources. The CDW includes online analytical processing (OLAP) and complex network analysis (CNA) components to explore the various clinical relationships. Furthermore, the data mining and CNA methods are used to discover the valuable clinical knowledge from the data.

Results

The CDW has integrated 20,000 TCM inpatient data and 20,000 outpatient data, which contains manifestations (e.g. symptoms, physical examinations and laboratory test results), diagnoses and prescriptions as the main information components. We propose a practical solution to accomplish the large-scale clinical data integration and preprocessing tasks. Meanwhile, we have developed over 400 OLAP reports to enable the multidimensional analysis of clinical data and the case-based CDS. We have successfully conducted several interesting data mining applications. Particularly, we use various classification methods, namely support vector machine, decision tree and Bayesian network, to discover the knowledge of syndrome differentiation. Furthermore, we have applied association rule and CNA to extract the useful acupuncture point and herb combination patterns from the clinical prescriptions.

Conclusion

A CDW system consisting of TCM clinical RIM, ETL, OLAP and data mining as the core components has been developed to facilitate the tasks of TCM knowledge discovery and CDS. We have conducted several OLAP and data mining tasks to explore the empirical knowledge from the TCM clinical data. The CDW platform would be a promising infrastructure to make full use of the TCM clinical data for scientific hypothesis generation, and promote the development of TCM from individualized empirical knowledge to large-scale evidence-based medicine.

Keywords: Clinical data warehouse, Traditional Chinese medicine, Clinical data mining, Clinical decision support

To access this article, please choose from the options below

Login to an existing account or Register a new account.

  • Purchase this article for 31.50 USD (You must login/register to purchase this article)

    Online access for 24 hours. The PDF version can be downloaded as your permanent record.

  • Subscribe to this title

    Get unlimited online access to this article and all other articles in this title 24/7 for one year.

  • Claim access now

    For current subscribers with Society Membership or Account Number.

  • Visit SciVerse ScienceDirect to see if you have access via your institution.
 

PII: S0933-3657(09)00105-5

doi:10.1016/j.artmed.2009.07.012

Artificial Intelligence in Medicine
Volume 48, Issue 2 , Pages 139-152, February 2010