Volume 48, Issue 2, Pages 139-152 (February 2010)



Development of traditional Chinese medicine clinical data warehouse for medical knowledge discovery and decision support

Xuezhong Zhou a, Shibo Chen b, Baoyan Liu c (corresponding author), Runsun Zhang d, Yinghui Wang d, Ping Li d, Yufeng Guo d, Hua Zhang e, Zhuye Gao e, Xiufeng Yan d

Received 16 August 2008; received in revised form 22 July 2009; accepted 23 July 2009.

Abstract 

Objective

Traditional Chinese medicine (TCM) is a scientific discipline that develops its theories from long-term clinical practice. Large-scale clinical data are the core source of empirical knowledge for TCM research. This paper introduces a clinical data warehouse (CDW) system that incorporates structured electronic medical record (SEMR) data for medical knowledge discovery and TCM clinical decision support (CDS).

Materials and methods

We developed a clinical reference information model (RIM) and a physical data model to manage the information entities in TCM clinical data and the relationships among them. An extraction-transformation-loading (ETL) tool was implemented to integrate and normalize clinical data from different operational data sources. The CDW includes online analytical processing (OLAP) and complex network analysis (CNA) components for exploring clinical relationships. In addition, data mining and CNA methods are used to discover clinical knowledge from the data.
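As a rough illustration of the ETL normalization and OLAP-style aggregation described above, the following minimal Python sketch normalizes symptom terms and counts them along chosen dimensions. It is not the authors' implementation: the synonym table, the record fields (`department`, `diagnosis`, `symptoms`), the function names, and the toy records are all hypothetical and invented for this example.

```python
from collections import Counter

# Hypothetical synonym table: maps source-specific terms to one normalized term.
SYNONYMS = {"sleeplessness": "insomnia", "poor sleep": "insomnia"}


def normalize(term):
    """Transformation step: trim, lower-case, and map synonyms to one term."""
    cleaned = term.strip().lower()
    return SYNONYMS.get(cleaned, cleaned)


def load(records):
    """Loading step: build a fact table with one
    (department, diagnosis, symptom) row per recorded symptom."""
    facts = []
    for rec in records:
        for symptom in rec["symptoms"]:
            facts.append((rec["department"], rec["diagnosis"], normalize(symptom)))
    return facts


def roll_up(facts, dims):
    """OLAP-style roll-up: count fact rows grouped by the chosen column indexes."""
    return Counter(tuple(row[i] for i in dims) for row in facts)


# Toy records, invented purely for illustration (not data from the paper).
records = [
    {"department": "inpatient", "diagnosis": "diabetes",
     "symptoms": ["Sleeplessness", "thirst"]},
    {"department": "outpatient", "diagnosis": "diabetes",
     "symptoms": ["poor sleep"]},
    {"department": "outpatient", "diagnosis": "hypertension",
     "symptoms": ["headache"]},
]

facts = load(records)
by_diagnosis_symptom = roll_up(facts, dims=(1, 2))  # diagnosis x symptom
by_department = roll_up(facts, dims=(0,))           # department only
```

Choosing a different `dims` tuple corresponds to viewing the same fact table along a different combination of dimensions, which is the basic idea behind a multidimensional report.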

Results

The CDW has integrated 20,000 TCM inpatient records and 20,000 outpatient records, whose main information components are manifestations (e.g. symptoms, physical examinations and laboratory test results), diagnoses and prescriptions. We propose a practical solution for large-scale clinical data integration and preprocessing. We have also developed over 400 OLAP reports to support multidimensional analysis of clinical data and case-based CDS, and we have conducted several data mining applications. In particular, we use several classification methods, namely support vector machine, decision tree and Bayesian network, to discover knowledge of syndrome differentiation. Furthermore, we have applied association rules and CNA to extract useful acupuncture point and herb combination patterns from the clinical prescriptions.
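To make the idea of mining combination patterns from prescriptions concrete, here is a minimal sketch of pairwise association rules scored by support and confidence. It is an assumption-laden toy, not the method or parameters used in the paper: the function name `pair_rules`, the `min_support` threshold, and the placeholder herb names are hypothetical.

```python
from collections import Counter
from itertools import combinations


def pair_rules(prescriptions, min_support=0.5):
    """Return rules (antecedent, consequent, support, confidence) for item pairs.

    support    = fraction of prescriptions containing both items
    confidence = fraction of prescriptions with the antecedent
                 that also contain the consequent
    """
    n = len(prescriptions)
    item_counts = Counter()
    pair_counts = Counter()
    for herbs in prescriptions:
        unique = sorted(set(herbs))  # ignore duplicates within one prescription
        item_counts.update(unique)
        pair_counts.update(combinations(unique, 2))

    rules = []
    for (a, b), count in pair_counts.items():
        support = count / n
        if support >= min_support:
            rules.append((a, b, support, count / item_counts[a]))
            rules.append((b, a, support, count / item_counts[b]))
    return rules


# Placeholder prescriptions, invented for illustration (not data from the paper).
prescriptions = [
    ["herb_A", "herb_B", "herb_C"],
    ["herb_A", "herb_B"],
    ["herb_A", "herb_D"],
    ["herb_B", "herb_C"],
]

rules = pair_rules(prescriptions, min_support=0.5)
rules_by_pair = {(a, b): (s, c) for a, b, s, c in rules}
```

In this toy input, every prescription containing `herb_C` also contains `herb_B`, so the rule from `herb_C` to `herb_B` has confidence 1.0 at support 0.5. The same counting applies unchanged to acupuncture point combinations.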

Conclusion

A CDW system with a TCM clinical RIM, ETL, OLAP and data mining as its core components has been developed to facilitate TCM knowledge discovery and CDS. We have conducted several OLAP and data mining tasks to explore the empirical knowledge in TCM clinical data. The CDW platform is a promising infrastructure for making full use of TCM clinical data in scientific hypothesis generation, and for promoting the development of TCM from individualized empirical knowledge to large-scale evidence-based medicine.

a School of Computer and Information Technology, Beijing Jiaotong University, Beijing 100044, China

b TCM Institute of Basic Clinic Medicine, China Academy of Chinese Medicine Sciences, Beijing 100700, China

c China Academy of Chinese Medicine Sciences, Beijing 100700, China

d Guanganmen Hospital, China Academy of Chinese Medicine Sciences, Beijing 100053, China

e Beijing University of Chinese Medicine, Beijing 100029, China

Corresponding author. Tel.: +86 10 64014411x2213; fax: +86 10 64007743.

PII: S0933-3657(09)00105-5

doi:10.1016/j.artmed.2009.07.012
