Artificial Intelligence in Medicine
Volume 41, Issue 2 , Pages 87-104, October 2007

Integrative mining of traditional Chinese medicine literature and MEDLINE for functional gene networks

  • Xuezhong Zhou

      Affiliations

    • China Academy of Chinese Medical Sciences, Beijing 100700, China
    • Guanganmen Hospital, China Academy of Chinese Medical Sciences, Beijing 100053, China
    • Corresponding Author InformationCorresponding author at: China Academy of Chinese Medical Sciences, Beijing 100700, China. Tel.: +86 10 88001446; fax: +86 10 63131371.
  • ,
  • Baoyan Liu

      Affiliations

    • China Academy of Chinese Medical Sciences, Beijing 100700, China
  • ,
  • Zhaohui Wu

      Affiliations

    • College of Computer Science, Zhejiang University, Hangzhou 310027, China
  • ,
  • Yi Feng

      Affiliations

    • College of Computer Science, Zhejiang University, Hangzhou 310027, China

Received 1 December 2006; received in revised form 24 July 2007; accepted 24 July 2007.

Summary 

Objective

The amount of biomedical data in different disciplines is growing at an exponential rate. Integrating these significant knowledge sources to generate novel hypotheses for systems biology research is difficult. Traditional Chinese medicine (TCM) is a completely different discipline, and is a complementary knowledge system to modern biomedical science. This paper uses a significant TCM bibliographic literature database in China, together with MEDLINE, to help discover novel gene functional knowledge.

Materials and methods

We present an integrative mining approach to uncover the functional gene relationships from MEDLINE and TCM bibliographic literature. This paper introduces TCM literature (about 50,000 records) as one knowledge source for constructing literature-based gene networks. We use the TCM diagnosis, TCM syndrome, to automatically congregate the related genes. The syndrome–gene relationships are discovered based on the syndrome–disease relationships extracted from TCM literature and the disease–gene relationships in MEDLINE. Based on the bubble-bootstrapping and relation weight computing methods, we have developed a prototype system called MeDisco/3S, which has name entity and relation extraction, and online analytical processing (OLAP) capabilities, to perform the integrative mining process.

Results

We have got about 200,000 syndrome–gene relations, which could help generate syndrome-based gene networks, and help analyze the functional knowledge of genes from syndrome perspective. We take the gene network of Kidney–Yang Deficiency syndrome (KYD syndrome) and the functional analysis of some genes, such as CRH (corticotropin releasing hormone), PTH (parathyroid hormone), PRL (prolactin), BRCA1 (breast cancer 1, early onset) and BRCA2 (breast cancer 2, early onset), to demonstrate the preliminary results. The underlying hypothesis is that the related genes of the same syndrome will have some biological functional relationships, and will constitute a functional network.

Conclusion

This paper presents an approach to integrate TCM literature and modern biomedical data to discover novel gene networks and functional knowledge of genes. The preliminary results show that the novel gene functional knowledge and gene networks, which are worthy of further investigation, could be generated by integrating the two complementary biomedical data sources. It will be a promising research field through integrative mining of TCM and modern life science literature.

Keywords: Integrative data mining, Functional gene network, Traditional Chinese medicine literature, MEDLINE, Text mining

To access this article, please choose from the options below

Login to an existing account or Register a new account.

  • Purchase this article for 31.50 USD (You must login/register to purchase this article)

    Online access for 24 hours. The PDF version can be downloaded as your permanent record.

  • Subscribe to this title

    Get unlimited online access to this article and all other articles in this title 24/7 for one year.

  • Claim access now

    For current subscribers with Society Membership or Account Number.

  • Visit SciVerse ScienceDirect to see if you have access via your institution.
 

PII: S0933-3657(07)00094-2

doi:10.1016/j.artmed.2007.07.007

Artificial Intelligence in Medicine
Volume 41, Issue 2 , Pages 87-104, October 2007