Artificial Intelligence in Medicine
Volume 43, Issue 2 , Pages 99-111, June 2008

Identification of gene transcript signatures predictive for estrogen receptor and lymph node status using a stepwise forward selection artificial neural network modelling approach

  • Lee J. Lancashire

      Affiliations

    • Clinical and Experimental Pharmacology, Paterson Institute for Cancer Research, University of Manchester, Manchester M20 4BX, United Kingdom
    • The Nottingham Trent University, School of Biomedical and Natural Sciences, Clifton Campus, Clifton Lane, Nottingham NG11 8NS, United Kingdom
    • Corresponding author at: Clinical and Experimental Pharmacology, Paterson Institute for Cancer Research, University of Manchester, Manchester M20 4BX, United Kingdom. Tel.: +44 161 446 3156; fax: +44 161 446 3109.
  • Robert C. Rees

      Affiliations

    • The Nottingham Trent University, School of Biomedical and Natural Sciences, Clifton Campus, Clifton Lane, Nottingham NG11 8NS, United Kingdom
  • Graham R. Ball

      Affiliations

    • The Nottingham Trent University, School of Biomedical and Natural Sciences, Clifton Campus, Clifton Lane, Nottingham NG11 8NS, United Kingdom
    • Corresponding author. Tel.: +44 115 848 3394; fax: +44 115 848 3093.

Received 9 January 2007; received in revised form 29 February 2008; accepted 10 March 2008.

Summary 

Objective

The advent of microarrays has attracted considerable interest from biologists due to the potential for high-throughput analysis of hundreds of thousands of gene transcripts. Subsequent analysis of the data may identify specific features which correspond to characteristics of interest within the population; for example, gene expression profiles from cancer patients can be analysed to identify molecular signatures that correspond with prognostic outcome. These high-throughput technologies have resulted in an unprecedented rate of data generation, often of high complexity, highlighting the need for novel data analysis methodologies that can cope with data of this nature.

Methods

Stepwise methods using artificial neural networks (ANNs) have been developed to identify an optimal subset of predictive gene transcripts from high-dimensional microarray data. Here, these methods have been applied to a gene microarray dataset to identify and validate gene signatures corresponding with estrogen receptor and lymph node status in breast cancer.
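
The abstract does not give the network architecture, the training settings or the stopping rule, so the following is only a minimal sketch of what a stepwise forward selection wrapper around a small ANN might look like. Everything in it is an assumption made for illustration: the function names (`train_mlp`, `val_error`, `forward_select`, `make_toy_data`), the two-unit hidden layer, the learning rate, the use of validation mean squared error as the ranking criterion, and the synthetic data. It is not the authors' implementation.

```python
import math
import random

def _sigmoid(z):
    z = max(-60.0, min(60.0, z))  # clamp to avoid overflow in exp
    return 1.0 / (1.0 + math.exp(-z))

def _forward(model, x):
    """Return hidden activations and output of a one-hidden-layer network."""
    W1, W2 = model
    h = [math.tanh(sum(w * v for w, v in zip(row, x)) + row[-1]) for row in W1]
    out = _sigmoid(sum(w * v for w, v in zip(W2, h)) + W2[-1])
    return h, out

def train_mlp(X, y, hidden=2, epochs=200, lr=0.1, seed=0):
    """Train a tanh/sigmoid network by plain SGD (hypothetical settings)."""
    rng = random.Random(seed)
    n_in = len(X[0])
    # Last entry of every weight row is the bias term.
    W1 = [[rng.uniform(-0.5, 0.5) for _ in range(n_in + 1)] for _ in range(hidden)]
    W2 = [rng.uniform(-0.5, 0.5) for _ in range(hidden + 1)]
    for _ in range(epochs):
        for x, t in zip(X, y):
            h, out = _forward((W1, W2), x)
            d_out = out - t  # cross-entropy gradient w.r.t. output pre-activation
            for j in range(hidden):
                d_h = d_out * W2[j] * (1.0 - h[j] ** 2)  # uses W2[j] before its update
                for i in range(n_in):
                    W1[j][i] -= lr * d_h * x[i]
                W1[j][-1] -= lr * d_h
                W2[j] -= lr * d_out * h[j]
            W2[-1] -= lr * d_out
    return W1, W2

def _take(X, rows, cols):
    return [[X[i][c] for c in cols] for i in rows]

def val_error(X, y, cols, repeats=3, seed=0):
    """Mean validation MSE over repeated random splits, using only `cols`."""
    rng = random.Random(seed)  # same splits for every candidate subset
    idx = list(range(len(X)))
    total = 0.0
    for r in range(repeats):
        rng.shuffle(idx)
        cut = int(0.6 * len(idx))
        tr, va = idx[:cut], idx[cut:]
        model = train_mlp(_take(X, tr, cols), [y[i] for i in tr], seed=r)
        preds = [_forward(model, x)[1] for x in _take(X, va, cols)]
        total += sum((p - y[i]) ** 2 for p, i in zip(preds, va)) / len(va)
    return total / repeats

def forward_select(X, y, max_features=3):
    """Greedily add the input that most lowers validation error; stop when none does."""
    selected, best_err = [], float("inf")
    remaining = list(range(len(X[0])))
    while remaining and len(selected) < max_features:
        errs = {c: val_error(X, y, selected + [c]) for c in remaining}
        c_best = min(errs, key=errs.get)
        if errs[c_best] >= best_err:
            break
        selected.append(c_best)
        remaining.remove(c_best)
        best_err = errs[c_best]
    return selected, best_err

def make_toy_data(n=40, n_features=5, informative=2, seed=1):
    """Synthetic stand-in for expression data: one informative column, rest noise."""
    rng = random.Random(seed)
    X, y = [], []
    for k in range(n):
        label = k % 2
        row = [rng.gauss(0, 1) for _ in range(n_features)]
        row[informative] = (1.0 if label else -1.0) + rng.gauss(0, 0.2)
        X.append(row)
        y.append(label)
    return X, y

if __name__ == "__main__":
    X, y = make_toy_data()
    print(forward_select(X, y))
```

On real microarray data the candidate pool would be tens of thousands of probe sets rather than five columns, so each round of such a wrapper trains one model per remaining candidate; the sketch keeps the network tiny for that reason.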

Results

Many gene transcripts were identified whose expression could classify patients with very high accuracy according to, firstly, whether they were positive or negative for estrogen receptor and, secondly, whether metastasis to the axillary lymph node had occurred. A number of these genes had been previously reported to have a role in cancer. Significantly fewer genes were used than in previous studies. The models using the optimal gene subsets were internally validated using an extensive random sample cross-validation procedure and externally validated using a follow-up dataset from a different cohort of patients on a newer array chip containing the same and additional probe sets. Here, the models retained high accuracies, emphasising the potential power of this approach in analysing complex systems. These findings show how the proposed method allows for the rapid analysis and subsequent detailed interrogation of gene expression signatures to provide a further understanding of the underlying molecular mechanisms that could be important in determining novel prognostic markers associated with cancer.
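
The internal validation mentioned above is described only as random sample cross-validation. One common reading of that phrase is repeated random train/test splitting (often called Monte Carlo cross-validation), which could be sketched as follows. The number of repeats, the 30% test fraction and the function names are assumptions, and the nearest-centroid classifier is a deliberately simple stand-in for the ANN so that the example stays short; none of this is taken from the paper.

```python
import random
import statistics

def fit_centroids(X, y):
    """Stand-in model (NOT an ANN): the mean vector of each class."""
    cents = {}
    for label in set(y):
        rows = [x for x, t in zip(X, y) if t == label]
        cents[label] = [sum(col) / len(rows) for col in zip(*rows)]
    return cents

def predict_centroids(cents, x):
    """Assign x to the class whose centroid is nearest (squared Euclidean)."""
    return min(cents, key=lambda c: sum((a - b) ** 2 for a, b in zip(cents[c], x)))

def monte_carlo_cv(X, y, fit, predict, repeats=50, test_frac=0.3, seed=0):
    """Repeat: shuffle, hold out a random test set, fit on the rest, score accuracy."""
    rng = random.Random(seed)
    idx = list(range(len(X)))
    n_test = max(1, int(round(test_frac * len(idx))))
    accs = []
    for _ in range(repeats):
        rng.shuffle(idx)
        te, tr = idx[:n_test], idx[n_test:]
        model = fit([X[i] for i in tr], [y[i] for i in tr])
        hits = sum(predict(model, X[i]) == y[i] for i in te)
        accs.append(hits / n_test)
    # Mean accuracy and its spread across the random splits.
    return statistics.mean(accs), statistics.pstdev(accs)

def make_separable_data(n=40, seed=3):
    """Synthetic two-feature data with well separated classes (illustration only)."""
    rng = random.Random(seed)
    X, y = [], []
    for k in range(n):
        label = k % 2
        centre = 1.0 if label else -1.0
        X.append([centre + rng.gauss(0, 0.3), centre + rng.gauss(0, 0.3)])
        y.append(label)
    return X, y

if __name__ == "__main__":
    X, y = make_separable_data()
    print(monte_carlo_cv(X, y, fit_centroids, predict_centroids))
```

Reporting the spread across splits alongside the mean gives a sense of how sensitive the accuracy estimate is to which patients happen to fall in the held-out set; external validation on a separate cohort, as the abstract describes, remains a distinct and stronger check.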

Keywords: Artificial neural networks, Predictive modelling, Gene expression, Breast cancer

PII: S0933-3657(08)00029-8

doi:10.1016/j.artmed.2008.03.001
