Artificial Intelligence in Medicine
Volume 45, Issue 2 , Pages 135-150, February 2009

Fully non-homogeneous hidden Markov model double net: A generative model for haplotype reconstruction and block discovery

  • Alessandro Perina

      Affiliations

    • Department of Computer Science, University of Verona, Strada le Grazie 15, 37134 Verona, Italy
    • Corresponding Author InformationCorresponding author. Tel.: +39 045 8027803; fax: +39 045 8027068.
  • ,
  • Marco Cristani

      Affiliations

    • Department of Computer Science, University of Verona, Strada le Grazie 15, 37134 Verona, Italy
  • ,
  • Luciano Xumerle

      Affiliations

    • Department of Mother and Child, Biology and Genetics, Section Biology and Genetics, University of Verona, Strada le Grazie 8, 37134 Verona, Italy
  • ,
  • Vittorio Murino

      Affiliations

    • Department of Computer Science, University of Verona, Strada le Grazie 15, 37134 Verona, Italy
  • ,
  • Pier Franco Pignatti

      Affiliations

    • Department of Mother and Child, Biology and Genetics, Section Biology and Genetics, University of Verona, Strada le Grazie 8, 37134 Verona, Italy
  • ,
  • Giovanni Malerba

      Affiliations

    • Department of Mother and Child, Biology and Genetics, Section Biology and Genetics, University of Verona, Strada le Grazie 8, 37134 Verona, Italy

Received 31 October 2007; received in revised form 21 August 2008; accepted 22 August 2008.

Summary 

Objective

In the last decade, haplotype reconstruction in unrelated individuals and haplotype block discovery have riveted the attention of computer scientists due to the involved strong computational aspects. Such tasks are usually addressed separately, but recently, statistical techniques have permitted them to be solved jointly. Following this trend we propose a generative model that permits researchers to solve the two problems jointly.

Method

The model inference is based on variational learning, which permits one to estimate quickly the model parameters while remaining robust even to local minima. The model parameters are then used to segment genotypes into blocks by thresholding a quantitative measure of boundary presence.

Results

Experiments on real data are presented, and state-of-the-art systems for haplotype reconstruction and strategies for block estimation are considered as comparison.

Conclusions

The proposed method can be used for a fast and reliable estimation of haplotype frequencies and the relative block structure. Moreover, the method can be easily used as part of a more complex system. The threshold used for block discovery can be related to the quality-of-fit reached in the model learning, resulting in an unsupervised strategy for block estimation.

Keywords: Haplotype reconstruction, Bayesian network, Variational learning, Block structure

To access this article, please choose from the options below

Login to an existing account or Register a new account.

  • Purchase this article for 31.50 USD (You must login/register to purchase this article)

    Online access for 24 hours. The PDF version can be downloaded as your permanent record.

  • Subscribe to this title

    Get unlimited online access to this article and all other articles in this title 24/7 for one year.

  • Claim access now

    For current subscribers with Society Membership or Account Number.

  • Visit SciVerse ScienceDirect to see if you have access via your institution.
 

PII: S0933-3657(08)00127-9

doi:10.1016/j.artmed.2008.08.015

Artificial Intelligence in Medicine
Volume 45, Issue 2 , Pages 135-150, February 2009