Publication type: Other (textual)
Title: Progressive multiple sequence alignment with the Poisson Indel Process
Authors: Maiolo, Massimo
Zhang, Xiaolei
Gil, Manuel
Anisimova, Maria
DOI: 10.21256/zhaw-4266
Extent: 8
Issue Date: 2017
Publisher / Ed. Institution: Selbstverlag
Language: English
Subject (DDC): 510: Mathematics
572: Biochemistry
Abstract: Sequence alignment lies at the heart of many evolutionary and comparative genomics studies. However, the optimal alignment of multiple sequences is NP-hard, so exact algorithms become impractical for more than a few sequences. Thus, state-of-the-art alignment methods employ progressive heuristics, breaking the problem into a series of pairwise alignments guided by a phylogenetic tree. Changes between homologous characters are typically modelled by a continuous-time Markov substitution model. In contrast, the dynamics of insertions and deletions (indels) are not modelled explicitly, because the computation of the marginal likelihood under such models has exponential time complexity in the number of taxa. Recently, Bouchard-Côté and Jordan [PNAS (2012) 110(4):1160-1166] introduced a modification to a classical indel model, describing indel evolution on a phylogenetic tree as a Poisson process. The model, termed PIP, allows the joint marginal probability of a multiple sequence alignment and a tree to be computed in linear time. Here, we present a new dynamic programming algorithm to align two multiple sequence alignments by maximum likelihood in polynomial time under PIP, and apply it in a progressive algorithm. To our knowledge, this is the first progressive alignment method that uses a rigorous mathematical formulation of an evolutionary indel process and has polynomial time complexity.
Fulltext version: Submitted version
License (according to publishing contract): Licence according to publishing contract
Departement: Life Sciences and Facility Management
Organisational Unit: Institute of Computational Life Sciences (ICLS)
Appears in collections: Publikationen Life Sciences und Facility Management

Files in This Item:
File: 123513.full.pdf
Size: 256.45 kB
Format: Adobe PDF

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.