A Fast and Exact Algorithm for the Perfect Reversal Median Problem

We study the problem of finding for the gene orders of three taxa a potential ancestral gene order such that the corresponding rearrangement scenario has a minimal number of reversals where each of the reversals has to preserve the common intervals of the given input gene orders. Common intervals identify sets of genes that occur consecutively in all input gene orders. The problem of finding such an ancestral gene order is called the perfect reversal median problem (pRMP). A tree based data structure for the representation of the common intervals of all input gene orders is used for the design and realization of a fast and exact algorithm--called TCIP--for solving the pRMP. It is known that for two given gene orders the minimum number of reversals to transfer one gene order into the other can be computed in polynomial time, whereas the corresponding problem with the restriction that common intervals should not be destroyed by the reversals is already NP-hard. Nevertheless, we show empirically on biological and artificial data that TCIP for the pRMP is usually even faster than the fastest exact algorithm (Caprara's median solver) for the reversal median problem (RMP), i.e., the corresponding problem in which the common intervals are not considered.

[1]  Jijun Tang,et al.  Reconstructing phylogenies from gene-content and gene-order data , 2007, Mathematics of Evolution and Phylogeny.

[2]  Jean-Stéphane Varré,et al.  Sorting by Reversals with Common Intervals , 2004, WABI.

[3]  Jens Stoye,et al.  On the Similarity of Sets of Permutations and Its Applications to Genome Comparison , 2006, J. Comput. Biol..

[4]  Mathieu Raffinot,et al.  Computing Common Intervals of K Permutations, with Applications to Modular Decomposition of Graphs , 2005, ESA.

[5]  Rita Casadio,et al.  Algorithms in Bioinformatics, 5th International Workshop, WABI 2005, Mallorca, Spain, October 3-6, 2005, Proceedings , 2005, WABI.

[6]  Pavel A. Pevzner,et al.  Transforming cabbage into turnip: polynomial algorithm for sorting signed permutations by reversals , 1995, JACM.

[7]  Matthias Bernt,et al.  A Parallel Algorithm for Solving the Reversal Median Problem , 2005, PPAM.

[8]  Anne Bergeron,et al.  Advances on sorting by reversals , 2007, Discret. Appl. Math..

[9]  Ingrid Fischer,et al.  Computational life sciences II , 2005 .

[10]  Bernard M. E. Moret,et al.  Finding an Optimal Inversion Median: Experimental Results , 2001, WABI.

[11]  Olivier Gascuel,et al.  Mathematics of Evolution and Phylogeny , 2005 .

[12]  Matthias Bernt,et al.  The Reversal Median Problem, Common Intervals, and Mitochondrial Gene Orders , 2006, CompLife.

[13]  Tao Liu,et al.  Inversion Medians Outperform Breakpoint Medians in Phylogeny Reconstruction from Gene-Order Data , 2002, WABI.

[14]  P. Pevzner,et al.  Genome-scale evolution: reconstructing gene orders in the ancestral species. , 2002, Genome research.

[15]  Alberto Caprara The Reversal Median Problem , 2003, INFORMS J. Comput..

[16]  Takeaki Uno,et al.  Fast Algorithms to Enumerate All Common Intervals of Two Permutations , 1997, Algorithmica.

[17]  Matthias Bernt,et al.  Using median sets for inferring phylogenetic trees , 2007, Bioinform..

[18]  Stefano Leonardi,et al.  Algorithms - ESA 2005, 13th Annual European Symposium, Palma de Mallorca, Spain, October 3-6, 2005, Proceedings , 2005, ESA.

[19]  Jens Stoye,et al.  Finding All Common Intervals of k Permutations , 2001, CPM.

[20]  C. Paul,et al.  Perfect Sorting by Reversals Is Not Always Difficult , 2007, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[21]  Jens Stoye,et al.  Reversal Distance without Hurdles and Fortresses , 2004, CPM.

[22]  Annie Chateau,et al.  Reconstructing Ancestral Gene Orders Using Conserved Intervals , 2004, WABI.

[23]  Cedric Chauve,et al.  Conservation of Combinatorial Structures in Evolution Scenarios , 2004, Comparative Genomics.

[24]  Matthias Bernt,et al.  Genome Rearrangement Based on Reversals that Preserve Conserved Intervals , 2006, IEEE/ACM Transactions on Computational Biology and Bioinformatics.