SABRES: Sequence Alignment By Rare Event Simulation

We present a new stochastic method for finding the optimal alignment of DNA sequences. The method works by generating random paths through a graph (the edit graph) according to a Markov chain. Each path is assigned a score, and these scores are used to modify the transition probabilities of the Markov chain. This procedure converges to a fixed path through the graph, corresponding to the optimal (or near optimal) sequence alignment. The rules with which to update the transition probabilities are based on a new technique for stochastic optimisation known as the Cross-Entropy Method. This leads to very simple and natural updating formulas. Due to its versatility, mathematical tractability and simplicity, the method has great potential for a large class of combinatorial optimisation problems, in particular in biological sciences. Numerical results support the effectiveness of the method.