Approximation to the mean curve in the LCS problem

The problem of sequence comparison via optimal alignments occurs naturally in many areas of applications. The simplest such technique is based on evaluating a score given by the length of a longest common subsequence divided by the average length of the original sequences. In this paper we investigate the expected value of this score when the input sequences are random and their length tends to infinity. The corresponding limit exists but is not known precisely. We derive a theoretical large deviation, convex analysis and Monte Carlo based method to compute a consistent sequence of upper bounds on the unknown limit. An empirical practical version of our method produces promising numerical results.

MoreLess

Year of publication:	2008
Authors:	Durringer, Clement ; Hauser, Raphael ; Matzinger, Heinrich
Published in:	Stochastic Processes and their Applications. - Elsevier, ISSN 0304-4149. - Vol. 118.2008, 4, p. 629-648
Publisher:	Elsevier
Keywords:	Longest common subsequence problem Chvatal-Sankoff constant Steele conjecture Mean curve Large deviation theory Monte Carlo simulation Convex analysis

Online Resource

Check full text access |

More access options

Check Google Scholar

In libraries world-wide (WorldCat)

In German libraries (KVK)

subito order

I need help

More details

Type of publication:	Article
Source:	RePEc - Research Papers in Economics

Persistent link: https://www.econbiz.de/10008872913