Sylvain, Forêt; R, Wilson Susan; J, Burden Conrad - In: Statistical Applications in Genetics and Molecular Biology 8 (2009) 1, pp. 1-21
Word matches are often used in sequence comparison methods, either as a measure of sequence similarity or in the first search steps of algorithms such as BLAST or BLAT. The D2 statistic is the number of matches of words of k letters between two sequences. Recent advances have been made in the...