Burden, Conrad J.; Jing, Junmei; Wilson, Susan R. - In: Statistical Applications in Genetics and Molecular Biology 11 (2011) 1, pp. 1-28
The D2 statistic, defined as the number of matches of words of some pre-specified length k between two sequences, is a computationally fast alignment-free measure of biological sequence similarity. However, there is some debate about its suitability for this purpose, as the variability in D2 may be dominated by the...
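
As an illustration of the definition above, the following is a minimal Python sketch of a D2 computation. It assumes the common reading in which every pair of positions (one in each sequence) holding the same k-letter word counts as one match, with overlapping words included; the function names `kmer_counts` and `d2` are hypothetical and are not taken from the paper.

```python
from collections import Counter


def kmer_counts(seq, k):
    """Count every overlapping word of length k in seq."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def d2(seq_a, seq_b, k):
    """Number of k-word matches between seq_a and seq_b.

    Assumption: a match is any pair of positions (i, j) such that the
    k-word starting at i in seq_a equals the k-word starting at j in seq_b.
    Summing count_a(w) * count_b(w) over words w gives the same total
    without comparing every pair of positions explicitly.
    """
    if k < 1:
        raise ValueError("k must be a positive integer")
    counts_a = kmer_counts(seq_a, k)
    counts_b = kmer_counts(seq_b, k)
    return sum(n * counts_b[word] for word, n in counts_a.items())


if __name__ == "__main__":
    print(d2("ACGTACGT", "CGTA", 2))
```

Counting words first keeps the cost roughly linear in the combined sequence length, which is consistent with the abstract's description of D2 as computationally fast.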