Table 15

Impact of the PV on the size of the index. N is the number of full-length mRNA sequences. R is the number of genomic index keys as a percentage of the number of distinct words on the unmasked genome.



                 Human 36.3    Mouse 37.1
mRNA    N        214 749       240 299
        R (%)    8.4           10.2
EST     N        7 732 838     4 836 245
        R (%)    38.6          27.2

Kapustin et al. Biology Direct 2008 3:20   doi:10.1186/1745-6150-3-20