|
Figure 1.
Distribution of the positions of U12 and U2 introns across the length of the coding
sequences in Arabidopsis (A) and human (B) genes. For each intron, the distance from the 5' end of the coding sequence (after removal
of intron sequences) was calculated and divided by the total length of the coding
sequence. The resulting fractions were partitioned into 10 bins of equal width. To
compare the distribution of introns in the 5' and 3' halves of genes, the total numbers
of introns in bins 1–5 and in bins 6–10 were compared. The statistical significance
of the difference between these numbers was determined using the two-sided binomial
test, with the sum of these numbers treated as the total number of trials and the
number of introns in bins 1–5 as the number of successes, with an expected success probability
of 0.5. (A) 139,982 U2 and 182 U12 intron positions in Arabidopsis genes were analyzed. Among the U2 introns, 69,819 and 70,163 introns were contained
in the 5' and 3' portions of genes, respectively (P = 0.3593). For U12 introns, 114 and 68 introns were contained in the 5' and 3' portions
of genes, respectively (P = 8.04 × 10⁻⁴). (B) 230,339 U2 and 570 U12 intron positions in human genes were analyzed. Among the U2
introns, 119,949 and 110,390 introns were contained in the 5' and 3' region, respectively
(P = 2.2 × 10⁻¹⁶). Among the U12 introns, 318 and 252 introns were present in the 5' and 3' region,
respectively (P = 6.4 × 10⁻³).
Basu et al. Biology Direct 2008 3:19 doi:10.1186/1745-6150-3-19 |
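
The binning and test described in the caption can be sketched as follows. This is a minimal illustration using only the Python standard library, not the authors' code: the function names `position_bin` and `binomial_two_sided_p` are hypothetical, the treatment of positions that fall exactly on a bin boundary is an assumption, and the two-sided P value is taken as twice the smaller tail, which is valid only because the expected success probability is 0.5 and the distribution is therefore symmetric.

```python
import math


def position_bin(offset_nt, cds_length_nt, n_bins=10):
    """Return the 1-based bin (1..n_bins) of an intron position.

    offset_nt is the distance from the 5' end of the intron-free coding
    sequence. Boundary handling is an assumption: a position exactly on a
    boundary goes to the higher bin, except the 3' end, which goes to the
    last bin.
    """
    if cds_length_nt <= 0 or not 0 <= offset_nt <= cds_length_nt:
        raise ValueError("offset must lie within the coding sequence")
    # Integer arithmetic avoids floating-point rounding at bin boundaries.
    return min((offset_nt * n_bins) // cds_length_nt, n_bins - 1) + 1


def binomial_two_sided_p(successes, trials):
    """Exact two-sided binomial P value for a success probability of 0.5.

    Sums the probability mass of the smaller tail in log space (so that
    large counts do not overflow) and doubles it.
    """
    k = min(successes, trials - successes)
    log_norm = math.lgamma(trials + 1) - trials * math.log(2.0)
    tail = sum(
        math.exp(log_norm - math.lgamma(i + 1) - math.lgamma(trials - i + 1))
        for i in range(k + 1)
    )
    return min(1.0, 2.0 * tail)


# Arabidopsis U12 introns from the caption: 114 in bins 1-5, 68 in bins 6-10.
five_prime, three_prime = 114, 68
p_value = binomial_two_sided_p(five_prime, five_prime + three_prime)
print(f"U12 (Arabidopsis): P = {p_value:.3g}")
```

The number of trials is the sum of the two half-gene counts, and the number of successes is the count in bins 1–5, as the caption specifies. The same call can be applied to the other three pairs of counts reported in panels (A) and (B).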