Abstract
One central goal of genome biology is to understand how the usage of the genome differs between organisms. Our knowledge of genome composition, needed for downstream inferences, is critically dependent on gene annotations, yet problems associated with gene annotation and assembly errors are usually ignored in comparative genomics. Here, we analyze the genomes of 68 species across 12 animal phyla and some single-cell eukaryotes for general trends in genome composition and transcription, taking into account problems of gene annotation. We show that, regardless of genome size, the ratio of introns to intergenic sequence is comparable across essentially all animals, with nearly all deviations dominated by increased intergenic sequence. Genomes of model organisms have ratios much closer to 1: 1, suggesting that the majority of published genomes of nonmodel organisms are under-annotated and consequently omit substantial numbers of genes, with likely negative impact on evolutionary interpretations. Finally, our results also indicate that most animals transcribe half or more of their genomes arguing against differences in genome usage between animal groups, and also suggesting that the transcribed portion is more dependent on genome size than previously thought.
Dokumententyp: | Zeitschriftenartikel |
---|---|
Fakultät: | Geowissenschaften > Department für Geo- und Umweltwissenschaften |
Fakultätsübergreifende Einrichtungen: | GeoBio-Center |
Themengebiete: | 500 Naturwissenschaften und Mathematik > 550 Geowissenschaften, Geologie
500 Naturwissenschaften und Mathematik > 500 Naturwissenschaften |
ISSN: | 1759-6653 |
Sprache: | Englisch |
Dokumenten ID: | 54766 |
Datum der Veröffentlichung auf Open Access LMU: | 14. Jun. 2018, 09:57 |
Letzte Änderungen: | 04. Nov. 2020, 13:34 |