Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W435-9.
doi: 10.1093/nar/gkl200.

AUGUSTUS: ab initio prediction of alternative transcripts


Mario Stanke et al. Nucleic Acids Res. 2006.

Abstract

AUGUSTUS is a software tool for gene prediction in eukaryotes based on a Generalized Hidden Markov Model, a probabilistic model of a sequence and its gene structure. Like most existing gene finders, the first version of AUGUSTUS returned one transcript per predicted gene and ignored the phenomenon of alternative splicing. Herein, we present a WWW server for an extended version of AUGUSTUS that is able to predict multiple splice variants. To our knowledge, this is the first ab initio gene finder that can predict multiple transcripts. In addition, we offer a motif searching facility, where user-defined regular expressions can be searched against putative proteins encoded by the predicted genes. The AUGUSTUS web interface and the downloadable open-source stand-alone program are freely available from http://augustus.gobics.de.
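
The motif search described above can be illustrated with a minimal sketch. It is not the AUGUSTUS server's implementation: the function name `find_motif`, the example sequences, and the use of Python's `re` syntax are all assumptions made for illustration, and the server's own regular-expression dialect may differ.

```python
import re


def find_motif(proteins, pattern):
    """Return (protein_id, start, end, matched_text) for every regex hit.

    `proteins` maps a transcript identifier to its amino-acid sequence.
    Matches are non-overlapping, as produced by re.finditer.
    """
    regex = re.compile(pattern)
    hits = []
    for protein_id, sequence in proteins.items():
        for match in regex.finditer(sequence):
            hits.append((protein_id, match.start(), match.end(), match.group()))
    return hits


# Hypothetical putative proteins, keyed by AUGUSTUS-style transcript names.
# These sequences are invented for the example, not real predictions.
predicted_proteins = {
    "g1.t1": "MKTAYNGSIAKQR",
    "g1.t2": "MLLNPTAAGG",
}

# Example user-defined pattern: N, then any residue except P,
# then S or T, then any residue except P.
hits = find_motif(predicted_proteins, r"N[^P][ST][^P]")
for protein_id, start, end, text in hits:
    print(f"{protein_id}\t{start}-{end}\t{text}")
```

With these invented sequences the only hit is `NGSI` at positions 5-9 (0-based, end-exclusive) of `g1.t1`; in `g1.t2` the asparagine is followed by a proline, which the pattern excludes.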

Figures

Figure 1
The human gene ATP5G1 and the AUGUSTUS ab initio prediction for this region. The first transcript (g1.t1) is also the one predicted by standard AUGUSTUS using the Viterbi algorithm only. It misses the second exon of the gene. The second transcript (g1.t2) contains that exon and is correct. The height of a box (black: exon, light gray: intron) reflects the posterior probability of that exon or intron: The higher the posterior probability, the higher the box.
Figure 2
Region of a human gene on the forward strand for which AUGUSTUS predicted six transcripts (gene SON, chromosome 21, 33 837 000–33 872 000, NCBI build 35). The long intron of transcript g1.t3 containing position 20 000 has low posterior probability. Thus, the model is unsure whether this is actually one gene, two or three genes. In fact, for this gene there exists EST evidence both for the short transcript g1.t1 and for longer transcripts with exons mostly agreeing with those predicted above.
