The role of Havana and communities in the manual curation of unfinished vertebrate genomes
Correspondence: (Login to view email address)
- Wellcome Trust Sanger Institute
- Document Type:
- Poster
- Date:
- Received 20 April 2009 13:17 UTC; Posted 20 April 2009
- Subjects:
- Genetics & Genomics, Bioinformatics
- Abstract:
Manual annotation (the “museum” model of annotation) relies on a small group of specialized curators to catalogue and classify genes according to their functional roles. This is both costly and time consuming and therefore is used only for model organisms with sufficient funding. Smaller research communities often have to rely on other models of annotation, mainly automated annotation (the “factory” model, e.g. Ensembl), and the “jamboree” model (in which a group of leading biologists from the community and bioinformaticians come together for a short intensive annotation workshop). At the Wellcome Trust Sanger Institute (WTSI), the Havana team provides high quality manual annotation of finished vertebrate genome sequences, namely human, mouse and zebrafish. We also perform the curation of specific finished regions such as the MHC in dog, cow and pig, whose whole genomes have been assembled from unfinished BACs or from whole genome shotgun sequences. In addition, we at Havana have also hosted annotation jamborees for the cow (Bos taurus) and pig (Sus scrofa) genomes. During those sessions, the research community had the opportunity to annotate their genes of interest under expert guidance using the custom written publicly available Otterlace annotation system, and the unified manual annotation guidelines. By making use of the tools and skills acquired during the cow and pig jamborees, the delegates can continue annotating their genomes remotely. For the pig genome, a highly contiguous physical map has been generated by an international effort of four laboratories (available in Pre!Ensembl) and is being used as a substrate for the swine genome sequencing project. Upcoming vertebrate genomes will be sequenced to a high depth coverage with the next generation sequencing technologies (e.g. Illumina, 454, SOLiD) but will have the drawback of not being manually finished. Manual annotation will be more accurate than the automated predictions at coping with any assembly problems derived from these high coverage but unfinished (or automatic pre-finished) genomes. Once these inherent assembly errors are corrected and the gene structures are accurately identified with manual annotation, the curated genes will be incorporated and merged with the predicted gene models in Ensembl to provide a unified view of the landscape of vertebrate genomes. I will present an introduction to our manual annotation system and our experience using it for annotation jamborees at the WTSI.
- Collection:
- 3rd International Biocuration Conference
- Presented at:
- 3rd International Biocuration Conference, 16 April 2009
Discussion
- Votes:
-
2 votes
- Comments:
-
1 comment
- (Login to share with a colleague)
Additional information
- License:
- This document is licensed to the public under the Creative Commons Attribution 3.0 License
- How to cite this document:
-
Carvalho-Silva, Denise, Chen, Chao-Kung, Frankish, Adam, Gilbert, James, Gordon, Leo, Hunt, Toby, Larbaoui, Mustapha, Loveland, Jane, Mudge, Jonathan, Sehra, Harminder, Snow, Catherine, Steward, Charlie, Suner, Marie Marthe, Thomas, Mark, Wilming, Laurens, and Harrow, Jennifer. The role of Havana and communities in the manual curation of unfinished vertebrate genomes. Available from Nature Precedings <http://dx.doi.org/10.1038/npre.2009.3102.1> (2009)
- Version info:
-
Other versions of this document in Nature Precedings
None.
Other versions of this document elsewhere on the web
None known.
Paul Davis on 23 April 2009 14:53 UTC
Excellent poster and a very nice person to talk with.