doi:10.1038/npre.2009.3102.1
2 votes

The role of Havana and communities in the manual curation of unfinished vertebrate genomes

Denise R. Carvalho-Silva1, Chao-Kung Chen1, Adam Frankish1, James Gilbert1, Leo Gordon1, Toby Hunt1, Mustapha Larbaoui1, Jane E. Loveland1, Jonathan Mudge1, Harminder Sehra1, Catherine Snow1, Charlie Steward1, Marie Marthe Suner1, Mark Thomas1, Laurens Wilming1 & Jennifer Harrow1

Correspondence: (Login to view email address)

  1. Wellcome Trust Sanger Institute
Document Type:
Poster
Date:
Received 20 April 2009 13:17 UTC; Posted 20 April 2009
Subjects:
Genetics & Genomics, Bioinformatics
Tags:
Abstract:

Manual annotation‭ (‬the‭ “‬museum‭” ‬model of annotation‭) ‬relies on a small group of specialized curators to catalogue and classify genes according to their functional roles.‭ This‬ is both costly and time consuming and therefore is used only for model organisms with sufficient funding.‭ ‬Smaller research communities often have to rely on other models of annotation,‭ ‬mainly automated annotation‭ (‬the‭ “‬factory‭” ‬model,‭ ‬e.g.‭ ‬Ensembl‭)‬,‭ ‬and the‭ “‬jamboree‭” ‬model‭ (‬in which a group of leading biologists from the community and bioinformaticians come together for a short intensive annotation workshop‭)‬.‭ ‬At the Wellcome Trust Sanger Institute‭ (‬WTSI‭)‬,‭ ‬the Havana team provides high quality manual annotation of finished vertebrate genome sequences,‭ ‬namely human,‭ ‬mouse and zebrafish.‭ ‬We also perform the curation of specific finished regions such as the MHC in dog,‭ ‬cow and pig,‭ ‬whose whole genomes have been‭ ‬assembled from unfinished BACs or from whole genome shotgun sequences.‭ ‬In addition,‭ ‬we at Havana have also hosted annotation jamborees for the cow‭ (‬Bos taurus‭) ‬and pig‭ (‬Sus scrofa‭) ‬genomes.‭ ‬During those sessions,‭ ‬the research community had the opportunity to annotate their genes of interest under expert guidance using the custom written publicly available Otterlace annotation system,‭ ‬and the unified manual annotation guidelines.‭ ‬By making use of the tools and skills acquired during the cow and pig jamborees,‭ ‬the delegates can continue annotating their genomes remotely.‭ ‬For the pig genome,‭ ‬a highly contiguous physical map has been generated by an international effort of four laboratories (available in Pre!Ensembl) and‭ ‬is being used as a substrate for the swine genome sequencing project.‭ ‬Upcoming vertebrate genomes will be sequenced to a high depth coverage with the next generation sequencing technologies‭ (‬e.g.‭ ‬Illumina,‭ ‬454,‭ ‬SOLiD‭) ‬but will have the drawback of not being manually finished.‭ ‬Manual annotation will be more accurate than the automated predictions at coping with any assembly problems derived from these high coverage but unfinished‭ (‬or automatic pre-finished‭) ‬genomes.‭ ‬Once these inherent assembly errors are corrected and the gene structures are accurately identified with manual annotation,‭ ‬the curated genes will be incorporated and merged with the predicted gene models in Ensembl to provide a unified view of the landscape of vertebrate genomes.‭ ‬I will present an introduction to our manual annotation system and our experience using it for annotation jamborees at the WTSI.

Collection:
3rd International Biocuration Conference
Presented at:
3rd International Biocuration Conference, 16 April 2009

Discussion

Votes:

2 votes

(Login to vote)

Comments:

1 comment

Paul Davis on 23 April 2009 14:53 UTC

Excellent poster and a very nice person to talk with.

(Login to post a comment)

(Login to share with a colleague)

Additional information

License:
This document is licensed to the public under the Creative Commons Attribution 3.0 License
How to cite this document:

Carvalho-Silva, Denise, Chen, Chao-Kung, Frankish, Adam, Gilbert, James, Gordon, Leo, Hunt, Toby, Larbaoui, Mustapha, Loveland, Jane, Mudge, Jonathan, Sehra, Harminder, Snow, Catherine, Steward, Charlie, Suner, Marie Marthe, Thomas, Mark, Wilming, Laurens, and Harrow, Jennifer. The role of Havana and communities in the manual curation of unfinished vertebrate genomes. Available from Nature Precedings <http://dx.doi.org/10.1038/npre.2009.3102.1> (2009)

Version info:

Other versions of this document in Nature Precedings

None.

Other versions of this document elsewhere on the web

None known.

Participate

Related Documents

Advertisement