doi:10.1038/npre.2009.3313.1

Download:

PDF (1 MB)

Embed:

License:

License Kind
Document Type:
Presentation
Date:
Received 03 June 2009 15:35 UTC; Posted 03 June 2009
Subjects:
Genetics & Genomics, Microbiology, Bioinformatics
Tags:
Abstract:

Experienced biocurators can outperform automated systems on specific genes once they determine which pieces of evidence should drive annotation, and which annotations should be spread. The annotation logic may weigh both homology evidence (BLAST matches or HMM hits) and non-homology evidence (neighboring genes, metabolic context, taxonomic group). Unfortunately, the expertise developed to annotate each gene is short-lived, and is mostly lost if the logic driving the annotation is not captured. We report the development of BrainGrab, an interface added to the MANATEE manual annotation tool for prokaryotic genomes. The curator can specify evidence scenarios that should always lead to equivalent annotation for similar genes in similar contexts, and thus create new annotation rules while the expertise is fresh. No special knowledge of programming or protein family construction is required. BrainGrab rules can mix and match evidence types from the large array of existing protein family definitions such as Pfam families, sequence analyses such as SignalP, and contextual clues, that is, the same types of evidence already familiar to experienced biocurators. We have now created an infrastructure for collecting, distributing, interpreting, and applying BrainGrab rules for automated annotation. A rules interpreter combines queries of existing evidence with specified new searches to determine if a rule must fire. If so, the interpreter writes a new piece of rule-based evidence. Once deposited, BrainGrab/RuleBase evidence can provide automated annotation, pathway reconstruction, and even input data for other rules. We demonstrate the system with sets of rules for annotating proteins and pathways of siderophore biosynthesis in human pathogens, for annotating common fusion proteins, and for applying the proper nomenclature to bacterial ribosomal proteins. The chance to harness curatorial expertise for building rules creates a promising avenue for community contributions to improved annotation pipelines.

Collection:
3rd International Biocuration Conference
Presented at:
3rd International Biocuration Conference, 16 April 2009

Discussion

Votes:

2 votes

(Login to vote)

Comments:

0 comments

(Login to post a comment)

(Login to share with a colleague)

Additional information

License:
This document is licensed to the public under the Creative Commons Attribution 3.0 License
How to cite this document:

Haft, Daniel, Basu, Malay, and Richter, Roland. BrainGrab: Capturing Curator Expertise as Reusable Annotation Rules. Available from Nature Precedings <http://dx.doi.org/10.1038/npre.2009.3313.1> (2009)

Version info:

Other versions of this document in Nature Precedings

None.

Other versions of this document elsewhere on the web

None known.

Participate

Related Documents

Advertisement