ESG: Extended Similarity Group method for automated protein function prediction
Correspondence: (Login to view email address)
- Purdue University, West Lafayette, IN, USA
- Chung-Ang University, Seoul, Korea
PDF (41.6 KB)
- Document Type:
- Manuscript
- Date:
- Received 15 August 2008 14:35 UTC; Posted 15 August 2008
- Subjects:
- Biotechnology, Ecology, Bioinformatics
- Abstract:
We present here the Extended Similarity Group (ESG) method, which annotates query sequences with Gene Ontology (GO) terms by assigning probability to each annotation computed based on iterative PSI-BLAST searches. Conventionally sequence homology based function annotation methods, such as BLAST, retrieve function information from top hits with a significant score (E-values). In contrast, the PFP method, which we have presented previously, goes one step ahead in utilizing a PSI-BLAST result by considering very weak hits even an E-value of up to 100 and also by incorporating the functional association between GO terms (FAM matrix) computed using term co-occurrence frequencies in the UniProt database. PFP is very successful which is evidenced by the top rank in the function prediction category in CASP7 competition. Our new approach, ESG method, further improves the accuracy of PFP by essentially employing PFP in an iterative fashion. An advantage of ESG is that it is built in a rigorous statistical framework: Unlike PFP method that assigns a weighted score to each GO term, ESG assigns a probability based on weights computed using the E-value of each hit sequence on the path between the original query sequence and the current hit sequence.
- Collection:
- AFP-Biosapiens 2008
Discussion
- Votes:
-
0 votes
- Comments:
-
0 comments
- (Login to share with a colleague)
Additional information
- License:
- This document is licensed to the public under the Creative Commons Attribution 3.0 License
- How to cite this document:
-
Chitale, Meghana, Hawkins, Troy, Park, Changsoon, and Kihara, Daisuke. ESG: Extended Similarity Group method for automated protein function prediction. Available from Nature Precedings <http://dx.doi.org/10.1038/npre.2008.2193.1> (2008)
- Version info:
-
Other versions of this document in Nature Precedings
None.
Other versions of this document elsewhere on the web
None known.