doi:10.1038/npre.2008.2193.1

ESG: Extended Similarity Group method for automated protein function prediction

Meghana Chitale1, Troy Hawkins1, Changsoon Park2 & Daisuke Kihara1

  1. Purdue University, West Lafayette, IN, USA
  2. Chung-Ang University, Seoul, Korea
Document Type:
Manuscript
Date:
Received 15 August 2008 14:35 UTC; Posted 15 August 2008
Subjects:
Biotechnology, Ecology, Bioinformatics
Abstract:

We present the Extended Similarity Group (ESG) method, which annotates query sequences with Gene Ontology (GO) terms and assigns each annotation a probability computed from iterative PSI-BLAST searches. Conventional sequence-homology-based function annotation methods, such as those built on BLAST, transfer function information only from top hits with a significant score (E-value). In contrast, the PFP method, which we presented previously, goes one step further in exploiting a PSI-BLAST result: it considers very weak hits, even those with an E-value of up to 100, and it incorporates the functional association between GO terms (the FAM matrix), computed from term co-occurrence frequencies in the UniProt database. PFP has proven very successful, as evidenced by its top rank in the function prediction category of the CASP7 competition. Our new approach, the ESG method, further improves on the accuracy of PFP by essentially applying PFP in an iterative fashion. An advantage of ESG is that it is built on a rigorous statistical framework: unlike PFP, which assigns a weighted score to each GO term, ESG assigns a probability based on weights computed from the E-value of each hit sequence on the path between the original query sequence and the current hit sequence.
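
The abstract does not state the weighting formula, so the following is only a minimal sketch of how path-based weights could be turned into GO term probabilities. It assumes that each hit's weight is its shifted score, -log10(E) + 2, normalized over all hits at the same search level (the shift of 2 makes a hit with E-value 100 contribute zero), and that a hit splits its weight evenly between its own annotations and those propagated from its own second-level hits. The function names, the `own_share` parameter, the input layout and the GO term labels are hypothetical and do not come from the manuscript.

```python
import math


def path_weights(e_values, shift=2.0):
    """Normalize shifted -log10(E) scores so that they sum to 1.

    Assumption: shift=2.0 corresponds to an E-value cutoff of 100,
    since -log10(100) + 2 == 0.
    """
    # Floor the E-value to avoid log10(0) for exact matches.
    scores = [max(-math.log10(max(e, 1e-300)) + shift, 0.0) for e in e_values]
    total = sum(scores)
    if total == 0:
        return [0.0 for _ in scores]
    return [s / total for s in scores]


def go_probabilities(hits, own_share=0.5):
    """Return {GO term: probability} for one search level.

    hits: list of dicts with keys
        'e_value'  -- E-value of the hit against the current query
        'go_terms' -- set of GO terms annotated to the hit
        'hits'     -- optional list of hits found when this hit is
                      itself used as the next query (same layout)
    own_share is a hypothetical mixing parameter, not a published value.
    """
    weights = path_weights([h["e_value"] for h in hits])
    probs = {}
    for w, hit in zip(weights, hits):
        children = hit.get("hits", [])
        child_probs = go_probabilities(children, own_share) if children else {}
        # A hit with no further search level keeps its whole weight.
        share = own_share if children else 1.0
        for term in hit["go_terms"]:
            probs[term] = probs.get(term, 0.0) + w * share
        for term, p in child_probs.items():
            probs[term] = probs.get(term, 0.0) + w * (1.0 - share) * p
    return probs


# Toy input with placeholder term labels (not real annotations).
example_hits = [
    {
        "e_value": 1e-8,
        "go_terms": {"GO:term_A"},
        "hits": [{"e_value": 1e-3, "go_terms": {"GO:term_A", "GO:term_B"}}],
    },
    {"e_value": 1.0, "go_terms": {"GO:term_C"}},
]
example_probs = go_probabilities(example_hits)
```

In this toy input the first hit dominates because its E-value is far smaller, so a term supported both by that hit and by its second-level hit receives the largest probability, while a term seen only along a weak path receives a small one.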

Collection:
AFP-Biosapiens 2008

Additional information

License:
This document is licensed to the public under the Creative Commons Attribution 3.0 License
How to cite this document:

Chitale, Meghana, Hawkins, Troy, Park, Changsoon, and Kihara, Daisuke. ESG: Extended Similarity Group method for automated protein function prediction. Available from Nature Precedings <http://dx.doi.org/10.1038/npre.2008.2193.1> (2008)
