<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:creativeCommons="http://backend.userland.com/creativeCommonsRssModule" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/" version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:media="http://search.yahoo.com/mrss/">
  <channel>
    <title>Nature Precedings - Tag feed for protein function prediction</title>
    <link>http://precedings.nature.com/tags/protein%20function%20prediction</link>
    <description>Recently posted documents tagged with 'protein function prediction'</description>
    <dc:publisher>Nature Publishing Group</dc:publisher>
    <dc:language>en</dc:language>
    <prism:publicationName>Nature Precedings</prism:publicationName>
    <image>
      <title>Nature Precedings</title>
      <url>http://precedings.nature.com/images/header_logo.gif</url>
      <link>http://precedings.nature.com</link>
    </image>
    <atom:link type="application/rss+xml" rel="self" href="http://precedings.nature.com/tags/protein%20function%20prediction/feed"/>
    <item>
      <title>Predicting Protein-Disease Relationships Using Sequence, Physicochemical Properties, and Molecular Function Information</title>
      <link>http://dx.doi.org/10.1038/npre.2008.2239.1</link>
      <description>One of the most important tasks of modern bioinformatics is the development of computational tools that can be used to understand and treat human disease. To date, a variety of methods have been explored and algorithms for predicting whether a protein is involved in disease are gaining in their utility. Here, we describe an algorithm for detecting protein-disease associations based on the human protein-protein interaction network, known gene-disease associations, protein sequence, and protein functional information at the molecular level. Our method, PhenoPred (www.phenopred.org), is supervised: first, we map each protein onto the spaces of disease and functional terms based on distance to all annotated proteins in the protein interaction network. We also encode sequence, function, physicochemical, and predicted structural properties, such as secondary structure and flexibility. We then train support vector machines to detect a protein&#8217;s disease function for a number of terms in Disease Ontology (DO). We provided evidence that, despite the noise/incompleteness of experimental data and unfinished ontology of diseases, identification of candidate genes and proteins can be successful even when a large number of candidate disease terms are predicted on simultaneously.</description>
      <guid>http://dx.doi.org/10.1038/npre.2008.2239.1</guid>
      <pubDate>Fri, 29 Aug 2008 12:44:48 UTC</pubDate>
      <dc:title>Predicting Protein-Disease Relationships Using Sequence, Physicochemical Properties, and Molecular Function Information</dc:title>
      <dc:identifier>doi:10.1038/npre.2008.2239.1</dc:identifier>
      <dc:date>2008-08-29</dc:date>
      <dc:creator>Predrag Radivojac</dc:creator>
      <prism:publicationName>Nature Precedings</prism:publicationName>
      <prism:publicationDate>2008-08-29T12:44:48Z</prism:publicationDate>
      <prism:category>Manuscript</prism:category>
      <prism:section>Bioinformatics</prism:section>
      <media:thumbnail url="http://precedings.nature.com/documents/2239/version/1/files/npre20082239-1.pdf.thumb.png"/>
      <creativeCommons:license>http://creativecommons.org/licenses/by/3.0/</creativeCommons:license>
    </item>
    <item>
      <title>Building Block and Building Rule: Dual Descriptor Method for Biological Sequence Analysis</title>
      <link>http://precedings.nature.com/documents/2223/version/1</link>
      <description>The emergence of &#8220;Systems Biology&#8221; in recent years highlights the systematic viewpoint of bio-system modeling. Building on such a background, Dual Descriptor Method, a generic methodology for biological sequence analysis is proposed. From a systematic perspective, Dual Descriptor is defined as a two element set of Composition Weight Map and Position Weight Function which aim at reflecting the composition and permutation information of a sequence. An alternate training algorithm is provided to get an optimum description of the building patterns of the sequences. In this paper, dual descriptor method has been applied to the analysis of two typical problems of molecular biology: gene identification and the prediction of protein function. Satisfactory and insightful results are achieved. Owing to the generality of this methodology, dual descriptor method has wide application perspective for many problems of pattern recognition, especially those involved in &#8220;Systems Biology&#8221;.</description>
      <guid>http://precedings.nature.com/documents/2223/version/1</guid>
      <pubDate>Tue, 26 Aug 2008 14:24:54 UTC</pubDate>
      <dc:title>Building Block and Building Rule: Dual Descriptor Method for Biological Sequence Analysis</dc:title>
      <dc:identifier>hdl:10101/npre.2008.2223.1</dc:identifier>
      <dc:date>2008-08-26</dc:date>
      <dc:creator>Bin-Guang Ma</dc:creator>
      <prism:publicationName>Nature Precedings</prism:publicationName>
      <prism:publicationDate>2008-08-26T14:24:54Z</prism:publicationDate>
      <prism:category>Manuscript</prism:category>
      <prism:section>Biotechnology</prism:section>
      <prism:section>Bioinformatics</prism:section>
      <media:thumbnail url="http://precedings.nature.com/documents/2223/version/1/files/npre20082223-1.pdf.thumb.png"/>
      <creativeCommons:license>http://creativecommons.org/licenses/by/3.0/</creativeCommons:license>
    </item>
    <item>
      <title>LabelHash: A Flexible and Extensible Method for Matching Structural Motifs</title>
      <link>http://dx.doi.org/10.1038/npre.2008.2199.1</link>
      <description>There is an increasing number of proteins with known structure but unknown function. Determining their function would have a significant impact on understanding diseases and designing new therapeutics. Computational methods can facilitate function determination by identifying proteins that have high structural and chemical similarity. Below, we will briefly describe LabelHash, a new method for partial structure comparison. In partial structure comparison, the goal is to find the best geometric and chemical similarity between a set of 3D points called a motif and a subset of a set of 3D points called the target. Both the motif and targets are represented as sets of labeled 3D points. A motif is ideally composed of the functionally most-relevant residues in a binding site. The labels denote the type of residue. Motif points can have multiple labels to denote that substitutions are allowed. Any subset of the target that has labels that are compatible with the motif&#8217;s labels is called a match. The aim is to find statistically significant matches to a structural motif. Our method preprocesses a background database of targets such as a non-redundant subset of the Protein Data Bank in such a way that we can look up in constant time partial matches to a motif. Using a variant of the previously described match augmentation algorithm (1), we obtain complete matches to our motif. The nonparametric statistical model developed by (2,3) corrects for any bias introduced by our algorithm. This bias is introduced by excluding matches that do not satisfy certain geometric constraints for efficiency reasons. </description>
      <guid>http://dx.doi.org/10.1038/npre.2008.2199.1</guid>
      <pubDate>Fri, 15 Aug 2008 16:25:57 UTC</pubDate>
      <dc:title>LabelHash: A Flexible and Extensible Method for Matching Structural Motifs</dc:title>
      <dc:identifier>doi:10.1038/npre.2008.2199.1</dc:identifier>
      <dc:date>2008-08-15</dc:date>
      <dc:creator>Mark Moll</dc:creator>
      <prism:publicationName>Nature Precedings</prism:publicationName>
      <prism:publicationDate>2008-08-15T16:25:57Z</prism:publicationDate>
      <prism:category>Manuscript</prism:category>
      <prism:section>Bioinformatics</prism:section>
      <media:thumbnail url="http://precedings.nature.com/documents/2199/version/1/files/npre20082199-1.pdf.thumb.png"/>
      <creativeCommons:license>http://creativecommons.org/licenses/by/3.0/</creativeCommons:license>
    </item>
    <item>
      <title>ESG: Extended Similarity Group method for automated protein function prediction</title>
      <link>http://dx.doi.org/10.1038/npre.2008.2193.1</link>
      <description>We present here the Extended Similarity Group (ESG) method, which annotates query sequences with Gene Ontology (GO) terms by assigning probability to each annotation computed based on iterative PSI-BLAST searches. Conventionally sequence homology based function annotation methods, such as BLAST, retrieve function information from top hits with a significant score (E-values). In contrast, the PFP method, which we have presented previously, goes one step ahead in utilizing a PSI-BLAST result by considering very weak hits even an E-value of up to 100 and also by incorporating the functional association between GO terms (FAM matrix) computed using term co-occurrence frequencies in the UniProt database. PFP is very successful which is evidenced by the top rank in the function prediction category in CASP7 competition. Our new approach, ESG method, further improves the accuracy of PFP by essentially employing PFP in an iterative fashion. An advantage of ESG is that it is built in a rigorous statistical framework: Unlike PFP method that assigns a weighted score to each GO term, ESG assigns a probability based on weights computed using the E-value of each hit sequence on the path between the original query sequence and the current hit sequence. </description>
      <guid>http://dx.doi.org/10.1038/npre.2008.2193.1</guid>
      <pubDate>Fri, 15 Aug 2008 16:17:50 UTC</pubDate>
      <dc:title>ESG: Extended Similarity Group method for automated protein function prediction</dc:title>
      <dc:identifier>doi:10.1038/npre.2008.2193.1</dc:identifier>
      <dc:date>2008-08-15</dc:date>
      <dc:creator>Daisuke Kihara</dc:creator>
      <prism:publicationName>Nature Precedings</prism:publicationName>
      <prism:publicationDate>2008-08-15T16:17:50Z</prism:publicationDate>
      <prism:category>Manuscript</prism:category>
      <prism:section>Biotechnology</prism:section>
      <prism:section>Ecology</prism:section>
      <prism:section>Bioinformatics</prism:section>
      <media:thumbnail url="http://precedings.nature.com/documents/2193/version/1/files/npre20082193-1.pdf.thumb.png"/>
      <creativeCommons:license>http://creativecommons.org/licenses/by/3.0/</creativeCommons:license>
    </item>
  </channel>
</rss>
