<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:creativeCommons="http://backend.userland.com/creativeCommonsRssModule" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/" version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:media="http://search.yahoo.com/mrss/">
  <channel>
    <title>Nature Precedings - Collection feed for AFP-Biosapiens 2008</title>
    <link>http://precedings.nature.com/collections/afp-biosapiens-2008/</link>
    <description>Recently posted documents in AFP-Biosapiens 2008</description>
    <dc:publisher>Nature Publishing Group</dc:publisher>
    <dc:language>en</dc:language>
    <prism:publicationName>Nature Precedings</prism:publicationName>
    <image>
      <title>Nature Precedings</title>
      <url>http://precedings.nature.com/images/header_logo.gif</url>
      <link>http://precedings.nature.com</link>
    </image>
    <atom:link type="application/rss+xml" rel="self" href="http://precedings.nature.com/collections/afp-biosapiens-2008/feed"/>
    <item>
      <title>Predicting Protein-Disease Relationships Using Sequence, Physicochemical Properties, and Molecular Function Information</title>
      <link>http://dx.doi.org/10.1038/npre.2008.2239.1</link>
      <description>One of the most important tasks of modern bioinformatics is the development of computational tools that can be used to understand and treat human disease. To date, a variety of methods have been explored and algorithms for predicting whether a protein is involved in disease are gaining in their utility. Here, we describe an algorithm for detecting protein-disease associations based on the human protein-protein interaction network, known gene-disease associations, protein sequence, and protein functional information at the molecular level. Our method, PhenoPred (www.phenopred.org), is supervised: first, we map each protein onto the spaces of disease and functional terms based on distance to all annotated proteins in the protein interaction network. We also encode sequence, function, physicochemical, and predicted structural properties, such as secondary structure and flexibility. We then train support vector machines to detect a protein&#8217;s disease function for a number of terms in Disease Ontology (DO). We provided evidence that, despite the noise/incompleteness of experimental data and unfinished ontology of diseases, identification of candidate genes and proteins can be successful even when a large number of candidate disease terms are predicted on simultaneously.</description>
      <guid>http://dx.doi.org/10.1038/npre.2008.2239.1</guid>
      <pubDate>Fri, 29 Aug 2008 12:44:48 UTC</pubDate>
      <dc:title>Predicting Protein-Disease Relationships Using Sequence, Physicochemical Properties, and Molecular Function Information</dc:title>
      <dc:identifier>doi:10.1038/npre.2008.2239.1</dc:identifier>
      <dc:date>2008-08-29</dc:date>
      <dc:creator>Predrag Radivojac</dc:creator>
      <prism:publicationName>Nature Precedings</prism:publicationName>
      <prism:publicationDate>2008-08-29T12:44:48Z</prism:publicationDate>
      <prism:category>Manuscript</prism:category>
      <prism:section>Bioinformatics</prism:section>
      <media:thumbnail url="http://precedings.nature.com/documents/2239/version/1/files/npre20082239-1.pdf.thumb.png"/>
      <creativeCommons:license>http://creativecommons.org/licenses/by/3.0/</creativeCommons:license>
    </item>
    <item>
      <title>Assessing functional novelty of PSI structures via structure-function analysis of large and diverse superfamilies</title>
      <link>http://dx.doi.org/10.1038/npre.2008.2214.1</link>
      <description>The structural genomics initiatives have had as one of their aims to improve our understanding of protein function by providing representative structures for many structurally uncharacterised protein families. As suggested by the recent assessment of the Protein Structure Initiative (Structural Genomics Initiative, funded by the NIH), doubts have arisen as to whether Structural Genomics as initially planned were really beneficial to our understanding of biological issues, and in particular of protein function.A few protein domain superfamilies have been shown to account for unexpectedly large numbers of proteins encoded in fully sequenced genomes.  These large superfamilies are generally very diverse, spanning a wide range of functions, both in terms of molecular activities and biological processes. Some of these superfamilies, such as the Rossmann-fold P-loop nucleotide hydrolases or the TIM-barrel glycosidases,  have been the subject of extensive structural studies which in turn have shed light on how evolution of the sequence and structure properties produce functional diversity amongst homologues. Recently, the Structure-Function Linkage Database (SFLD) has been setup with the aim of helping the study of structure-function correlations in such superfamilies.  Since the evolutionary success of these large superfamilies suggests biological importance, several Structural Genomics Centers have focused on providing full structural coverage for representatives of all sequence families in these superfamilies.In this work we evaluate structure/function diversity in a set of these large superfamilies and attempt to assess the quality and quantity of biological information gained from Structural Genomics.</description>
      <guid>http://dx.doi.org/10.1038/npre.2008.2214.1</guid>
      <pubDate>Fri, 22 Aug 2008 20:02:36 UTC</pubDate>
      <dc:title>Assessing functional novelty of PSI structures via structure-function analysis of large and diverse superfamilies</dc:title>
      <dc:identifier>doi:10.1038/npre.2008.2214.1</dc:identifier>
      <dc:date>2008-08-22</dc:date>
      <dc:creator>Benoit H. Dessailly</dc:creator>
      <prism:publicationName>Nature Precedings</prism:publicationName>
      <prism:publicationDate>2008-08-22T20:02:36Z</prism:publicationDate>
      <prism:category>Manuscript</prism:category>
      <prism:section>Bioinformatics</prism:section>
      <media:thumbnail url="http://precedings.nature.com/documents/2214/version/1/files/npre20082214-1.pdf.thumb.png"/>
      <creativeCommons:license>http://creativecommons.org/licenses/by/3.0/</creativeCommons:license>
    </item>
    <item>
      <title>Prediction of Functional Sites in SCOP Domains using Dynamics Perturbation Analysis</title>
      <link>http://dx.doi.org/10.1038/npre.2008.2209.1</link>
      <description>Dynamics perturbation analysis (DPA) finds regions in a protein structure where proteins are &amp;#8220;ticklish&amp;#8221;, i.e., where interactions cause a large change in protein dynamics. Previously, such regions were shown to predict the location of native binding sites in a docking test set, but the more general applicability of DPA to the prediction of functional sites in proteins was not shown. Here we describe the results of applying an accelerated algorithm, called Fast DPA, to predict functional sites in over 50,000 SCOP domains.</description>
      <guid>http://dx.doi.org/10.1038/npre.2008.2209.1</guid>
      <pubDate>Thu, 21 Aug 2008 17:23:10 UTC</pubDate>
      <dc:title>Prediction of Functional Sites in SCOP Domains using Dynamics Perturbation Analysis</dc:title>
      <dc:identifier>doi:10.1038/npre.2008.2209.1</dc:identifier>
      <dc:date>2008-08-21</dc:date>
      <dc:creator>Judith D. Cohn</dc:creator>
      <prism:publicationName>Nature Precedings</prism:publicationName>
      <prism:publicationDate>2008-08-21T17:23:10Z</prism:publicationDate>
      <prism:category>Manuscript</prism:category>
      <prism:section>Bioinformatics</prism:section>
      <media:thumbnail url="http://precedings.nature.com/documents/2209/version/1/files/npre20082209-1.pdf.thumb.png"/>
      <creativeCommons:license>http://creativecommons.org/licenses/by/3.0/</creativeCommons:license>
    </item>
    <item>
      <title>A Systematic Approach to Identifying Protein-Ligand Binding Profiles on a Proteome Scale</title>
      <link>http://dx.doi.org/10.1038/npre.2008.2207.1</link>
      <description>Identification of protein-ligand interaction networks on a proteome scale is crucial to address a wide range of biological problems such as correlating molecular functions to physiological processes and designing safe and efficient therapeutics. We have developed a novel computational strategy to identify ligand binding profiles of proteins across gene families and applied it to predicting protein functions, elucidating molecular mechanisms of drug adverse effects, and repositioning safe pharmaceuticals to treat different diseases.</description>
      <guid>http://dx.doi.org/10.1038/npre.2008.2207.1</guid>
      <pubDate>Wed, 20 Aug 2008 20:29:38 UTC</pubDate>
      <dc:title>A Systematic Approach to Identifying Protein-Ligand Binding Profiles on a Proteome Scale</dc:title>
      <dc:identifier>doi:10.1038/npre.2008.2207.1</dc:identifier>
      <dc:date>2008-08-20</dc:date>
      <dc:creator>Lei Xie</dc:creator>
      <prism:publicationName>Nature Precedings</prism:publicationName>
      <prism:publicationDate>2008-08-20T20:29:38Z</prism:publicationDate>
      <prism:category>Manuscript</prism:category>
      <prism:section>Biotechnology</prism:section>
      <prism:section>Bioinformatics</prism:section>
      <media:thumbnail url="http://precedings.nature.com/documents/2207/version/1/files/npre20082207-1.pdf.thumb.png"/>
      <creativeCommons:license>http://creativecommons.org/licenses/by/3.0/</creativeCommons:license>
    </item>
    <item>
      <title>LabelHash: A Flexible and Extensible Method for Matching Structural Motifs</title>
      <link>http://dx.doi.org/10.1038/npre.2008.2199.1</link>
      <description>There is an increasing number of proteins with known structure but unknown function. Determining their function would have a significant impact on understanding diseases and designing new therapeutics. Computational methods can facilitate function determination by identifying proteins that have high structural and chemical similarity. Below, we will briefly describe LabelHash, a new method for partial structure comparison. In partial structure comparison, the goal is to find the best geometric and chemical similarity between a set of 3D points called a motif and a subset of a set of 3D points called the target. Both the motif and targets are represented as sets of labeled 3D points. A motif is ideally composed of the functionally most-relevant residues in a binding site. The labels denote the type of residue. Motif points can have multiple labels to denote that substitutions are allowed. Any subset of the target that has labels that are compatible with the motif&#8217;s labels is called a match. The aim is to find statistically significant matches to a structural motif. Our method preprocesses a background database of targets such as a non-redundant subset of the Protein Data Bank in such a way that we can look up in constant time partial matches to a motif. Using a variant of the previously described match augmentation algorithm (1), we obtain complete matches to our motif. The nonparametric statistical model developed by (2,3) corrects for any bias introduced by our algorithm. This bias is introduced by excluding matches that do not satisfy certain geometric constraints for efficiency reasons. </description>
      <guid>http://dx.doi.org/10.1038/npre.2008.2199.1</guid>
      <pubDate>Fri, 15 Aug 2008 16:25:57 UTC</pubDate>
      <dc:title>LabelHash: A Flexible and Extensible Method for Matching Structural Motifs</dc:title>
      <dc:identifier>doi:10.1038/npre.2008.2199.1</dc:identifier>
      <dc:date>2008-08-15</dc:date>
      <dc:creator>Mark Moll</dc:creator>
      <prism:publicationName>Nature Precedings</prism:publicationName>
      <prism:publicationDate>2008-08-15T16:25:57Z</prism:publicationDate>
      <prism:category>Manuscript</prism:category>
      <prism:section>Bioinformatics</prism:section>
      <media:thumbnail url="http://precedings.nature.com/documents/2199/version/1/files/npre20082199-1.pdf.thumb.png"/>
      <creativeCommons:license>http://creativecommons.org/licenses/by/3.0/</creativeCommons:license>
    </item>
    <item>
      <title>ESG: Extended Similarity Group method for automated protein function prediction</title>
      <link>http://dx.doi.org/10.1038/npre.2008.2193.1</link>
      <description>We present here the Extended Similarity Group (ESG) method, which annotates query sequences with Gene Ontology (GO) terms by assigning probability to each annotation computed based on iterative PSI-BLAST searches. Conventionally sequence homology based function annotation methods, such as BLAST, retrieve function information from top hits with a significant score (E-values). In contrast, the PFP method, which we have presented previously, goes one step ahead in utilizing a PSI-BLAST result by considering very weak hits even an E-value of up to 100 and also by incorporating the functional association between GO terms (FAM matrix) computed using term co-occurrence frequencies in the UniProt database. PFP is very successful which is evidenced by the top rank in the function prediction category in CASP7 competition. Our new approach, ESG method, further improves the accuracy of PFP by essentially employing PFP in an iterative fashion. An advantage of ESG is that it is built in a rigorous statistical framework: Unlike PFP method that assigns a weighted score to each GO term, ESG assigns a probability based on weights computed using the E-value of each hit sequence on the path between the original query sequence and the current hit sequence. </description>
      <guid>http://dx.doi.org/10.1038/npre.2008.2193.1</guid>
      <pubDate>Fri, 15 Aug 2008 16:17:50 UTC</pubDate>
      <dc:title>ESG: Extended Similarity Group method for automated protein function prediction</dc:title>
      <dc:identifier>doi:10.1038/npre.2008.2193.1</dc:identifier>
      <dc:date>2008-08-15</dc:date>
      <dc:creator>Daisuke Kihara</dc:creator>
      <prism:publicationName>Nature Precedings</prism:publicationName>
      <prism:publicationDate>2008-08-15T16:17:50Z</prism:publicationDate>
      <prism:category>Manuscript</prism:category>
      <prism:section>Biotechnology</prism:section>
      <prism:section>Ecology</prism:section>
      <prism:section>Bioinformatics</prism:section>
      <media:thumbnail url="http://precedings.nature.com/documents/2193/version/1/files/npre20082193-1.pdf.thumb.png"/>
      <creativeCommons:license>http://creativecommons.org/licenses/by/3.0/</creativeCommons:license>
    </item>
    <item>
      <title>Analysis of Genetic Interaction Maps Reveals Functional Pleiotropy</title>
      <link>http://dx.doi.org/10.1038/npre.2008.2182.1</link>
      <description>Epistatic or genetic interactions, representing the effects of mutations on the phenotypes caused by other mutations, can be very helpful for uncovering functional relationships between genes. Recently, the Epistasis Miniarray Profile (E-MAP) method has emerged as a powerful approach for identifying such interactions systematically. As part of this approach, hierarchical clustering is used to partition genes into groups on the basis of the similarity between their global interaction profiles.  Here we present an original biclustering algorithm for identifying groups of functionally related genes from E-MAP data in a manner that allows individual genes to be assigned to more than one functional group. This enables investigation of the pleiotropic nature of gene function, a goal that cannot be achieved with hierarchical clustering. The performance of our algorithm is illustrated by applying it to two E-MAP datasets and an E-MAP-like in silico dataset for the yeast S. cerevisiae. In addition to identifying the majority of the functional modules reported in these studies, our algorithm uncovers many recently documented and novel multi-functional relationships between genes and gene groups.</description>
      <guid>http://dx.doi.org/10.1038/npre.2008.2182.1</guid>
      <pubDate>Thu, 14 Aug 2008 21:45:30 UTC</pubDate>
      <dc:title>Analysis of Genetic Interaction Maps Reveals Functional Pleiotropy</dc:title>
      <dc:identifier>doi:10.1038/npre.2008.2182.1</dc:identifier>
      <dc:date>2008-08-14</dc:date>
      <dc:creator>Shuye Pu</dc:creator>
      <prism:publicationName>Nature Precedings</prism:publicationName>
      <prism:publicationDate>2008-08-14T21:45:30Z</prism:publicationDate>
      <prism:category>Manuscript</prism:category>
      <prism:section>Bioinformatics</prism:section>
      <media:thumbnail url="http://precedings.nature.com/documents/2182/version/1/files/npre20082182-1.pdf.thumb.png"/>
      <creativeCommons:license>http://creativecommons.org/licenses/by/3.0/</creativeCommons:license>
    </item>
    <item>
      <title>Safe Functional Inference for Uncharacterized Viral Proteins</title>
      <link>http://dx.doi.org/10.1038/npre.2008.2187.1</link>
      <description>The explosive growth in the number of sequenced genomes has created a flood of protein sequences with unknown structure and function. A routine protocol for functional inference on an input query sequence is based on a database search for homologues. Searching a query against a non-redundant database using BLAST (or more advanced methods, e.g. PSI-BLAST) suffers from several drawbacks: (i) a local alignment often dominates the results; (ii) the reported statistical score (i.e. E-value) is often misleading; (iii) incorrect annotations may be falsely propagated. Several systematic methods are commonly used to assign sequences with functions on a genomic scale. In Pfam (1) and resources alike, statistical profiles (HMMs) are built from semi-manual multiple alignments of seed homologous sequences. The profiles are then used to scan genomic sequences for additional family members. The drawbacks of this scheme are: (i) only families with a predetermined seed are considered; (ii) the query must have a detectable sequence similarity to seed sequences; (iii) attention to internal relationships among the family members or the relations to other families is lacking; (iv) family membership is often set by pre-determined thresholds.An alternative to profile or model based methods for functional inference relies on a hierarchical clustering of the protein space, as implemented in the ProtoNet approach (2). The fundamental principle is the creation of a tree that captures evolutionary relatedness among protein families. The tree construction is fully automatic, and is based only on reported BLAST similarities among clustered sequences. The tree provides protein groupings in continuous evolutionary granularities, from closely related to distant superfamilies. Clusters in the ProtoNet tree show high correspondence with homologous sequence (i.e. Pfam and InterPro), functional (i.e. E.C. classification) and structural (i.e., SCOP) families (3). A new clustering scheme (4) has provided an extensive update to the ProtoNet process, which is now based on direct clustering of all detectable sequence similarities. Herein, we use the ProtoNet resource to develop a methodology for a consistent and safe functional inference for remote families. We illustrate the success of our approach towards clusters of poorly characterized viral proteins. Viral sequences are characterized by a rapid evolutionary rate which drives viral families to be even more remote (sequence-similarity-wise). Thus, functional inference for viral families is apparently an unsolved task. Despite this inherent difficulty, the new ProtoNet tree scaffold reliably captures weak evolutionary connections for viral families, which were previously overlooked. We take advantage of this, and propose new functional assignments for viral protein families.</description>
      <guid>http://dx.doi.org/10.1038/npre.2008.2187.1</guid>
      <pubDate>Thu, 14 Aug 2008 21:43:29 UTC</pubDate>
      <dc:title>Safe Functional Inference for Uncharacterized Viral Proteins</dc:title>
      <dc:identifier>doi:10.1038/npre.2008.2187.1</dc:identifier>
      <dc:date>2008-08-14</dc:date>
      <dc:creator>Michal Linial</dc:creator>
      <prism:publicationName>Nature Precedings</prism:publicationName>
      <prism:publicationDate>2008-08-14T21:43:29Z</prism:publicationDate>
      <prism:category>Manuscript</prism:category>
      <prism:section>Bioinformatics</prism:section>
      <media:thumbnail url="http://precedings.nature.com/documents/2187/version/1/files/npre20082187-1.pdf.thumb.png"/>
      <creativeCommons:license>http://creativecommons.org/licenses/by/3.0/</creativeCommons:license>
    </item>
    <item>
      <title>Data mining of protein families using common peptides</title>
      <link>http://dx.doi.org/10.1038/npre.2008.2189.1</link>
      <description>Predicting the function of a protein from its sequence is typically addressed using sequence-similarity. Here we propose a motif-based approach, using supervised motif extraction from protein sequences belonging to one functional family. The resulting deterministic motifs form Common Peptides (CPs) that characterize this family, allow for data mining of its proteins and facilitate further partition of the family into clusters</description>
      <guid>http://dx.doi.org/10.1038/npre.2008.2189.1</guid>
      <pubDate>Thu, 14 Aug 2008 14:14:36 UTC</pubDate>
      <dc:title>Data mining of protein families using common peptides</dc:title>
      <dc:identifier>doi:10.1038/npre.2008.2189.1</dc:identifier>
      <dc:date>2008-08-14</dc:date>
      <dc:creator>Assaf Gottlieb</dc:creator>
      <prism:publicationName>Nature Precedings</prism:publicationName>
      <prism:publicationDate>2008-08-14T14:14:36Z</prism:publicationDate>
      <prism:category>Manuscript</prism:category>
      <prism:section>Bioinformatics</prism:section>
      <media:thumbnail url="http://precedings.nature.com/documents/2189/version/1/files/npre20082189-1.pdf.thumb.png"/>
      <creativeCommons:license>http://creativecommons.org/licenses/by/3.0/</creativeCommons:license>
    </item>
    <item>
      <title>Association Analysis Techniques for Discovering Functional Modules from Microarray Data</title>
      <link>http://dx.doi.org/10.1038/npre.2008.2184.1</link>
      <description>An application of great interest in microarray data analysis is the identification of a group of genes that show very similar patterns of expression in a data set, and are expected to represent groups of genes that perform common/similar functions, also known as functional modules. Although clustering offers a natural solution to this problem, it suffers from the limitation that it uses all the conditions to compare two genes, whereas only a subset of them may be relevant. Association analysis offers an alternative route for finding such groups of genes that may be co-expressed only over a subset of the experimental conditions used to prepare the data set. The techniques in this field attempt to find groups of data objects that contain coherent values across a set of attributes, in an exhaustive and efficient manner. In this paper, we illustrate how a generalization of the techniques in association analysis for real-valued data can be utilized to extract coherent functional modules from large microarray data sets.</description>
      <guid>http://dx.doi.org/10.1038/npre.2008.2184.1</guid>
      <pubDate>Wed, 13 Aug 2008 22:42:34 UTC</pubDate>
      <dc:title>Association Analysis Techniques for Discovering Functional Modules from Microarray Data</dc:title>
      <dc:identifier>doi:10.1038/npre.2008.2184.1</dc:identifier>
      <dc:date>2008-08-13</dc:date>
      <dc:creator>Gaurav Pandey</dc:creator>
      <prism:publicationName>Nature Precedings</prism:publicationName>
      <prism:publicationDate>2008-08-13T22:42:34Z</prism:publicationDate>
      <prism:category>Manuscript</prism:category>
      <prism:section>Genetics &amp; Genomics</prism:section>
      <prism:section>Molecular Cell Biology</prism:section>
      <prism:section>Bioinformatics</prism:section>
      <media:thumbnail url="http://precedings.nature.com/documents/2184/version/1/files/npre20082184-1.pdf.thumb.png"/>
      <creativeCommons:license>http://creativecommons.org/licenses/by/3.0/</creativeCommons:license>
    </item>
  </channel>
</rss>
