<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:creativeCommons="http://backend.userland.com/creativeCommonsRssModule" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/" version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:media="http://search.yahoo.com/mrss/">
  <channel>
    <title>Nature Precedings - Tag feed for annotation</title>
    <link>http://precedings.nature.com/tags/annotation</link>
    <description>Recently posted documents tagged with 'annotation'</description>
    <dc:publisher>Nature Publishing Group</dc:publisher>
    <dc:language>en</dc:language>
    <prism:publicationName>Nature Precedings</prism:publicationName>
    <image>
      <title>Nature Precedings</title>
      <url>http://precedings.nature.com/images/header_logo.gif</url>
      <link>http://precedings.nature.com</link>
    </image>
    <atom:link type="application/rss+xml" rel="self" href="http://precedings.nature.com/tags/annotation/feed"/>
    <item>
      <title>Annotation-based meta-analysis of microarray experiments</title>
      <link>http://dx.doi.org/10.1038/npre.2009.3569.1</link>
      <description>We are developing software applications to perform meta-analysis of microarray experiments based on standardized experiment annotations aiming to identify similar experiments and cluster experiments. The applications were tested on files obtained from the ArrayExpress public repository. Annotation terms were used to compute experiment dissimilarities to find experiments related to a query experiment. These applications may motivate efforts of bench biologists to better annotate experiments.</description>
      <guid>http://dx.doi.org/10.1038/npre.2009.3569.1</guid>
      <pubDate>Thu, 06 Aug 2009 20:01:12 UTC</pubDate>
      <dc:title>Annotation-based meta-analysis of microarray experiments</dc:title>
      <dc:identifier>doi:10.1038/npre.2009.3569.1</dc:identifier>
      <dc:date>2009-08-06</dc:date>
      <dc:creator>Jie Zheng</dc:creator>
      <prism:publicationName>Nature Precedings</prism:publicationName>
      <prism:publicationDate>2009-08-06T20:01:12Z</prism:publicationDate>
      <prism:category>Poster</prism:category>
      <prism:section>Bioinformatics</prism:section>
      <media:thumbnail url="http://precedings.nature.com/documents/3569/version/1/files/npre20093569-1.pdf.thumb.png"/>
      <creativeCommons:license>http://creativecommons.org/licenses/by/3.0/</creativeCommons:license>
    </item>
    <item>
      <title>Using the Gene Ontology to Annotate Biomedical Journal Articles</title>
      <link>http://dx.doi.org/10.1038/npre.2009.3556.1</link>
      <description>We are creating a gold-standard corpus of manually annotated full-text biomedical journal articles toward natural-language-processing applications.  Central to this is our use of entire ontologies of the Open Biomedical Ontologies initiative as well as other terminologies as term sources, in contrast to most other such annotation projects, which have used small, ad hoc schemas.  In addition to the standard difficulties in such annotation projects, each of the terminologies we have used has idiosyncrasies and ambiguities that present further challenges to consistent, high-quality annotation of these articles.  In this paper we present and discuss the most salient of these with regard to the Gene Ontology that we have encountered and addressed in our annotation guidelines and training.  The utility of these guidelines can be seen in the high and still-increasing interannotator-agreement statistics that we continually monitor.</description>
      <guid>http://dx.doi.org/10.1038/npre.2009.3556.1</guid>
      <pubDate>Thu, 06 Aug 2009 14:33:25 UTC</pubDate>
      <dc:title>Using the Gene Ontology to Annotate Biomedical Journal Articles</dc:title>
      <dc:identifier>doi:10.1038/npre.2009.3556.1</dc:identifier>
      <dc:date>2009-08-06</dc:date>
      <dc:creator>Michael Bada</dc:creator>
      <prism:publicationName>Nature Precedings</prism:publicationName>
      <prism:publicationDate>2009-08-06T14:33:25Z</prism:publicationDate>
      <prism:category>Manuscript</prism:category>
      <prism:section>Bioinformatics</prism:section>
      <media:thumbnail url="http://precedings.nature.com/documents/3556/version/1/files/npre20093556-1.pdf.thumb.png"/>
      <creativeCommons:license>http://creativecommons.org/licenses/by/3.0/</creativeCommons:license>
    </item>
    <item>
      <title>Desiderata for an ontology of diseases for the annotation of biological datasets.</title>
      <link>http://precedings.nature.com/documents/3531/version/1</link>
      <description>There is a plethora of disease ontologies available, all potentially useful for the annotation of biological datasets. We define seven desirable features for such ontologies and examine whether or not these features are supported by eleven disease ontologies. The four ontologies most closely aligned with our desiderata are Disease Ontology, SNOMED CT, NCI thesaurus and UMLS.</description>
      <guid>http://precedings.nature.com/documents/3531/version/1</guid>
      <pubDate>Mon, 03 Aug 2009 14:04:38 UTC</pubDate>
      <dc:title>Desiderata for an ontology of diseases for the annotation of biological datasets.</dc:title>
      <dc:identifier>hdl:10101/npre.2009.3531.1</dc:identifier>
      <dc:date>2009-08-03</dc:date>
      <dc:creator>Olivier Bodenreider</dc:creator>
      <prism:publicationName>Nature Precedings</prism:publicationName>
      <prism:publicationDate>2009-08-03T14:04:38Z</prism:publicationDate>
      <prism:category>Manuscript</prism:category>
      <prism:section>Bioinformatics</prism:section>
      <media:thumbnail url="http://precedings.nature.com/documents/3531/version/1/files/npre20093531-1.pdf.thumb.png"/>
      <creativeCommons:license>http://creativecommons.org/licenses/by/3.0/</creativeCommons:license>
    </item>
    <item>
      <title>The Eukaryote Genome Annotation Platform at Genoscope</title>
      <link>http://dx.doi.org/10.1038/npre.2009.3457.1</link>
      <description>The Genoscope annotation workflow for eukaryote genomes relies on evidence from ab initio gene models predictions combined with homology searches, using collections of expressed sequences &amp;#8211; full length cDNAs, ESTs or massive-scale mRNA sequences from the same or closely related organisms &#8211; proteins or other genomic sequences. Global analysis of these drafts or complete sequences are then combining both approaches in the form of gene prediction data integration using GAZE, capable to identify a majority of the existing gene features. Although of very good quality, gene-modelling remains still tentative at the end of the process. Even though computational predictors are useful on large scale annotation for global genomics analysis, there is no complete genome for which all gene structures, in terms of exons, introns and coding regions, have been experimentally confirmed.Finished genomes can provide exciting insights into the genome organization and evolution. Additional experimental data generated by genome sequencing projects give assistance to genome annotation aiming to a better understanding of the biology of the organism. Therefore, gene models and annotation can be improved by human curation to find errors or to resolve incongruous evidence on the automatic annotation of the genome. We now provide to collaborators carrying sequencing projects with a distributed annotation platform allowing expert evaluation of the annotation, in addition to our automated gene prediction pipeline.To ensure at most the participation of the scientific community, an annotation tool for revising annotations has been set up using components of the Generic Model Organism Database toolkit, which provides tools for managing organism databases. A CHADO database, linked to an Apollo graphical interface, permit users to correct gene structures and store them in a dedicated organism database, as we will show on a few examples. Such a tool would facilitate connecting and comparing predicted annotations with existing biological data, becoming the repository of complete annotated finished genome sequence. </description>
      <guid>http://dx.doi.org/10.1038/npre.2009.3457.1</guid>
      <pubDate>Fri, 24 Jul 2009 15:28:24 UTC</pubDate>
      <dc:title>The Eukaryote Genome Annotation Platform at Genoscope</dc:title>
      <dc:identifier>doi:10.1038/npre.2009.3457.1</dc:identifier>
      <dc:date>2009-07-24</dc:date>
      <dc:creator>Betina M. Porcel</dc:creator>
      <prism:publicationName>Nature Precedings</prism:publicationName>
      <prism:publicationDate>2009-07-24T15:28:24Z</prism:publicationDate>
      <prism:category>Poster</prism:category>
      <prism:section>Genetics &amp; Genomics</prism:section>
      <prism:section>Bioinformatics</prism:section>
      <media:thumbnail url="http://precedings.nature.com/documents/3457/version/1/files/npre20093457-1.pdf.thumb.png"/>
      <creativeCommons:license>http://creativecommons.org/licenses/by/3.0/</creativeCommons:license>
    </item>
    <item>
      <title>A simple clustering of the BioModels database using semanticSBML</title>
      <link>http://dx.doi.org/10.1038/npre.2009.3444.1</link>
      <description>The BioModels database contains biochemical network models in SBML format, in which the biochemical meaning of elements is specified by MIRIAM-compliant RDF annotations. We used these annotations to define a similarity measure for models, scoring the overlap of the biochemical systems described. Based on this score, we used two-way clustering to detect groups of similar models and groups of co-occuring model elements. To recognize and compare biochemical elements, we used routines from the software semanticSBML. A Python script extracts all MIRIAM annotations (regardless of their qualifiers) using the semanticSBML annotation classes. The result is a matrix in which the rows represent the models (e.g. BioModel 001), while the columns represent specific annotations (e.g. urn:miriam:reactome:REACT_15422). A matrix element is set to 1 if an identifier occurs in a model and to 0 otherwise. This matrix was used as an input for a hierarchical clustering algorithm (implemented in Matlab) and the clustered matrix was visualized. Model clustering allows to detect models describing similar biochemical processes (e.g. glycolysis) and their specific common elements. This may help to find candidate models for completing a given initial model, which could then be merged using semanticSBML.</description>
      <guid>http://dx.doi.org/10.1038/npre.2009.3444.1</guid>
      <pubDate>Mon, 20 Jul 2009 18:16:55 UTC</pubDate>
      <dc:title>A simple clustering of the BioModels database using semanticSBML</dc:title>
      <dc:identifier>doi:10.1038/npre.2009.3444.1</dc:identifier>
      <dc:date>2009-07-20</dc:date>
      <dc:creator>Wolfram Liebermeister</dc:creator>
      <prism:publicationName>Nature Precedings</prism:publicationName>
      <prism:publicationDate>2009-07-20T18:16:55Z</prism:publicationDate>
      <prism:category>Poster</prism:category>
      <prism:section>Bioinformatics</prism:section>
      <media:thumbnail url="http://precedings.nature.com/documents/3444/version/1/files/npre20093444-1.pdf.thumb.png"/>
      <creativeCommons:license>http://creativecommons.org/licenses/by/3.0/</creativeCommons:license>
    </item>
    <item>
      <title>BrainGrab: Capturing Curator Expertise as Reusable Annotation Rules</title>
      <link>http://dx.doi.org/10.1038/npre.2009.3313.1</link>
      <description>Experienced biocurators can outperform automated systems on specific genes once they determine which pieces of evidence should drive annotation, and which annotations should be spread. The annotation logic may weigh both homology evidence (BLAST matches or HMM hits) and non-homology evidence (neighboring genes, metabolic context, taxonomic group). Unfortunately, the expertise developed to annotate each gene is short-lived, and is mostly lost if the logic driving the annotation is not captured. We report the development of BrainGrab, an interface added to the MANATEE manual annotation tool for prokaryotic genomes. The curator can specify evidence scenarios that should always lead to equivalent annotation for similar genes in similar contexts, and thus create new annotation rules while the expertise is fresh. No special knowledge of programming or protein family construction is required. BrainGrab rules can mix and match evidence types from the large array of existing protein family definitions such as Pfam families, sequence analyses such as SignalP, and contextual clues, that is, the same types of evidence already familiar to experienced biocurators. We have now created an infrastructure for collecting, distributing, interpreting, and applying BrainGrab rules for automated annotation. A rules interpreter combines queries of existing evidence with specified new searches to determine if a rule must fire. If so, the interpreter writes a new piece of rule-based evidence. Once deposited, BrainGrab/RuleBase evidence can provide automated annotation, pathway reconstruction, and even input data for other rules. We demonstrate the system with sets of rules for annotating proteins and pathways of siderophore biosynthesis in human pathogens, for annotating common fusion proteins, and for applying the proper nomenclature to bacterial ribosomal proteins. The chance to harness curatorial expertise for building rules creates a promising avenue for community contributions to improved annotation pipelines.</description>
      <guid>http://dx.doi.org/10.1038/npre.2009.3313.1</guid>
      <pubDate>Wed, 03 Jun 2009 15:52:03 UTC</pubDate>
      <dc:title>BrainGrab: Capturing Curator Expertise as Reusable Annotation Rules</dc:title>
      <dc:identifier>doi:10.1038/npre.2009.3313.1</dc:identifier>
      <dc:date>2009-06-03</dc:date>
      <dc:creator>Daniel H. Haft</dc:creator>
      <prism:publicationName>Nature Precedings</prism:publicationName>
      <prism:publicationDate>2009-06-03T15:52:03Z</prism:publicationDate>
      <prism:category>Presentation</prism:category>
      <prism:section>Genetics &amp; Genomics</prism:section>
      <prism:section>Microbiology</prism:section>
      <prism:section>Bioinformatics</prism:section>
      <media:thumbnail url="http://precedings.nature.com/documents/3313/version/1/files/npre20093313-1.pdf.thumb.png"/>
      <creativeCommons:license>http://creativecommons.org/licenses/by/3.0/</creativeCommons:license>
    </item>
    <item>
      <title>GUDMAP &amp;#8211; An Online GenitoUrinary Resource</title>
      <link>http://dx.doi.org/10.1038/npre.2009.3297.1</link>
      <description>The GenitoUrinary Development Molecular Anatomy Project (GUDMAP) is a consortium of laboratories working to provide the scientific and medical community with gene expression data and tools to facilitate research (see www.gudmap.org). The data provided by GUDMAP includes large in situ hybridization screens (wholemount and section) and expression microarray analysis of components of the developing mouse urogenital system (including laser-captured material and FACS-isolated cells from transgenic reporter mice). In addition, a high-resolution anatomy ontology has been developed by members of the GUDMAP consortium to describe the subcompartments of the developing murine genitourinary tract. The GUDMAP Database Development Team and Editorial Office &amp;#8211; both based in Edinburgh &amp;#8211; function to ensure submission, curation, storage and presentation of the data submitted by the GUDMAP consortium. Our collective aim is twofold: 1) to simplify the process of submission so that data is publically available as soon as it is produced; and 2) to organize this information in a database and ensure that the online interface is continuously available and easy to use. Thus far, we have developed a range of tools that help both the submitter and the end user. These include: an online annotation tool that simplifies in situ data submission through an ontology-based graphical user interface; a database interface that allows users to browse and query expression data, and to filter data by organ system; a heat-map display of microarray data and analyses. Furthermore, the Edinburgh team has developed a GUDMAP Disease Database that queries associations between genes, genitourinary diseases, and renal/urinary and reproductive phenotypes. In collaboration with GUDMAP consortium members at the CCHMC (Cincinnati Children&amp;#8217;s Hospital Medical Center), the Disease Database is being extended to include mammalian phenotypes mapped to OMIM entries. By virtue of its impressive dataset and its ease of use we hope that the GUDMAP Website will continue to serve as a powerful resource for biologists, clinicians and bioinformaticians with an interest in the urogenital system.</description>
      <guid>http://dx.doi.org/10.1038/npre.2009.3297.1</guid>
      <pubDate>Sat, 30 May 2009 14:17:58 UTC</pubDate>
      <dc:title>GUDMAP &amp;#8211; An Online GenitoUrinary Resource</dc:title>
      <dc:identifier>doi:10.1038/npre.2009.3297.1</dc:identifier>
      <dc:date>2009-05-30</dc:date>
      <dc:creator>Simon Harding</dc:creator>
      <prism:publicationName>Nature Precedings</prism:publicationName>
      <prism:publicationDate>2009-05-30T14:17:58Z</prism:publicationDate>
      <prism:category>Poster</prism:category>
      <prism:section>Developmental Biology</prism:section>
      <prism:section>Genetics &amp; Genomics</prism:section>
      <prism:section>Bioinformatics</prism:section>
      <media:thumbnail url="http://precedings.nature.com/documents/3297/version/1/files/npre20093297-1.pdf.thumb.png"/>
      <creativeCommons:license>http://creativecommons.org/licenses/by/3.0/</creativeCommons:license>
    </item>
    <item>
      <title>H-InvDB release 6, a comprehensive annotation resource for human genes and transcripts</title>
      <link>http://dx.doi.org/10.1038/npre.2009.3251.1</link>
      <description>H-Invitational Database (H-InvDB; http://www.h-invitational.jp/) is an integrated database of human genes and transcripts. By extensive analyses of all human transcripts, we provide curated annotations of human genes and transcripts that include gene structures, alternative splicing isoforms, non-coding functional RNAs, protein functions, functional domains, sub-cellular localizations, metabolic pathways, protein 3D structure, genetic polymorphisms, relation with diseases, gene expression profiling, molecular evolutionary features, protein-protein interactions (PPIs) and gene families/groups.  The latest release of H-InvDB (release 6.0) provide annotation for 219,765 human transcripts in 43,159 human gene clusters based on human FLcDNAs and mRNAs.H-InvDB consists of two main views, the Transcript view and the Locus view, and six auxiliary databases with web-based viewers; G-integra, H-ANGEL, DiseaseInfo Viewer, Evola, PPI view and Gene Family/Group view.  We also provides several data mining tools such as &#8220;Navi search&#8221;: consists of 16 search contents each of which includes items for the search condition (http://www.h-invitational.jp/hinv/c-search/hinvNaviTop.jsp), &#8220;PANDA&#8221;: Priority ANalysis for Disease Association (PANDA) system (http://www.h-invitational.jp/panda/app), H-InvDB now provides web service APIs of SOAP and REST to use H-InvDB data in programs. (http://www.h-invitational.jp/hinv/hws/doc/)</description>
      <guid>http://dx.doi.org/10.1038/npre.2009.3251.1</guid>
      <pubDate>Thu, 14 May 2009 21:30:10 UTC</pubDate>
      <dc:title>H-InvDB release 6, a comprehensive annotation resource for human genes and transcripts</dc:title>
      <dc:identifier>doi:10.1038/npre.2009.3251.1</dc:identifier>
      <dc:date>2009-05-14</dc:date>
      <dc:creator>Chisato Yamasaki</dc:creator>
      <prism:publicationName>Nature Precedings</prism:publicationName>
      <prism:publicationDate>2009-05-14T21:30:10Z</prism:publicationDate>
      <prism:category>Poster</prism:category>
      <prism:section>Genetics &amp; Genomics</prism:section>
      <prism:section>Bioinformatics</prism:section>
      <prism:section>Evolutionary Biology</prism:section>
      <media:thumbnail url="http://precedings.nature.com/documents/3251/version/1/files/npre20093251-1.pdf.thumb.png"/>
      <creativeCommons:license>http://creativecommons.org/licenses/by/3.0/</creativeCommons:license>
    </item>
    <item>
      <title>Data Curation in Biology &amp;#8211; Past, Present and Future</title>
      <link>http://dx.doi.org/10.1038/npre.2009.3225.1</link>
      <description>Data curation has been critical in the development of biology from Darwin and Linnaeus to UniProt, the careful collection and organisation of data has been the spring from which new hypotheses and understanding have emerged. In this presentation, I will describe how we have used data curation in my own research group &amp;#8211; and also present an overview of curation at the EBI. With new technical developments and the move towards the semantic web, the role of curation in the future needs to develop to take advantage of these new opportunities. This will be discussed.</description>
      <guid>http://dx.doi.org/10.1038/npre.2009.3225.1</guid>
      <pubDate>Tue, 12 May 2009 13:19:11 UTC</pubDate>
      <dc:title>Data Curation in Biology &amp;#8211; Past, Present and Future</dc:title>
      <dc:identifier>doi:10.1038/npre.2009.3225.1</dc:identifier>
      <dc:date>2009-05-12</dc:date>
      <prism:publicationName>Nature Precedings</prism:publicationName>
      <prism:publicationDate>2009-05-12T13:19:11Z</prism:publicationDate>
      <prism:category>Presentation</prism:category>
      <prism:section>Genetics &amp; Genomics</prism:section>
      <prism:section>Molecular Cell Biology</prism:section>
      <prism:section>Bioinformatics</prism:section>
      <media:thumbnail url="http://precedings.nature.com/documents/3225/version/1/files/npre20093225-1.pdf.thumb.png"/>
      <creativeCommons:license>http://creativecommons.org/licenses/by/3.0/</creativeCommons:license>
    </item>
    <item>
      <title>Automatisation in UniProtKB / Swiss-Prot Annotation: New Rules and Tools</title>
      <link>http://dx.doi.org/10.1038/npre.2009.3215.1</link>
      <description>The development of next generation sequencing technologies promises a massive increase in the rate of submission of new protein sequences to sequence databases such as the Universal Protein Resource Knowledge Base, UniProtKB. At UniProtKB/Swiss-Prot we propose to meet this challenge by continuing to expand and develop systems for the automatic propagation of existing annotation to newly submitted protein sequences. These developments will promote the standardization of ortholog annotation both across and within kingdoms and significantly enhance our ability to accurately annotate new protein sequences which are being produced at an ever increasing rate.</description>
      <guid>http://dx.doi.org/10.1038/npre.2009.3215.1</guid>
      <pubDate>Fri, 08 May 2009 15:41:19 UTC</pubDate>
      <dc:title>Automatisation in UniProtKB / Swiss-Prot Annotation: New Rules and Tools</dc:title>
      <dc:identifier>doi:10.1038/npre.2009.3215.1</dc:identifier>
      <dc:date>2009-05-08</dc:date>
      <dc:creator>Alan Bridge</dc:creator>
      <prism:publicationName>Nature Precedings</prism:publicationName>
      <prism:publicationDate>2009-05-08T15:41:19Z</prism:publicationDate>
      <prism:category>Poster</prism:category>
      <prism:section>Bioinformatics</prism:section>
      <media:thumbnail url="http://precedings.nature.com/documents/3215/version/1/files/npre20093215-1.pdf.thumb.png"/>
      <creativeCommons:license>http://creativecommons.org/licenses/by/3.0/</creativeCommons:license>
    </item>
  </channel>
</rss>
