<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:creativeCommons="http://backend.userland.com/creativeCommonsRssModule" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/" version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:media="http://search.yahoo.com/mrss/">
  <channel>
    <title>Nature Precedings - Roderic Page</title>
    <link>http://precedings.nature.com/users/b16ff4bc0fc20d123b346d86f531bda2/</link>
    <description>Documents posted by Roderic Page</description>
    <dc:publisher>Nature Publishing Group</dc:publisher>
    <dc:language>en</dc:language>
    <prism:publicationName>Nature Precedings</prism:publicationName>
    <image>
      <title>Nature Precedings</title>
      <url>http://precedings.nature.com/images/header_logo.gif</url>
      <link>http://precedings.nature.com</link>
    </image>
    <atom:link type="application/rss+xml" rel="self" href="http://precedings.nature.com/users/b16ff4bc0fc20d123b346d86f531bda2/feed"/>
    <item>
      <title>Enhanced display of scienti&#64257;c articles using extended metadata</title>
      <link>http://precedings.nature.com/documents/3173/version/1</link>
      <description>Although the Web has transformed science publishing, scientific papers themselves are still essentially &amp;#8220;black boxes&amp;#8221;, with much of their content intended for human readers only.  Typically, computer-readable metadata associated with an article is limited to bibliographic details. By expanding article  metadata to include taxonomic names, identifiers for cited material (e.g., publications, sequences, specimens, and other data), and geographical  coordinates,  publishers could greatly  increase the scientific value of their digital content. At the same time this will provide  novel ways for users to discover and navigate through this content, beyond the relatively limited linkage provided by bibliographic citation.As a proof of concept, my entry in the Elsevier Grand Challenge extracted extended metadata from a set of articles from the journal Molecular Phylogeny and Evolution and used it to populate a  entity-attribute-value database. A simple web interface to this database enables an enhanced display of the content of an article, including a map of localities mentioned either explicitly or implicitly (through links to geotagged data), taxonomic coverage, and both data and citation links. Metadata extraction was limited to information listed in tables in the articles (such as GenBank sequences and specimen codes), the body of the article wasn&amp;#8217;t used. This restriction was deliberate, in order to demonstrate that making extended metadata available doesn&amp;#8217;t require a journal&amp;#8217;s publisher to make the full-text freely available (although this is desirable for other reasons). </description>
      <guid>http://precedings.nature.com/documents/3173/version/1</guid>
      <pubDate>Fri, 24 Apr 2009 17:31:02 UTC</pubDate>
      <dc:title>Enhanced display of scienti&#64257;c articles using extended metadata</dc:title>
      <dc:identifier>hdl:10101/npre.2009.3173.1</dc:identifier>
      <dc:date>2009-04-24</dc:date>
      <dc:creator>Roderic D. M. Page</dc:creator>
      <prism:publicationName>Nature Precedings</prism:publicationName>
      <prism:publicationDate>2009-04-24T17:31:02Z</prism:publicationDate>
      <prism:category>Manuscript</prism:category>
      <prism:section>Ecology</prism:section>
      <prism:section>Bioinformatics</prism:section>
      <prism:section>Evolutionary Biology</prism:section>
      <media:thumbnail url="http://precedings.nature.com/documents/3173/version/1/files/npre20093173-1.pdf.thumb.png"/>
      <creativeCommons:license>http://creativecommons.org/licenses/by/3.0/</creativeCommons:license>
    </item>
    <item>
      <title>bioGUID: resolving, discovering, and minting identifiers for biodiversity informatics</title>
      <link>http://precedings.nature.com/documents/3079/version/1</link>
      <description>Linking together the data of interest to biodiversity researchers (including specimen records, images, taxonomic names, and DNA sequences) requires services that can mint, resolve, and discover globally unique identifiers (including, but not limited to, DOIs, HTTP URIs, and LSIDs). bioGUID implements a range of services, the core ones being an OpenURL resolver for bibliographic resources, and a LSID resolver. The LSID resolver supports Linked Data-friendly resolution using  HTTP 303 redirects and content negotiation. Additional services include journal ISSN look-up, author name matching, and a tool to monitor the status of biodiversity data providers. bioGUID is available at http://bioguid.info/. Source code is available from Google Code.</description>
      <guid>http://precedings.nature.com/documents/3079/version/1</guid>
      <pubDate>Fri, 17 Apr 2009 21:01:04 UTC</pubDate>
      <dc:title>bioGUID: resolving, discovering, and minting identifiers for biodiversity informatics</dc:title>
      <dc:identifier>hdl:10101/npre.2009.3079.1</dc:identifier>
      <dc:date>2009-04-17</dc:date>
      <dc:creator>Roderic Page</dc:creator>
      <prism:publicationName>Nature Precedings</prism:publicationName>
      <prism:publicationDate>2009-04-17T21:01:04Z</prism:publicationDate>
      <prism:category>Manuscript</prism:category>
      <prism:section>Bioinformatics</prism:section>
      <media:thumbnail url="http://precedings.nature.com/documents/3079/version/1/files/npre20093079-1.pdf.thumb.png"/>
      <creativeCommons:license>http://creativecommons.org/licenses/by/3.0/</creativeCommons:license>
    </item>
    <item>
      <title>On The Dangers Of Aligning RNA Sequences Using &#8220;Conserved&#8221; Motifs </title>
      <link>http://dx.doi.org/10.1038/npre.2007.1029.1</link>
      <description>Aligning RNA sequences can be a challenging task. Automatic sequence alignment programs typically align sequences only with respect to primary sequence, and as a result may yield spurious alignments. Incorporating information on RNA secondary structure can improve the alignment, but this must usually be done by hand. One approach to aligning RNA sequences uses &amp;#8220;conserved motifs&amp;#8221;, however relying on these motifs may lead to gross errors of alignment if, in fact, those motifs are not conserved.</description>
      <guid>http://dx.doi.org/10.1038/npre.2007.1029.1</guid>
      <pubDate>Tue, 18 Sep 2007 10:48:38 UTC</pubDate>
      <dc:title>On The Dangers Of Aligning RNA Sequences Using &#8220;Conserved&#8221; Motifs </dc:title>
      <dc:identifier>doi:10.1038/npre.2007.1029.1</dc:identifier>
      <dc:date>2009-03-04</dc:date>
      <dc:creator>Roderic Page</dc:creator>
      <prism:publicationName>Nature Precedings</prism:publicationName>
      <prism:publicationDate>2007-09-18T10:48:38Z</prism:publicationDate>
      <prism:category>Manuscript</prism:category>
      <prism:section>Bioinformatics</prism:section>
      <prism:section>Evolutionary Biology</prism:section>
      <media:thumbnail url="http://precedings.nature.com/documents/1029/version/1/files/npre20071029-1.pdf.thumb.png"/>
      <creativeCommons:license>http://creativecommons.org/licenses/by/2.5/</creativeCommons:license>
    </item>
    <item>
      <title>Treemap Versus BPA (Again): A Response to Dowling</title>
      <link>http://dx.doi.org/10.1038/npre.2007.1030.1</link>
      <description>TreeMap is a computer program for analysing host-parasite cospeciation. We respond to Dowling&#8217;s (Cladistics, 18: 416-435) recent comparison of TreeMap and Brooks Parsimony Analysis (BPA) by showing that Dowling&#8217;s comparison suffers from several mistakes and flaws. We discuss the problems with both BPA and TreeMap, and show that BPA incorrectly counts the true number coevolutionary events more often than TreeMap 1. We also discuss the two main limitations of TreeMap 1 correctly identified by Dowling, namely its inability to handle widespread parasites, and its coarse optimality criterion (the number of cospeciation events). We suggest a simple fix for widespread parasites. The newly released TreeMap 2 uses a more sensitive optimality criterion than TreeMap 1, addressing Dowling&#8217;s second concern.</description>
      <guid>http://dx.doi.org/10.1038/npre.2007.1030.1</guid>
      <pubDate>Tue, 18 Sep 2007 10:49:19 UTC</pubDate>
      <dc:title>Treemap Versus BPA (Again): A Response to Dowling</dc:title>
      <dc:identifier>doi:10.1038/npre.2007.1030.1</dc:identifier>
      <dc:date>2009-03-04</dc:date>
      <dc:creator>Roderic Page</dc:creator>
      <prism:publicationName>Nature Precedings</prism:publicationName>
      <prism:publicationDate>2007-09-18T10:49:19Z</prism:publicationDate>
      <prism:category>Manuscript</prism:category>
      <prism:section>Bioinformatics</prism:section>
      <prism:section>Evolutionary Biology</prism:section>
      <media:thumbnail url="http://precedings.nature.com/documents/1030/version/1/files/npre20071030-1.pdf.thumb.png"/>
      <creativeCommons:license>http://creativecommons.org/licenses/by/2.5/</creativeCommons:license>
    </item>
    <item>
      <title>Towards a Taxonomically Intelligent Phylogenetic Database</title>
      <link>http://dx.doi.org/10.1038/npre.2007.1028.1</link>
      <description>This note outlines some of the key intellectual obstacles that stand in the way of creating a usable phylogenetic database. These challenges include the need to accommodate multiple taxonomic names and classifications, and the need for tools to query trees in biologically meaningful ways. Until these problems are addressed, and a taxonomically intelligent phylogenetic database created, much of our phylogenetic knowledge will languish in the pages of journals.</description>
      <guid>http://dx.doi.org/10.1038/npre.2007.1028.1</guid>
      <pubDate>Tue, 18 Sep 2007 11:42:40 UTC</pubDate>
      <dc:title>Towards a Taxonomically Intelligent Phylogenetic Database</dc:title>
      <dc:identifier>doi:10.1038/npre.2007.1028.1</dc:identifier>
      <dc:date>2009-03-04</dc:date>
      <dc:creator>Roderic Page</dc:creator>
      <prism:publicationName>Nature Precedings</prism:publicationName>
      <prism:publicationDate>2007-09-18T11:42:40Z</prism:publicationDate>
      <prism:category>Manuscript</prism:category>
      <prism:section>Bioinformatics</prism:section>
      <prism:section>Evolutionary Biology</prism:section>
      <media:thumbnail url="http://precedings.nature.com/documents/1028/version/1/files/npre20071028-1.pdf.thumb.png"/>
      <creativeCommons:license>http://creativecommons.org/licenses/by/2.5/</creativeCommons:license>
    </item>
    <item>
      <title>Visualising a scienti&#64257;c article</title>
      <link>http://dx.doi.org/10.1038/npre.2008.2579.1</link>
      <description>This paper describes my entry in the Elsevier Grand Challenge &amp;#8220;Knowledge Enhancement in the Life Sciences&amp;#8221; contest.  The entry takes a collection of fulltext issues of Molecular Phylogenetics and Evolution as the starting point, then extracts citation links to both papers and data, such as Genbank sequences and specimens, together with geotagged localities, and builds a &amp;#8220;web&amp;#8221; of objects linked by typed relationships. Each object (such as a publication, a sequence, a specimen, a taxon name, etc.) is treated equally, so that you can take a publication and see what taxa it refers to, or take the taxon and find all the publications that refer to the taxon. Although the database has been seeded with some articles from Molecular Phylogenetics and Evolution, much of the data comes from GenBank, PubMed, and specimen databases. These are accessed through bioGUID, a tool I constructed to resolve globally unique identifiers and return associated metadata.</description>
      <guid>http://dx.doi.org/10.1038/npre.2008.2579.1</guid>
      <pubDate>Fri, 28 Nov 2008 16:52:15 UTC</pubDate>
      <dc:title>Visualising a scienti&#64257;c article</dc:title>
      <dc:identifier>doi:10.1038/npre.2008.2579.1</dc:identifier>
      <dc:date>2008-11-28</dc:date>
      <dc:creator>Roderic Page</dc:creator>
      <prism:publicationName>Nature Precedings</prism:publicationName>
      <prism:publicationDate>2008-11-28T16:52:15Z</prism:publicationDate>
      <prism:category>Manuscript</prism:category>
      <prism:section>Ecology</prism:section>
      <prism:section>Bioinformatics</prism:section>
      <media:thumbnail url="http://precedings.nature.com/documents/2579/version/1/files/npre20082579-1.pdf.thumb.png"/>
      <creativeCommons:license>http://creativecommons.org/licenses/by/3.0/</creativeCommons:license>
    </item>
    <item>
      <title>Towards realising Darwin&#8217;s dream: setting the trees free</title>
      <link>http://dx.doi.org/10.1038/npre.2008.2217.1</link>
      <description>The fact that all living organisms are related by common descent is one of the central principles of modern biology.  Since the early 1990&amp;#8217;s the amount of data available to evolutionary biologists has exploded, and Elsevier&#8217;s journal Molecular Phylogenetics and Evolution, has become the largest single publisher of evolutionary trees (phylogenies). These trees and their supporting data potentially form a tremendous resource for biologists, with applications in genomics, evolutionary biology, biodiversity, and public health. However, most published trees are not available in any public database, but instead languish as images, &amp;#8220;locked up&amp;#8221; in the pages of journals. A long term solution to this problem is to invert the relationship between journal and database, such that the database is the primary repository, and the journal article becomes effectively a &amp;#8220;report&amp;#8221; on that data, albeit a report that is citable, and thus has the same status as a scientific article. This vision is some way off being achieved. However, publishers could greatly enhance the scientific value of their digital content by expanding article metadata to include taxonomic names, shared digital identifiers, geographical coordinates, and data structures such as evolutionary trees. This metadata doesn&#8217;t require making the full text available, but could substantially improve the findability of that text.</description>
      <guid>http://dx.doi.org/10.1038/npre.2008.2217.1</guid>
      <pubDate>Fri, 22 Aug 2008 15:07:45 UTC</pubDate>
      <dc:title>Towards realising Darwin&#8217;s dream: setting the trees free</dc:title>
      <dc:identifier>doi:10.1038/npre.2008.2217.1</dc:identifier>
      <dc:date>2008-08-22</dc:date>
      <dc:creator>Roderic Page</dc:creator>
      <prism:publicationName>Nature Precedings</prism:publicationName>
      <prism:publicationDate>2008-08-22T15:07:45Z</prism:publicationDate>
      <prism:category>Manuscript</prism:category>
      <prism:section>Ecology</prism:section>
      <prism:section>Bioinformatics</prism:section>
      <media:thumbnail url="http://precedings.nature.com/documents/2217/version/1/files/npre20082217-1.pdf.thumb.png"/>
      <creativeCommons:license>http://creativecommons.org/licenses/by/3.0/</creativeCommons:license>
    </item>
    <item>
      <title>Biodiversity informatics: the challenge of linking data and the role of shared identifiers</title>
      <link>http://precedings.nature.com/documents/1760/version/1</link>
      <description>A major challenge facing biodiversity informatics is integrating data stored in widely distributed databases. Initial efforts have relied on taxonomic names as the shared identifier linking records in different databases. However, taxonomic names have limitations as identifiers, being neither stable nor globally unique, and the pace of molecular taxonomic and phylogenetic research means that a lot of information in public sequence databases is not linked to formal taxonomic names. This review explores the use of other identifiers, such as specimen codes and GenBank accession numbers, to link otherwise disconnected facts in different databases. The structure of these links can also be exploited using the PageRank algorithm to rank the results of searches on biodiversity databases. The key to rich integration is a commitment to deploy and reuse globally unique, shared identifiers (such as DOIs and LSIDs), and the implementation of services that link those identifiers.</description>
      <guid>http://precedings.nature.com/documents/1760/version/1</guid>
      <pubDate>Thu, 03 Apr 2008 20:10:36 UTC</pubDate>
      <dc:title>Biodiversity informatics: the challenge of linking data and the role of shared identifiers</dc:title>
      <dc:identifier>hdl:10101/npre.2008.1760.1</dc:identifier>
      <dc:date>2008-04-03</dc:date>
      <dc:creator>Roderic Page</dc:creator>
      <prism:publicationName>Nature Precedings</prism:publicationName>
      <prism:publicationDate>2008-04-03T20:10:36Z</prism:publicationDate>
      <prism:category>Manuscript</prism:category>
      <prism:section>Ecology</prism:section>
      <prism:section>Bioinformatics</prism:section>
      <media:thumbnail url="http://precedings.nature.com/documents/1760/version/1/files/npre20081760-1.pdf.thumb.png"/>
      <creativeCommons:license>http://creativecommons.org/licenses/by/3.0/</creativeCommons:license>
    </item>
  </channel>
</rss>
