doi:10.1038/npre.2009.3222.1

Present and future of proteomics data curation at the PRIDE database

Juan Antonio Vizcaino¹, Henning Hermjakob¹ & Lennart Martens¹

  1. Proteomics Services Team, EMBL-EBI, Wellcome Trust Genome Campus, Hinxton, Cambridge, UK
Document Type:
Poster
Date:
Received 05 May 2009 17:45 UTC; Posted 06 May 2009
Subjects:
Biotechnology, Bioinformatics
Abstract:

The introduction of publicly available proteomics repositories has significantly improved the accessibility and utility of the large volumes of high-throughput proteomics data now being generated. One such repository is PRIDE (the ‘PRoteomics IDEntifications’ database, http://www.ebi.ac.uk/pride). PRIDE stores mass spectrometry-related data, including peptide and protein identifications, mass spectra, and valuable additional metadata.

At present, data curation in PRIDE is limited to data submission support. All submissions must be made in the PRIDE XML format. Mass spectrometry-derived data is very heterogeneous in terms of experimental approaches, instrumentation, data formats, etc. Converting such varied data to PRIDE XML is therefore far from trivial and can be very time-consuming, since tailored submission pipelines must often be constructed. However, the situation has improved now that new tools such as PRIDE Converter (http://code.google.com/p/pride-converter) are available for submitters to convert their data to PRIDE XML.

In the near future, data curation in PRIDE will be significantly extended. High-quality data will be included in a new repository called PRIDE-plus. First, a set of minimal requirement rules must be defined to decide which datasets can be included in PRIDE-plus. Next, new curation tools for data quality assessment will need to be designed and implemented. Research into automating these new curation and annotation tasks will also be necessary.
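
As a purely illustrative sketch of how a set of minimal requirement rules might be applied automatically, the Python below checks a dataset summary against a list of named rules. The `Dataset` fields, the three example rules, and the function names are all assumptions made for this example; they are not PRIDE's actual inclusion criteria or data model.

```python
from dataclasses import dataclass


@dataclass
class Dataset:
    """Hypothetical summary of one submitted dataset (fields are illustrative)."""
    accession: str
    has_spectra: bool
    has_instrument_metadata: bool
    peptide_count: int


# Each rule is a (description, predicate) pair. These example rules are
# assumptions for illustration only, not real PRIDE-plus requirements.
RULES = [
    ("mass spectra are present", lambda d: d.has_spectra),
    ("instrument metadata is annotated", lambda d: d.has_instrument_metadata),
    ("at least one peptide identification", lambda d: d.peptide_count > 0),
]


def failed_rules(dataset, rules=RULES):
    """Return the descriptions of every rule the dataset does not satisfy."""
    return [description for description, check in rules if not check(dataset)]


def is_eligible(dataset, rules=RULES):
    """A dataset qualifies only if it passes every rule in the set."""
    return not failed_rules(dataset, rules)
```

Keeping each rule as a named predicate means a rejected dataset can be returned to the submitter with the list of unmet requirements rather than a bare pass or fail.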

Collection:
3rd International Biocuration Conference
Presented at:
3rd International Biocuration Conference, 16 April 2009

Additional information

License:
This document is licensed to the public under the Creative Commons Attribution 3.0 License
How to cite this document:

Vizcaino, Juan Antonio, Hermjakob, Henning, and Martens, Lennart. Present and future of proteomics data curation at the PRIDE database. Available from Nature Precedings <http://dx.doi.org/10.1038/npre.2009.3222.1> (2009)
