The Gene Ontology Annotation (GOA) Database
Correspondence: (Login to view email address)
- EMBL-EBI
PDF (490.6 KB)
- Document Type:
- Poster
- Date:
- Received 23 April 2009 10:42 UTC; Posted 23 April 2009
- Subjects:
- Genetics & Genomics, Bioinformatics
- Abstract:
The Gene Ontology (GO) is a well-established, structured vocabulary that has been successfully used for 10 years in the annotation of proteins. GO terms, created in consultation with the biology community, are used to replace the multiple nomenclatures used by scientific databases that can hamper data integration. Currently GO consists of more than 26,500 terms distributed over three ontologies that describe the molecular function, biological process and subcellular location of a protein in a generic cell.
The Gene Ontology Annotation (GOA) database (http://www.ebi.ac.uk/GOA) aims to provide high-quality manual and electronic GO annotations to proteins within the UniProt Knowledgebase (UniProtKB). By annotating all ‘known’ proteins with GO terms and transferring this knowledge to highly similar ‘unknown’ proteins, GOA offers a valuable contribution to the understanding of all proteomes.
As well as generating manual annotation, made by extracting experimental evidence from full text peer-reviewed publications, GOA produces electronic annotation by making large-scale assignments of GO terms to proteins using computational methods. To date we have six electronic annotation methods including; InterPro2GO, Swiss-Prot Keyword2GO and the projection of annotations between orthologous species using Ensembl Compara.
GOA provides annotated entries for over 180,000 species and is the largest and most comprehensive open-source contributor of annotations to the GO Consortium annotation effort. In addition, by integrating GO annotations from model organism groups (e.g. FlyBase, GeneDB, MGI, RGD, SGD and TAIR), GOA ensures the dataset remains a key reference. GOA prioritises the annotation of the human proteome and provides this annotation to the GO Consortium’s Reference Genome project.
GOA produces monthly releases of annotations to the human, mouse, rat, zebrafish, cow, chicken and Arabidopsis proteomes as well as a file for the multiple species within UniProtKB. The GOA dataset can be queried through a user-friendly web interface via our QuickGO browser
(http://www.ebi.ac.uk/QuickGO) or downloaded in a parsable format via the EBI
(ftp://ftp.ebi.ac.uk/pub/databases/GO/goa) and GO FTP sites.The GOA dataset has increasingly been integrated into tools that aid in the analysis of large datasets resulting from high-throughput experiments thus assisting researchers in biological interpretation of their results.
- Collection:
- 3rd International Biocuration Conference
- Presented at:
- 3rd International Biocuration Conference, 16 April 2009
Discussion
- Votes:
-
0 votes
- Comments:
-
0 comments
- (Login to share with a colleague)
Additional information
- License:
- This document is licensed to the public under the Creative Commons Attribution 3.0 License
- How to cite this document:
-
Huntley, Rachael, Dimmer, Emily, Barrell, Daniel, Binns, David, and Apweiler, Rolf. The Gene Ontology Annotation (GOA) Database. Available from Nature Precedings <http://dx.doi.org/10.1038/npre.2009.3154.1> (2009)
- Version info:
-
Other versions of this document in Nature Precedings
None.
Other versions of this document elsewhere on the web
None known.