doi:10.1038/npre.2009.3165.1
0 votes

Immunogenetic sequence annotation based on IMGT-ONTOLOGY

Joumana Jabado-Michaloud1, Géraldine Folch1, Fatena Bellahcene1, François Ehrenmann1, Patrice Duroux1, Véronique Giudicelli1 & Marie-Paule Lefranc1

Correspondence: (Login to view email address)

  1. IMGT Information System
Document Type:
Poster
Date:
Received 24 April 2009 09:18 UTC; Posted 24 April 2009
Subjects:
Genetics & Genomics, Immunology, Bioinformatics
Tags:
Abstract:

IMGT/LIGM-DB1 is the first and the largest IMGT® database2 in which are managed, analysed and annotated more than 136,000 immunoglobulin (IG) and T cell receptor (TR) nucleotide sequences from human and 235 other vertebrate species (April 2009). The expert annotation of these sequences and the added standardized knowledge are based on IMGT-ONTOLOGY, the first ontology developed in the field of immunogenetics and immunoinformatics.3 The annotation of immunogenetic sequences requires important expertise, owing to the unusual structure (non-classical exon/intron structure) of the IG and TR genes and characteristic chain synthesis owing to DNA V-J and V-D-J rearrangements. The way to annotate these sequences depends on the molecular type (gDNA, mRNA, cDNA or protein) and the configuration type (germline or rearranged), and if sequences from the concerned species are present or not in the IMGT reference directory sets. IMGT/V-QUEST5 and internal tools (IMGT/Automat, IMGT/LIGMotif, IMGT/BLAST and IMGT/DomainGapAlign) were developed. The first step in annotation allows to identify the chain type (for instance IG-Heavy) and to assign standardized keywords (IDENTIFICATION axiom). The second step is the classification of IG and TR genes and alleles (CLASSIFICATION axiom). The third step is the description (DESCRIPTION axiom) of the V, D, J and C genes and alleles with specific standardized labels. There are more than 590 IMGT standardized labels from which 64 have been entered in Sequence Ontology (SO). The delimitation of the FR-IMGT and CDR-IMGT lengths and the positions of conserved amino acids based on the IMGT unique numbering (NUMEROTATION axiom) allow to bridge the gap between sequences and 3D structures.6 The complete annotation of immunogenetic germline (V, D, J) and C sequences is followed by the update of the IMGT Repertoire (IMGT Gene tables, Alignments of alleles, Protein displays, Colliers de Perles, etc.), IMGT® gene database (IMGT/GENE-DB) and IMGT reference directory sets of the IMGT® tools (IMGT/V-QUEST, IMGT/JunctionAnalysis and IMGT/DomainGapAlign).

Collection:
3rd International Biocuration Conference
Presented at:
3rd International Biocuration Conference, 16 April 2009

Discussion

Votes:

0 votes

(Login to vote)

Comments:

0 comments

(Login to post a comment)

(Login to share with a colleague)

Additional information

License:
This document is licensed to the public under the Creative Commons Attribution 3.0 License
How to cite this document:

Jabado-Michaloud, Joumana , Folch, Géraldine, Bellahcene, Fatena, Ehrenmann, François, Duroux, Patrice, Giudicelli, Véronique, and Lefranc, Marie-Paule. Immunogenetic sequence annotation based on IMGT-ONTOLOGY. Available from Nature Precedings <http://dx.doi.org/10.1038/npre.2009.3165.1> (2009)

Version info:

Other versions of this document in Nature Precedings

None.

Other versions of this document elsewhere on the web

None known.

Participate

Related Documents

Advertisement