doi:10.1038/npre.2009.3900.1
1 vote

BRAGOMAP – a new Perl script for high throughoutput blast results analysis including GO and MapMan automatic annotations

Rafal Woycicki1, Wojciech Gutman2 & Zbigniew Przybecki1

Correspondence: (Login to view email address)

  1. Dept of Plant Genetics, Breeding and Biotechnology, Faculty of Horticulture and Landscape Architecture, Warsaw University of Life Sciences SGGW, Warsaw, Poland
  2. Dept of Biochemistry, Faculty of Agronomy and Biology, Warsaw University of Life Sciences - SGGW, Poland
Document Type:
Poster
Date:
Received 25 October 2009 19:49 UTC; Posted 26 October 2009
Subjects:
Genetics & Genomics, Bioinformatics, Plant Biology
Tags:
Abstract:

Analyzing of sequences similarities is the first and most important method used to find out the function of unknown nucleotides. Searching of homologs should be done carefully not to loose any important ones. Having thousands of results from various long-read sequencing projects (ie. differentially expressed tags, genomic polymorphons or BAC ends), the by-hand ability to retrieve interesting (to our goal) similarities in hundreds of Blast results decreases rapidly. Decreasing the number of retrieved sequences by giving more stringency in e-value threshold or displaying less results could lead to false deductions. Functional genomics, proteomics and metabolomics could give us answers to the role of nucleotide sequences. It makes the need to annotate as much of the homologies as we can, to proper molecular function, biological process and cellular component (as its proposed by widely accepted Gene Ontology Consortium annotations or MapMan mappings by Max-Planc-Institute).

To facilitate fast retrieval of interesting Blast homologies and making right deductions about the biological role of sequences, in big sequencing projects, the new Perl script BRAGOMAP was written. The program make use of some of BioPerl modules as well as the power of regex text-mining in the Perl itself.

The script gives us the possibility to find interesting sequence similarities by using keywords and giving points for each one found. It collects all important information from the GenBank data and puts it in different columns of tab-delimited file for further use. If we were interested (for example) in flower differentiation genes we could use the keywords (flower, ovule, anther, etc.) and/or filter all the homologies isolated from flower tissues in a special development stage. We can also filter results by choosing similarities to interesting genes or protein products. This script retrieve also all standard information from the Blast and GenBank files as Description, ACC no., E-value, Similarity positions, Query Length, Percent of Similarity etc. Automatic GO and MapMan annotations are done by looking for genes, protein products and /or DB references in the proper mappings files. Here we present the usefulness of the script in analyzing sequence similarities and annotations mapping of 3855 BAC ends obtained from the HindIII BAC genomic library of cucumber (Cucumis sativus L., line B10).

Presented at:
EMBnet CONFERENCE 2008 20th Anniversary Celebration - Leading applications and technologies, 18 September 2008

Discussion

Votes:

1 vote

(Login to vote)

Comments:

0 comments

(Login to post a comment)

(Login to share with a colleague)

Additional information

License:
This document is licensed to the public under the Creative Commons Attribution 3.0 License
How to cite this document:

Woycicki, Rafal, Gutman, Wojciech, and Przybecki, Zbigniew. BRAGOMAP – a new Perl script for high throughoutput blast results analysis including GO and MapMan automatic annotations. Available from Nature Precedings <http://dx.doi.org/10.1038/npre.2009.3900.1> (2009)

Version info:

Other versions of this document in Nature Precedings

None.

Other versions of this document elsewhere on the web

None known.

Participate

Related Documents

Advertisement