hdl:10101/npre.2008.2492.1
2 votes

Searching the World-Wide-Web using nucleotide and peptide sequences

Natarajan Ganesan1, Nicholas F. Bennett1, Bala Kalyanasundaram1, Mahe Velauthapillai1 & Richard Squier1

Correspondence: (Login to view email address)

  1. Department of Computer Science, Bioinformatics & Computational Biosciences Unit, 329A Saint Mary's Hall Georgetown University 37th and O Streets, NW Washington, DC 20057-1232
Document Type:
Manuscript
Date:
Received 08 November 2008 22:04 UTC; Posted 11 November 2008
Subjects:
Bioinformatics
Tags:
Abstract:

Background: No approaches have yet been developed to allow instant searching of the World-Wide-Web by just entering a string of sequence data. Though general search engines can be tuned to accept ‘processed’ queries, the burden of preparing such ‘search strings’ simply defeats the purpose of quickly locating highly relevant information. Unlike ‘sequence similarity’ searches that employ dedicated algorithms (like BLAST) to compare an input sequence from defined databases, a direct ‘sequence based’ search simply locates quick and relevant information about a blunt piece of nucleotide or peptide sequence. This approach is particularly invaluable to all biomedical researchers who would often like to enter a sequence and quickly locate any pertinent information before proceeding to carry out detailed sequence alignment.

Results: Here, we describe the theory and implementation of a web-based front-end for a search engine, like Google, which accepts sequence fragments and interactively retrieves a collection of highly relevant links and documents, in real-time. e.g. flat files like patent records, privately hosted sequence documents and regular databases.

Conclusions: The importance of this simple yet highly relevant tool will be evident when with a little bit of tweaking, the tool can be engineered to carry out searches on all kinds of hosted documents in the World-Wide-Web.

Availability: Instaseq is free web based service that can be accessed by visiting the following hyperlink on the WWW
http://instaseq.georgetown.edu

Discussion

Votes:

2 votes

(Login to vote)

Comments:

0 comments

(Login to post a comment)

(Login to share with a colleague)

Additional information

License:
This document is licensed to the public under the Creative Commons Attribution 3.0 License
How to cite this document:

Ganesan, Natarajan, Bennett, Nicholas, Kalyanasundaram, Bala, Velauthapillai, Mahe, and Squier, Richard. Searching the World-Wide-Web using nucleotide and peptide sequences. Available from Nature Precedings <http://hdl.handle.net/10101/npre.2008.2492.1> (2008)

Version info:

Other versions of this document in Nature Precedings

None.

Other versions of this document elsewhere on the web

None known.

Participate

Related Documents

Advertisement