Searching the World-Wide-Web using nucleotide and peptide sequences
Correspondence:
- Department of Computer Science, Bioinformatics & Computational Biosciences Unit, 329A Saint Mary's Hall, Georgetown University, 37th and O Streets NW, Washington, DC 20057-1232
- Document Type:
- Manuscript
- Date:
- Received 08 November 2008 22:04 UTC; Posted 11 November 2008
- Subjects:
- Bioinformatics
- Abstract:
Background: No approach has yet been developed that allows instant searching of the World-Wide-Web by simply entering a string of sequence data. Although general search engines can be tuned to accept ‘processed’ queries, the burden of preparing such ‘search strings’ defeats the purpose of quickly locating highly relevant information. Unlike ‘sequence similarity’ searches, which employ dedicated algorithms (like BLAST) to compare an input sequence against defined databases, a direct ‘sequence based’ search quickly locates relevant information about a raw piece of nucleotide or peptide sequence. This approach is particularly valuable to biomedical researchers, who often want to enter a sequence and quickly locate any pertinent information before proceeding to a detailed sequence alignment.
Results: Here, we describe the theory and implementation of a web-based front-end for a search engine, like Google, that accepts sequence fragments and interactively retrieves, in real time, a collection of highly relevant links and documents, e.g. flat files like patent records, privately hosted sequence documents and regular databases.
Conclusions: The importance of this simple yet highly relevant tool will become evident once, with a little tweaking, it is engineered to carry out searches on all kinds of documents hosted on the World-Wide-Web.
Availability: Instaseq is a free web-based service that can be accessed at the following address:
http://instaseq.georgetown.edu
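The abstract does not spell out how a pasted sequence fragment becomes a search-engine query, so the following is only a minimal sketch of the general idea, not Instaseq's actual implementation. It assumes the front-end strips whitespace and position numbers from the input, checks that what remains looks like a nucleotide or peptide sequence, and submits it as a single quoted phrase. The function names (`clean_sequence`, `classify`, `build_search_url`), the alphabets, and the choice of Google's public search URL are all illustrative assumptions.

```python
import re
from urllib.parse import urlencode

# Assumed alphabets: DNA/RNA bases plus N, and the 20 standard amino acids.
NUCLEOTIDE = set("ACGTUN")
PEPTIDE = set("ACDEFGHIKLMNPQRSTVWY")


def clean_sequence(raw: str) -> str:
    """Remove whitespace and digits (common in pasted records) and uppercase."""
    return re.sub(r"[\s\d]", "", raw).upper()


def classify(seq: str) -> str:
    """Guess the sequence type from its alphabet.

    The nucleotide alphabet is checked first, because a string such as
    "ACGT" is also a valid (if unlikely) peptide.
    """
    if not seq:
        return "unknown"
    letters = set(seq)
    if letters <= NUCLEOTIDE:
        return "nucleotide"
    if letters <= PEPTIDE:
        return "peptide"
    return "unknown"


def build_search_url(raw: str, base: str = "https://www.google.com/search") -> str:
    """Turn a raw fragment into a search URL with the sequence as one quoted phrase."""
    seq = clean_sequence(raw)
    if classify(seq) == "unknown":
        raise ValueError("input does not look like a nucleotide or peptide sequence")
    query = '"' + seq + '"'  # quoting asks the engine for an exact match
    return base + "?" + urlencode({"q": query})
```

For example, `build_search_url("acgt tgca")` returns `https://www.google.com/search?q=%22ACGTTGCA%22`. A real front-end would likely need more than this, for instance splitting long sequences into the shorter fragments in which they tend to appear in hosted documents, but the abstract gives no detail on that step.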
Additional information
- License:
- This document is licensed to the public under the Creative Commons Attribution 3.0 License
- How to cite this document:
- Ganesan, Natarajan; Bennett, Nicholas; Kalyanasundaram, Bala; Velauthapillai, Mahe; and Squier, Richard. Searching the World-Wide-Web using nucleotide and peptide sequences. Available from Nature Precedings <http://hdl.handle.net/10101/npre.2008.2492.1> (2008)
- Version info:
- Other versions of this document in Nature Precedings: None.
- Other versions of this document elsewhere on the web: None known.