Journal of Computational Biology

Significance of Gapped Sequence Alignments

To cite this article:
Lee A. Newberg. Journal of Computational Biology. November 2008, 15(9): 1187-1194. doi:10.1089/cmb.2008.0125.

Published in Volume: 15 Issue 9: October 30, 2008

Author information

Lee A. Newberg
Center for Bioinformatics, Wadsworth Center, New York State Department of Health, Albany, New York and Department of Computer Science, Rensselaer Polytechnic Institute, Troy, New York.

ABSTRACT

Measurement of the statistical significance of extreme sequence alignment scores is key to many important applications, but it is difficult. To precisely approximate alignment score significance, we draw random samples directly from a well chosen, importance-sampling probability distribution. We apply our technique to pairwise local sequence alignment of nucleic acid and amino acid sequences of length up to 1000. For instance, using a BLOSUM62 scoring system for local sequence alignment, we compute that the p-value of a score of 6000 for the alignment of two sequences of length 1000 is (3.4 ± 0.3) × 10^−1314. Further, we show that the extreme value significance statistic for the local alignment model that we examine does not follow a Gumbel distribution. A web server for this application is available at http://bayesweb.wadsworth.org/alignmentSignificanceV1/.
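The general idea of importance sampling for a small tail probability can be illustrated with a toy problem. The sketch below is not the article's method and does not involve sequence alignment: it assumes a standard normal variable Z, estimates P(Z >= 8) by sampling from a proposal distribution N(8, 1) centered on the rare region, and reweights each sample by the likelihood ratio. The function name `tail_probability_is` and all parameters are hypothetical, chosen only for this illustration.

```python
import math
import random


def tail_probability_is(threshold, n_samples=100_000, seed=0):
    """Estimate P(Z >= threshold) for Z ~ N(0, 1) by importance sampling.

    Toy illustration only: samples come from the proposal N(threshold, 1),
    and each sample in the tail is weighted by phi(x) / q(x).
    Returns (estimate, standard_error).
    """
    rng = random.Random(seed)
    total = 0.0
    total_sq = 0.0
    for _ in range(n_samples):
        x = rng.gauss(threshold, 1.0)  # draw from the proposal distribution
        if x >= threshold:
            # Likelihood ratio of N(0, 1) to N(threshold, 1) at x:
            # exp(-x^2/2) / exp(-(x - t)^2/2) = exp(-t*x + t^2/2)
            w = math.exp(-threshold * x + 0.5 * threshold * threshold)
            total += w
            total_sq += w * w
    mean = total / n_samples
    var = max(total_sq / n_samples - mean * mean, 0.0)
    return mean, math.sqrt(var / n_samples)


estimate, std_err = tail_probability_is(8.0)
# Closed-form reference value for the normal tail, used only as a check.
exact = 0.5 * math.erfc(8.0 / math.sqrt(2.0))
print(f"importance sampling: {estimate:.3e} +/- {std_err:.1e}")
print(f"closed form:         {exact:.3e}")
```

Naive sampling from N(0, 1) would almost never produce a value above 8, so it could not resolve a probability this small with a practical number of draws. A p-value on the order of 10^−1314 is also far below the range of double-precision floating point, so any implementation at that scale would have to carry weights in log space or use extended-range arithmetic; this sketch does neither.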

This paper was cited by:

Large-deviation properties of largest component for random graphs
A. K. Hartmann
The European Physical Journal B. May 2011
Pharmacophore alignment search tool: Influence of canonical atom labeling on similarity searching
Volker Hähnke, Matthias Rupp, Mireille Krier, Friedrich Rippmann, Gisbert Schneider
Journal of Computational Chemistry. Nov 2010, Vol. 31, No. 15: 2810-2826
Exact Calculation of Distributions on Integers, with Application to Sequence Alignment
Lee A. Newberg, Charles E. Lawrence
Journal of Computational Biology. Jan 2009, Vol. 16, No. 1: 1-18
