Skip Navigation

Journal of Computational Biology

Not a subscriber? Get started...

Exact Calculation of Distributions on Integers, with Application to Sequence Alignment

To cite this article:
Lee A. Newberg and Charles E. Lawrence and. Journal of Computational Biology. January 2009, 16(1): 1-18. doi:10.1089/cmb.2008.0137.

Published in Volume: 16 Issue 1: January 4, 2009

Author information

Lee A. Newberg
Center for Bioinformatics, Wadsworth Center, New York State Department of Health, Albany, New York.
Department of Computer Science, Rensselaer Polytechnic Institute, Troy, New York.
Charles E. Lawrence
Center for Computational Molecular Biology, Brown University, Providence, Rhode Island.

ABSTRACT

Computational biology is replete with high-dimensional discrete prediction and inference problems. Dynamic programming recursions can be applied to several of the most important of these, including sequence alignment, RNA secondary-structure prediction, phylogenetic inference, and motif finding. In these problems, attention is frequently focused on some scalar quantity of interest, a score, such as an alignment score or the free energy of an RNA secondary structure. In many cases, score is naturally defined on integers, such as a count of the number of pairing differences between two sequence alignments, or else an integer score has been adopted for computational reasons, such as in the test of significance of motif scores. The probability distribution of the score under an appropriate probabilistic model is of interest, such as in tests of significance of motif scores, or in calculation of Bayesian confidence limits around an alignment. Here we present three algorithms for calculating the exact distribution of a score of this type; then, in the context of pairwise local sequence alignments, we apply the approach so as to find the alignment score distribution and Bayesian confidence limits.

Free first page

This paper was cited by:

A Classification of Bioinformatics Algorithms from the Viewpoint of Maximizing Expected Accuracy (MEA)
Michiaki Hamada, Kiyoshi Asai
Journal of Computational Biology. 2012 Online Ahead of Print
Abstract | Full Text PDF or HTML | Reprints | Permissions
Normalized global alignment for protein sequences
Guillermo Peris, Andrés Marzal
Journal of Theoretical Biology. Sep 2011
CrossRef
Construction of co-complex score matrix for protein complex prediction from AP-MS data
Z. Xie, C. K. Kwoh, X.-L. Li, M. Wu
Bioinformatics. Jul 2011, Vol. 27, No. 13: i159-i166
CrossRef
Improving the accuracy of predicting secondary structure for aligned RNA sequences
M. Hamada, K. Sato, K. Asai
Nucleic Acids Research. Jan 2011, Vol. 39, No. 2: 393-402
CrossRef
A computational procedure for functional characterization of potential marker genes from molecular data: Alzheimer's as a case study
Margherita Squillario, Annalisa Barla
BMC Medical Genomics. Jan 2011, Vol. 4, No. 1: 55
CrossRef
Fewer permutations, more accurate P-values
T. A. Knijnenburg, L. F. A. Wessels, M. J. T. Reinders, I. Shmulevich
Bioinformatics. Jun 2009, Vol. 25, No. 12: i161-i168
CrossRef

Supplementary Material

Users who read this article also read

no access
Seung-A Annie Jin
CyberPsychology & Behavior. December 2009: 761-765.
Abstract | Full Text PDF | Reprints | Permissions
full access
Kilian Bartholomé, Clemens Kreutz, Jens Timmer
Journal of Computational Biology. July 2009: 959-967.
Abstract | Full Text PDF | Reprints | Permissions
full access  
Lee A. Newberg
Journal of Computational Biology. November 2008: 1187-1194.
Abstract | Full Text PDF | Supplementary Material | Reprints | Permissions
full access
Marília D.V. Braga, Jens Stoye
Journal of Computational Biology. September 2010: 1145-1165.
Abstract | Full Text PDF or HTML | Reprints | Permissions
full access
Andre S. Ribeiro, Olli-Pekka Smolander, Tiina Rajala, Antti Häkkinen, Olli Yli-Harja
Journal of Computational Biology. April 2009: 539-553.
Abstract | Full Text PDF | Reprints | Permissions
full access
Arvind Gupta, Mohammad M. Karimi, Ján Maňuch, Ladislav Stacho, Xiaohong Zhao
Journal of Computational Biology. October 2010: 1435-1449.
Abstract | Full Text PDF or HTML | Reprints | Permissions

Sign up for TOC Alerts


Publication Tools

  • Related articles in Liebert Online

Search:

for

Authors:

Keywords:

Go to Advanced Search