Generic placeholder image

Protein & Peptide Letters

Editor-in-Chief

ISSN (Print): 0929-8665
ISSN (Online): 1875-5305

Loose and Strict Repeats in Weighted Sequences of Proteins

Author(s): Hui Zhang, Qing Guo, Jing Fan and Costas S. Iliopoulos

Volume 17, Issue 9, 2010

Page: [1136 - 1142] Pages: 7

DOI: 10.2174/092986610791760324

Price: $65

Abstract

A weighted sequence is a string in which a set of characters may appear at each position with respective probabilities of occurrence. Weighted sequences are able to summarize poorly defined short sequences, as well as the profiles of protein families and complete chromosome sequences. Thus it is of biological and theoretical significance to design powerful algorithms on weighted sequences. A common task is to identify repetitive motifs in weighted sequences, with presence probability not less than a given threshold. We define two types of repeats in weighted sequences, called the loose repeats and the strict repeats, respectively, and then attempt to locate these repeats. Using an iterative partitioning technique, we present algorithms for computing all the loose repeats and strict repeats of every length, respectively. Each solution costs O(n2)time.

Keywords: Border check array, loose repeat, partitioning, strict repeat, structural overlap, weighted sequence


Rights & Permissions Print Cite
© 2024 Bentham Science Publishers | Privacy Policy