Accurate and efficient methods to improve multiple circular sequence alignment

Carl Barton; Costas S. Iliopoulos; Ritu Kundu; Solon P. Pissis; Ahmad Retha; Fatima Vayani

doi:10.1007/978-3-319-20086-6_19

Accurate and efficient methods to improve multiple circular sequence alignment

Carl Barton, Costas S. Iliopoulos, Ritu Kundu, Solon P. Pissis, Ahmad Retha, Fatima Vayani

Informatics

Research output: Chapter in Book/Report/Conference proceeding › Chapter › peer-review

16 Citations (Scopus)

Abstract

Multiple sequence alignment is a core computational task in bioinformatics and has been extensively studied over the past decades. This computation requires an implicit assumption on the input data: the left- and right-most position for each sequence is relevant. However, this is not the case for circular structures; for instance, MtDNA. Efforts have been made to address this issue but it is far from being solved. We have very recently introduced a fast algorithm for approximate circular string matching (Barton et al., Algo Mol Biol, 2014). Here, we first show how to extend this algorithm for approximate circular dictionary matching; and, then, apply this solution with agglomerative hierarchical clustering to find a sufficiently good rotation for each sequence. Furthermore, we propose an alternative method that is suitable for more divergent sequences. We implemented these methods in BEAR, a programme for improving multiple circular sequence alignment. Experimental results, using real and synthetic data, show the high accuracy and efficiency of these new methods in terms of the inferred likelihood-based phylogenies.

Original language	English
Title of host publication	Experimental Algorithms
Subtitle of host publication	14th International Symposium, SEA 2015, Paris, France, June 29 – July 1, 2015, Proceedings
Editors	E. Bampis
Publisher	Springer International Publishing Switzerland
Pages	247-258
Number of pages	12
ISBN (Electronic)	9783319200866
ISBN (Print)	9783319200859
DOIs	https://doi.org/10.1007/978-3-319-20086-6_19
Publication status	Published - 20 Jun 2015

Publication series

Name	Lecture Notes in Computer Science
Publisher	Springer International Publishing Switzerland
Volume	9125

Access to Document

10.1007/978-3-319-20086-6_19

Cite this

Barton, C., Iliopoulos, C. S., Kundu, R., Pissis, S. P., Retha, A., & Vayani, F. (2015). Accurate and efficient methods to improve multiple circular sequence alignment. In E. Bampis (Ed.), Experimental Algorithms: 14th International Symposium, SEA 2015, Paris, France, June 29 – July 1, 2015, Proceedings (pp. 247-258). (Lecture Notes in Computer Science; Vol. 9125). Springer International Publishing Switzerland. https://doi.org/10.1007/978-3-319-20086-6_19

Barton, Carl ; Iliopoulos, Costas S. ; Kundu, Ritu et al. / Accurate and efficient methods to improve multiple circular sequence alignment. Experimental Algorithms: 14th International Symposium, SEA 2015, Paris, France, June 29 – July 1, 2015, Proceedings. editor / E. Bampis. Springer International Publishing Switzerland, 2015. pp. 247-258 (Lecture Notes in Computer Science).

@inbook{6d239f383cde40428cfbdaff6e65ca4f,

title = "Accurate and efficient methods to improve multiple circular sequence alignment",

abstract = "Multiple sequence alignment is a core computational task in bioinformatics and has been extensively studied over the past decades. This computation requires an implicit assumption on the input data: the left- and right-most position for each sequence is relevant. However, this is not the case for circular structures; for instance, MtDNA. Efforts have been made to address this issue but it is far from being solved. We have very recently introduced a fast algorithm for approximate circular string matching (Barton et al., Algo Mol Biol, 2014). Here, we first show how to extend this algorithm for approximate circular dictionary matching; and, then, apply this solution with agglomerative hierarchical clustering to find a sufficiently good rotation for each sequence. Furthermore, we propose an alternative method that is suitable for more divergent sequences. We implemented these methods in BEAR, a programme for improving multiple circular sequence alignment. Experimental results, using real and synthetic data, show the high accuracy and efficiency of these new methods in terms of the inferred likelihood-based phylogenies.",

author = "Carl Barton and Iliopoulos, {Costas S.} and Ritu Kundu and Pissis, {Solon P.} and Ahmad Retha and Fatima Vayani",

year = "2015",

month = jun,

day = "20",

doi = "10.1007/978-3-319-20086-6_19",

language = "English",

isbn = "9783319200859",

series = "Lecture Notes in Computer Science",

publisher = "Springer International Publishing Switzerland",

pages = "247--258",

editor = "E. Bampis",

booktitle = "Experimental Algorithms",

}

Barton, C , Iliopoulos, CS , Kundu, R , Pissis, SP , Retha, A & Vayani, F 2015, Accurate and efficient methods to improve multiple circular sequence alignment. in E Bampis (ed.), Experimental Algorithms: 14th International Symposium, SEA 2015, Paris, France, June 29 – July 1, 2015, Proceedings. Lecture Notes in Computer Science, vol. 9125, Springer International Publishing Switzerland, pp. 247-258. https://doi.org/10.1007/978-3-319-20086-6_19

Accurate and efficient methods to improve multiple circular sequence alignment. / Barton, Carl ; Iliopoulos, Costas S.; Kundu, Ritu et al.
Experimental Algorithms: 14th International Symposium, SEA 2015, Paris, France, June 29 – July 1, 2015, Proceedings. ed. / E. Bampis. Springer International Publishing Switzerland, 2015. p. 247-258 (Lecture Notes in Computer Science; Vol. 9125).

Research output: Chapter in Book/Report/Conference proceeding › Chapter › peer-review

TY - CHAP

T1 - Accurate and efficient methods to improve multiple circular sequence alignment

AU - Barton, Carl

AU - Iliopoulos, Costas S.

AU - Kundu, Ritu

AU - Pissis, Solon P.

AU - Retha, Ahmad

AU - Vayani, Fatima

PY - 2015/6/20

Y1 - 2015/6/20

N2 - Multiple sequence alignment is a core computational task in bioinformatics and has been extensively studied over the past decades. This computation requires an implicit assumption on the input data: the left- and right-most position for each sequence is relevant. However, this is not the case for circular structures; for instance, MtDNA. Efforts have been made to address this issue but it is far from being solved. We have very recently introduced a fast algorithm for approximate circular string matching (Barton et al., Algo Mol Biol, 2014). Here, we first show how to extend this algorithm for approximate circular dictionary matching; and, then, apply this solution with agglomerative hierarchical clustering to find a sufficiently good rotation for each sequence. Furthermore, we propose an alternative method that is suitable for more divergent sequences. We implemented these methods in BEAR, a programme for improving multiple circular sequence alignment. Experimental results, using real and synthetic data, show the high accuracy and efficiency of these new methods in terms of the inferred likelihood-based phylogenies.

AB - Multiple sequence alignment is a core computational task in bioinformatics and has been extensively studied over the past decades. This computation requires an implicit assumption on the input data: the left- and right-most position for each sequence is relevant. However, this is not the case for circular structures; for instance, MtDNA. Efforts have been made to address this issue but it is far from being solved. We have very recently introduced a fast algorithm for approximate circular string matching (Barton et al., Algo Mol Biol, 2014). Here, we first show how to extend this algorithm for approximate circular dictionary matching; and, then, apply this solution with agglomerative hierarchical clustering to find a sufficiently good rotation for each sequence. Furthermore, we propose an alternative method that is suitable for more divergent sequences. We implemented these methods in BEAR, a programme for improving multiple circular sequence alignment. Experimental results, using real and synthetic data, show the high accuracy and efficiency of these new methods in terms of the inferred likelihood-based phylogenies.

U2 - 10.1007/978-3-319-20086-6_19

DO - 10.1007/978-3-319-20086-6_19

M3 - Chapter

SN - 9783319200859

T3 - Lecture Notes in Computer Science

SP - 247

EP - 258

BT - Experimental Algorithms

A2 - Bampis, E.

PB - Springer International Publishing Switzerland

ER -

Barton C , Iliopoulos CS , Kundu R , Pissis SP , Retha A , Vayani F. Accurate and efficient methods to improve multiple circular sequence alignment. In Bampis E, editor, Experimental Algorithms: 14th International Symposium, SEA 2015, Paris, France, June 29 – July 1, 2015, Proceedings. Springer International Publishing Switzerland. 2015. p. 247-258. (Lecture Notes in Computer Science). doi: 10.1007/978-3-319-20086-6_19

Accurate and efficient methods to improve multiple circular sequence alignment

Abstract

Publication series

Access to Document

Fingerprint

Cite this