IUPACpal: efficient identification of inverted repeats in IUPAC-encoded DNA sequences

Alamro, Hayam; Alzamel, Mai; Iliopoulos, Costas; Pissis, Solon; Watts, Steven

doi:10.1186/s12859-021-03983-2

H. Alamro (Hayam), M. Alzamel (Mai), C.S. Iliopoulos (Costas), S. Pissis (Solon) and S. Watts (Steven)

2021-02-06


Background: An inverted repeat is a DNA sequence followed downstream by its reverse complement, potentially with a gap in the centre. Inverted repeats are found in both prokaryotic and eukaryotic genomes, and they have been linked with countless possible functions. Many international consortia provide a comprehensive description of common genetic variation, making alternative sequence representations, such as IUPAC encoding, necessary for leveraging the full potential of such broad variation datasets.
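To make the definition above concrete, here is a minimal sketch in Python that checks whether a string over plain A/C/G/T is a left arm, an optional gap, and then the reverse complement of that arm. The function names and the `arm_len` and `gap` parameters are illustrative assumptions and are not taken from the paper or from the IUPACpal software.

```python
# Illustrative only: plain A/C/G/T, exact matching, no IUPAC codes.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Complement every base, then reverse the string."""
    return seq.translate(COMPLEMENT)[::-1]


def is_inverted_repeat(seq: str, arm_len: int, gap: int = 0) -> bool:
    """True if seq is: left arm, `gap` arbitrary bases, reverse complement of the left arm."""
    if arm_len <= 0 or gap < 0 or len(seq) != 2 * arm_len + gap:
        return False
    left = seq[:arm_len]
    right = seq[arm_len + gap:]
    return right == reverse_complement(left)
```

For example, `GAATTC` is an inverted repeat with arms of length 3 and no gap, because the reverse complement of `GAA` is `TTC`.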

Results: We present IUPACpal, an exact tool for efficient identification of inverted repeats in IUPAC-encoded DNA sequences allowing also for potential mismatches and gaps in the inverted repeats.
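The following naive Python sketch illustrates what an inverted repeat with mismatches and a gap could look like over the IUPAC alphabet. It is not the algorithm used by IUPACpal. The pairing rule is an assumption made for illustration: two IUPAC symbols are treated as pairing if at least one base denoted by the first has its complement among the bases denoted by the second. All function and parameter names are hypothetical.

```python
# Standard IUPAC nucleotide codes and the bases each one denotes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT",
    "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG",
    "N": "ACGT",
}
BASE_COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def can_pair(x: str, y: str) -> bool:
    """Assumed rule: some base denoted by x has its complement among the bases denoted by y."""
    return any(BASE_COMPLEMENT[b] in IUPAC[y] for b in IUPAC[x])


def arm_mismatches(seq: str, start: int, arm_len: int, gap: int = 0) -> int:
    """Count non-pairing positions between the left arm and the right arm.

    The left arm is seq[start:start + arm_len]; the right arm follows after
    `gap` symbols and is read backwards, from its last symbol inwards.
    """
    right_end = start + 2 * arm_len + gap - 1
    if start < 0 or arm_len <= 0 or gap < 0 or right_end >= len(seq):
        raise ValueError("arms do not fit inside the sequence")
    return sum(
        not can_pair(seq[start + i], seq[right_end - i])
        for i in range(arm_len)
    )


def is_iupac_inverted_repeat(seq, start, arm_len, gap=0, max_mismatches=0):
    """True if the two arms pair up with at most `max_mismatches` mismatches."""
    return arm_mismatches(seq, start, arm_len, gap) <= max_mismatches
```

Under this assumed rule, `GARTYC` counts as an inverted repeat with no mismatches, since `R` (A or G) can pair with `Y` (C or T). This check costs time proportional to the arm length for each candidate position, so it only shows the matching condition and says nothing about how to search a sequence efficiently.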

Conclusion: Within the parameters that were tested, our experimental results show that IUPACpal compares favourably to a similar application packaged with EMBOSS. We show that IUPACpal identifies many inverted repeats that EMBOSS does not identify, and that it does so orders of magnitude faster.

Additional Metadata
Keywords	Inverted repeat, Palindrome, Gaps, Mismatches, Software, IUPAC
Persistent URL	doi.org/10.1186/s12859-021-03983-2
Journal	BMC Bioinformatics
Project	Pan-genome Graph Algorithms and Data Integration
Grant	This work was funded by the European Commission Horizon 2020 Programme; grant id h2020/872539 - Pan-genome Graph Algorithms and Data Integration (PANGAIA)
Organisation	Centrum Wiskunde & Informatica, Amsterdam (CWI), The Netherlands
Citation	Alamro, H., Alzamel, M., Iliopoulos, C., Pissis, S., & Watts, S. (2021). IUPACpal: efficient identification of inverted repeats in IUPAC-encoded DNA sequences. BMC Bioinformatics, 22. doi:10.1186/s12859-021-03983-2


See Also
software IUPACpal S. Watts (Steven)
