☆ 4.0 Article

Approximation algorithm for rearrangement distances considering repeated genes and intergenic regions

ALGORITHMS FOR MOLECULAR BIOLOGY (2021)

Journal

ALGORITHMS FOR MOLECULAR BIOLOGY

Volume 16, Issue 1, Pages -

Publisher

BMC

DOI: 10.1186/s13015-021-00200-w

Keywords

Genome rearrangement; Intergenic regions; Reversal

Funding

National Council of Technological and Scientific Development, CNPq [425340/2016-3]
Coordenacao de Aperfeiaoamento de Pessoal de Nivel Superior - Brasil (CAPES) [001]
Sao Paulo Research Foundation, FAPESP [2013/08293-7, 2015/11937-9, 2017/12646-3, 2019/27331-3]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

The rearrangement distance is a method for comparing genomes of different species by determining the number of rearrangement events needed to transform one genome into another. This study explores the effects of transposition and reversal events on genome representation, considering gene repetition and intergenic regions. Practical experiments on simulated genomes show that using partitions can improve distance estimates.

The rearrangement distance is a method to compare genomes of different species. Such distance is the number of rearrangement events necessary to transform one genome into another. Two commonly studied events are the transposition, which exchanges two consecutive blocks of the genome, and the reversal, which reverts a block of the genome. When dealing with such problems, seminal works represented genomes as sequences of genes without repetition. More realistic models started to consider gene repetition or the presence of intergenic regions, sequences of nucleotides between genes and in the extremities of the genome. This work explores the transposition and reversal events applied in a genome representation considering both gene repetition and intergenic regions. We define two problems called Minimum Common Intergenic String Partition and Reverse Minimum Common Intergenic String Partition. Using a relation with these two problems, we show a Theta(k)-approximation for the Intergenic Transposition Distance, the Intergenic Reversal Distance, and the Intergenic Reversal and Transposition Distance problems, where k is the maximum number of copies of a gene in the genomes. Our practical experiments on simulated genomes show that the use of partitions improves the estimates for the distances.

Authors

I am an author on this paper

Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.0

Not enough ratings

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Signed rearrangement distances considering repeated genes, intergenic regions, and indels

Gabriel Siqueira, Alexsandro Oliveira Alexandrino, Zanoni Dias

Summary: Genome rearrangement distance problems estimate the evolutionary distance between genomes. This study introduces a new model considering intergenic regions and multiple copies of genes. It proposes a series of problems and approximation algorithms, and demonstrates their effectiveness through experimental tests.

JOURNAL OF COMBINATORIAL OPTIMIZATION (2023)