Journal
JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY
Volume 19, Issue 6, Pages -Publisher
WORLD SCIENTIFIC PUBL CO PTE LTD
DOI: 10.1142/S021972002140014X
Keywords
Rearrangements; double-cut-and-join; indels; orthology inference
Ask authors/readers for more resources
A new efficient ILP formulation was proposed to compute the rearrangement distance of genomes in a family-free setting, showing potential for inferring ortholog groups across multiple species. The method computes optimal gene matchings and integrates them into gene families, achieving promising results on simulated and real data.
Recently, we proposed an efficient ILP formulation [Rubert DP, Martinez FV, Braga MDV, Natural family-free genomic distance, Algorithms Mol Biol 16:4, 2021] for exactly computing the rearrangement distance of two genomes in a family-free setting. In such a setting, neither prior classification of genes into families, nor further restrictions on the genomes are imposed. Given two genomes, the mentioned ILP computes an optimal matching of the genes taking into account simultaneously local mutations, given by gene similarities, and large-scale genome rearrangements. Here, we explore the potential of using this ILP for inferring groups of orthologs across several species. More precisely, given a set of genomes, our method first computes all pairwise optimal gene matchings, which are then integrated into gene families in the second step. Our approach is implemented into a pipeline incorporating the pre-computation of gene similarities. It can be downloaded from gitlab.ub.uni-bielefeld.de/gi/FFGC. We obtained promising results with experiments on both simulated and real data.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available