4.2 Article Proceedings Paper

The potential of family-free rearrangements towards gene orthology inference

Journal

Publisher

WORLD SCIENTIFIC PUBL CO PTE LTD
DOI: 10.1142/S021972002140014X

Keywords

Rearrangements; double-cut-and-join; indels; orthology inference

Ask authors/readers for more resources

A new efficient ILP formulation was proposed to compute the rearrangement distance of genomes in a family-free setting, showing potential for inferring ortholog groups across multiple species. The method computes optimal gene matchings and integrates them into gene families, achieving promising results on simulated and real data.
Recently, we proposed an efficient ILP formulation [Rubert DP, Martinez FV, Braga MDV, Natural family-free genomic distance, Algorithms Mol Biol 16:4, 2021] for exactly computing the rearrangement distance of two genomes in a family-free setting. In such a setting, neither prior classification of genes into families, nor further restrictions on the genomes are imposed. Given two genomes, the mentioned ILP computes an optimal matching of the genes taking into account simultaneously local mutations, given by gene similarities, and large-scale genome rearrangements. Here, we explore the potential of using this ILP for inferring groups of orthologs across several species. More precisely, given a set of genomes, our method first computes all pairwise optimal gene matchings, which are then integrated into gene families in the second step. Our approach is implemented into a pipeline incorporating the pre-computation of gene similarities. It can be downloaded from gitlab.ub.uni-bielefeld.de/gi/FFGC. We obtained promising results with experiments on both simulated and real data.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.2
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available