4.6 Article

Size-Constrained Regression Test Case Selection Using Multicriteria Optimization

Journal

IEEE TRANSACTIONS ON SOFTWARE ENGINEERING
Volume 38, Issue 4, Pages 936-956

Publisher

IEEE COMPUTER SOC
DOI: 10.1109/TSE.2011.56

Keywords

Software regression testing; test case selection; integer programming; Pareto optimality

To ensure that a modified software system has not regressed, one approach is to rerun existing test cases. However, this is a potentially costly task. To mitigate the costs, the testing effort can be optimized by executing only a selected subset of the test cases that are believed to have a better chance of revealing faults. This paper proposes a novel approach for selecting and ordering a predetermined number of test cases from an existing test suite. Our approach forms an Integer Linear Programming problem using two different coverage-based criteria, and uses constraint relaxation to find many close-to-optimal solution points. These points are then combined to obtain a final solution using a voting mechanism. The selected subset of test cases is then prioritized using a greedy algorithm that maximizes minimum coverage in an iterative manner. The proposed approach has been empirically evaluated and the results show significant improvements over existing approaches for some cases and comparable results for the rest. Moreover, our approach provides more consistency compared to existing approaches.
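The final prioritization step can be illustrated with a small sketch. The abstract does not give the algorithm's details, so the following is only one plausible reading of "a greedy algorithm that maximizes minimum coverage in an iterative manner", not the authors' implementation. It assumes the subset of test cases has already been selected (the Integer Linear Programming and voting stages are not shown). The function name `prioritize_max_min`, the coverage map, and the tie-breaking rules are all hypothetical.

```python
def prioritize_max_min(coverage):
    """Greedily order tests so the least-covered element is raised first.

    `coverage` maps a test name to the set of program elements it covers.
    This is an illustrative assumption, not the paper's exact procedure.
    """
    # All elements covered by at least one selected test.
    elements = sorted(set().union(*coverage.values()))
    counts = {e: 0 for e in elements}   # times each element is covered so far
    remaining = dict(coverage)
    order = []

    def score(name):
        # Coverage counts after adding `name`, sorted ascending, so that
        # list comparison looks at the minimum first, then the next
        # smallest, and so on (a "leximin" comparison).
        return sorted(counts[e] + (e in remaining[name]) for e in elements)

    while remaining:
        # Iterating in sorted name order makes ties resolve to the
        # alphabetically first test, since max() keeps the first maximum.
        best = max(sorted(remaining), key=score)
        for e in remaining.pop(best):
            counts[e] += 1
        order.append(best)
    return order


# Hypothetical coverage data for four already-selected tests.
selected = {
    "t1": {"a", "b"},
    "t2": {"b", "c"},
    "t3": {"c"},
    "t4": {"a", "b", "c"},
}
print(prioritize_max_min(selected))
```

Comparing the whole sorted count vector, rather than only its minimum, is a design choice made here so that early iterations (where the minimum is still zero for every candidate) still distinguish between tests.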

Recommended

Article Biochemistry & Molecular Biology

Beyond DNA barcoding: The unrealized potential of genome skim data in sample identification

Kristine Bohmann, Siavash Mirarab, Vineet Bafna, M. Thomas P. Gilbert

MOLECULAR ECOLOGY (2020)

Article Biochemistry & Molecular Biology

Log Transformation Improves Dating of Phylogenies

Uyen Mai, Siavash Mirarab

Summary: This paper introduces a new dating method wLogDate, which formulates dating as a nonconvex optimization problem, minimizing the variance of log-transformed rate multipliers across the tree. The method is shown to be more accurate and robust to various model assumptions than alternatives on simulated and real data.

MOLECULAR BIOLOGY AND EVOLUTION (2021)

Correction Biochemistry & Molecular Biology

ASTRAL-Pro: Quartet-Based Species-Tree Inference despite Paralogy (vol 37, pg 3292, 2020)

Chao Zhang, Celine Scornavacca, Erin K. Molloy, Siavash Mirarab

MOLECULAR BIOLOGY AND EVOLUTION (2021)

Article Ecology

TAPER: Pinpointing errors in multiple sequence alignments despite varying rates of evolution

Chao Zhang, Yiming Zhao, Edward L. Braun, Siavash Mirarab

Summary: The text discusses the issue of erroneous data in sequence datasets and the need for automatic error detection methods as datasets grow larger. It introduces the TAPER method, which detects errors in species-specific stretches of sequence alignments to improve accuracy in downstream analyses.

METHODS IN ECOLOGY AND EVOLUTION (2021)

Article Biochemistry & Molecular Biology

Fast and accurate distance-based phylogenetic placement using divide and conquer

Metin Balaban, Yueyu Jiang, Daniel Roush, Qiyun Zhu, Siavash Mirarab

Summary: This study introduces a distance-based phylogenetic placement method called APPLES-2, which is more accurate and scalable than existing methods. Through validation using a large dataset, it is shown that 97% of query genomes can be accurately placed within three branches of the optimal position in the species tree using 50 marker genes.

MOLECULAR ECOLOGY RESOURCES (2022)

Article Biochemical Research Methods

Estimating repeat spectra and genome length from low-coverage genome skims with RESPECT

Shahab Sarmashghi, Metin Balaban, Eleonora Rachtman, Behrouz Touri, Siavash Mirarab, Vineet Bafna

Summary: The cost of genome sequencing is dropping faster than genome assembly and completion. The use of lightly sampled genomes and k-mers has advantages in identifying and phylogenetically placing eukaryotic species. A novel constrained optimization method can provide reliable estimates of genome length and repeat content.

PLOS COMPUTATIONAL BIOLOGY (2021)

Article Biochemical Research Methods

QuCo: quartet-based co-estimation of species trees and gene trees

Maryam Rabiee, Siavash Mirarab

Summary: The paper introduces a scalable likelihood-based approach called quartet co-estimation for co-estimation under the multi-species coalescent model. By independently inferring gene tree distributions and computing species tree topology and branch length, the method updates gene tree posterior probabilities based on the resulting species tree. Experimental results on simulations and a biological dataset demonstrate better accuracy compared to traditional methods.

BIOINFORMATICS (2022)

Article Biology

Distance-Based Phylogenetic Placement with Statistical Support

Navid Bin Hasan, Metin Balaban, Avijit Biswas, Md Shamsuzzoha Bayzid, Siavash Mirarab

Summary: Phylogenetic identification of unknown sequences through tree placement is commonly used in ecological studies. This article addresses the issue of uncertainty in placements obtained from incomplete and noisy data. Nonparametric bootstrapping is found to be the most accurate method for measuring support, and an efficient linear algebraic formulation for bootstrapping is presented. The article also compares the accuracy of maximum likelihood support values and distance-based methods in different applications and datasets.

BIOLOGY-BASEL (2022)

Article Biochemical Research Methods

ASTRAL-Pro 2: ultrafast species tree reconstruction from multi-copy gene family trees

Chao Zhang, Siavash Mirarab

Summary: ASTRAL-Pro 2 is a more efficient version of ASTRAL-Pro, which enhances scalability while maintaining accuracy by adopting a placement-based optimization algorithm.

BIOINFORMATICS (2022)

Article Biochemistry & Molecular Biology

Weighting by Gene Tree Uncertainty Improves Accuracy of Quartet-based Species Trees

Chao Zhang, Siavash Mirarab

Summary: This paper introduces a threshold-free weighting scheme for quartet-based species tree inference, which improves the utility of summary methods and reduces incongruence with gene concatenation.

MOLECULAR BIOLOGY AND EVOLUTION (2022)

Article Biology

Learning Hyperbolic Embedding for Phylogenetic Tree Placement and Updates

Yueyu Jiang, Puoya Tabaghi, Siavash Mirarab

Summary: This paper demonstrates how the conventional Euclidean deep learning methods in phylogenetics can benefit from using hyperbolic geometry. The results show that hyperbolic embeddings have lower distance errors and can be used to update species trees.

BIOLOGY-BASEL (2022)

Article Biochemical Research Methods

Phylogenomic branch length estimation using quartets

Yasamin Tabatabaee, Chao Zhang, Tandy Warnow, Siavash Mirarab

Summary: This article introduces a new method, CASTLES, for estimating branch lengths on the species tree using expected values of gene tree branch lengths. The method improves on prior methods in terms of both speed and accuracy.

BIOINFORMATICS (2023)

Article Biotechnology & Applied Microbiology

Generation of accurate, expandable phylogenomic trees with uDance

Metin Balaban, Yueyu Jiang, Qiyun Zhu, Daniel McDonald, Rob Knight, Siavash Mirarab

Summary: Large, updatable phylogenetic trees are constructed using a divide-and-conquer strategy called uDance. This method enables high accuracy and scalability in inferring genome-wide evolutionary relationships by refining different parts of the tree independently. With uDance, a species tree of around 200,000 genomes was successfully inferred using 387 marker genes, representing 42.5 billion amino acid residues.

NATURE BIOTECHNOLOGY (2023)

Proceedings Paper Biotechnology & Applied Microbiology

Quantifying Hierarchical Conflicts in Homology Statements

Krister M. Swenson, Afif Elghraoui, Faramarz Valafar, Siavash Mirarab, Mathias Weller

Summary: This article introduces a test method to measure the hierarchical relationship between two sets of homology relationships provided by different software. The test can be used to check the feasibility of agglomerative syntenic block software and provide a mapping reference for downstream analysis. The research finds that it is rare for two collections of homology relationships to be perfectly hierarchically related, so an optimization problem is proposed to measure the distance between them, and a heuristic solution is given.

COMPARATIVE GENOMICS (RECOMB-CG 2022) (2022)

Proceedings Paper Biotechnology & Applied Microbiology

Fast and Accurate Branch Support Calculation for Distance-Based Phylogenetic Placements

Navid Bin Hasan, Avijit Biswas, Metin Balaban, Siavash Mirarab, Md. Shamsuzzoha Bayzid

Summary: Placing a new sequence onto an existing phylogenetic tree is important for various downstream applications. Existing methods often ignore the issue of uncertainty, but we have successfully estimated the distribution of placements using a distance-based approach and found that non-parametric bootstrapping is more accurate in estimating uncertainty.

COMPARATIVE GENOMICS (RECOMB-CG 2022) (2022)
