Article
Computer Science, Artificial Intelligence
Eman Abdullah AlOmar, Anthony Peruma, Mohamed Wiem Mkaouer, Christian Newman, Ali Ouni, Marouane Kessentini
Summary: Recent studies show that developers refactor for many reasons beyond improving software design and fixing code smells. Developers use a variety of patterns in commit messages to describe their refactoring activities. The distribution of refactoring operations differs between production and test files.
EXPERT SYSTEMS WITH APPLICATIONS
(2021)
Article
Computer Science, Artificial Intelligence
Dan Anitei, Joan Andreu Sanchez, Jose Miguel Benedi, Ernesto Noya
Summary: This paper introduces the IBEM dataset, which consists of 600 scientific documents, over 8200 page images, and more than 160000 mathematical expressions. The dataset serves as a benchmark for mathematical expression recognition and searching. The paper also presents baseline experiments for mathematical symbol classification and mathematical expression recognition.
PATTERN RECOGNITION LETTERS
(2023)
Article
Computer Science, Software Engineering
Eman Abdullah AlOmar, Jiaqian Liu, Kenneth Addo, Mohamed Wiem Mkaouer, Christian Newman, Ali Ouni, Zhe Yu
Summary: This paper examines how developers document their refactoring activities in commit messages. By mining the text of commit messages, the study analyzes patterns specific to each refactoring type and designs a model to predict the type of refactoring performed at the commit level. The findings show that rename method and extract method are well documented, while pull-up method and push-down method are the hardest to identify from textual descriptions.
AUTOMATED SOFTWARE ENGINEERING
(2022)
Article
Computer Science, Software Engineering
Melina Vidoni
Summary: This study analyzes the quality of documentation in open-source R packages and investigates how R package developers perceive documentation. The results show that incomplete documentation is the most common issue, and developers focus more on common use cases and parameter documentation.
JOURNAL OF SYSTEMS AND SOFTWARE
(2022)
Article
Computer Science, Artificial Intelligence
Huy Quang Ung, Cuong Tuan Nguyen, Khanh Minh Phan, Vu Tran Minh Khuong, Masaki Nakagawa
Summary: This paper proposes a bag-of-features for clustering online handwritten mathematical expressions and introduces a distance-based representation (DbR) method to reduce dimensionality. Experimental results show that combining low-level and high-level features with DbR can improve performance and reduce marking cost.
PATTERN RECOGNITION LETTERS
(2021)
Article
Computer Science, Information Systems
Woubshet Behutiye, Pilar Rodriguez, Markku Oivo
Summary: In agile software development, quality requirements (QRs) often lack detailed specifications and documentation. This study developed the Agile QR-Doc guidelines to support QR documentation. Through surveys and discussions with software practitioners, the guidelines were validated and found to be relevant, understandable, and useful in supporting QR documentation in agile software development.
Review
Biochemical Research Methods
Anna C. Haber, Ulrich Sax, Fabian Prasser
Summary: Precision medicine relies on molecular and systems biology methods, as well as bidirectional association studies between phenotypes and genomic data. However, the integrated use of these data faces obstacles, particularly in terms of data protection. Anonymization, using mathematical models to quantify risks and guide reductions, can provide an alternative for analyzing phenotype data. Open software libraries and tools can serve as robust alternatives, but the complexity and heterogeneity of available anonymization tools make it difficult to assess their strengths and weaknesses. This systematic review aims to provide recommendations for anonymizing phenotype datasets with different properties by evaluating open anonymization tools described in the literature between 1990 and 2021.
BRIEFINGS IN BIOINFORMATICS
(2022)
Article
Computer Science, Artificial Intelligence
Sreekanth Madisetty, Kaushal Kumar Maurya, Akiko Aizawa, Maunendra Sankar Desarkar
Summary: Scientific documents often contain multiple mathematical expressions, and detecting inline mathematical expressions is an important and challenging task in scientific text mining. While some recent studies have approached this problem from an image processing perspective, there is limited research from the NLP perspective. In this study, a hybrid algorithm combining CRF and Bi-LSTM is proposed to detect inline mathematical expressions in scientific documents. It outperforms both the baselines and the individual methods that make up the hybrid approach.
Article
Engineering, Electrical & Electronic
Momcilo V. Krunic
Summary: Documentation as Code (DaC) is an approach that applies software development principles to technical documentation, improving accuracy, traceability, and maintainability. In the automotive industry, DaC has been successfully used to document various aspects of vehicle development, with case studies and examples showcasing its benefits in improving documentation quality and maintainability.
ELEKTRONIKA IR ELEKTROTECHNIKA
(2023)
Article
Computer Science, Information Systems
Alisson Solitto Da Silva, Rogerio Eduardo Garcia, Leonardo Castro Botega
Summary: This paper proposes a bug location model that leverages semantic architectural knowledge through ontologies to improve bug localization accuracy and efficiency. The model combines machine learning methods with architectural semantic knowledge to infer new knowledge and retrieve information from bug reports.
Article
Computer Science, Artificial Intelligence
Vinay Sakshi, Vinay Kukreja
Summary: The study provides a comprehensive review of machine-based recognition systems for mathematical text, including both published and grey literature. Through analysis and investigation of 262 instances, it is found that 52% of grey literature focuses on mobile applications and user interfaces, while white literature is mainly concentrated in 39 different conferences. Furthermore, a list of challenges and open issues is extracted to guide future research directions.
ARTIFICIAL INTELLIGENCE REVIEW
(2023)
Article
Multidisciplinary Sciences
Wei Su, Chuan Cai, Paul S. Wang, Hengjie Li, Zhen Huang, Qiang Huang
Summary: The paper discusses the complexity of mathematical expressions as a measure for comparison, with factors including representational length, computational time, and intelligibility. It introduces binary lambda calculus and rule-based algebraic computation methods for calculating complexity. The application of mathematical expression complexity in MACP is also addressed, offering benefits in computer aided assessment for calculating correct answers, verifying equivalence, checking user responses, and providing automatic grading.
Article
Computer Science, Artificial Intelligence
Wentao Ding, Jianhao Chen, Longfei E, Jinmao Li, Yuzhong Qu
Summary: Existing approaches to normalizing time expressions rely heavily on predefined rules or grammars, which are time-consuming to build and do not adapt well to different scenarios. In this article, we propose a novel idea of representing time expressions as update operations, allowing the normalization task to be modeled as sequence prediction. We introduce DNPTime, a distantly supervised neural parsing approach that utilizes the automatic rule generation method ARTime to construct training samples for fine-tuning T5 models. Experimental results demonstrate that DNPTime achieves the best average performance across different benchmarks with minimal human intervention.
KNOWLEDGE-BASED SYSTEMS
(2023)
Article
Computer Science, Information Systems
Di Cui
Summary: This paper proposes a method to identify flawed structural dependencies early in software evolution by combining structural and semantic dependencies, helping developers save maintenance costs by fixing issues in time. The evaluation results from 838 releases of 15 open source projects showed that the method can accurately capture files prone to errors or changes, with a high precision rate of 92%. The identified dependencies also showed significantly higher bug frequencies and change frequencies in future versions compared to the average.
Article
Computer Science, Artificial Intelligence
Kalliopi Kravari, Christina Antoniou, Nick Bassiliades
Summary: Clearly defined requirements are crucial for system development. The SENSE framework proposes a standardized natural language expression approach that supports the flow-down procedure of requirements and provides validity and completeness verification.
Article
Computer Science, Software Engineering
Amel Mammar, Meriem Belguidoum, Saddam Hocine Hiba
Summary: This paper introduces a formal Event-B-based approach for modeling and verifying the deployment of component-based applications. By gradually refining an abstract model, a precise specification is built, and mathematical reasoning is used to prove its correctness. The presented approach validates the deployment in a cloud environment using ProB and ensures the construction of a correct system that meets the constraints.
SCIENCE OF COMPUTER PROGRAMMING
(2024)
Article
Computer Science, Software Engineering
Shuqi Liu, Yu Zhou, Longbing Ji, Tingting Han, Taolue Chen
Summary: In this paper, we propose a framework that combines GUI event deduplication with an adaptive semantic matching strategy to enhance the usability of reused tests. Experimental evaluation demonstrates that the framework improves widget mapping performance, significantly reduces event redundancy, and reduces the manual effort of creating tests for similar applications.
SCIENCE OF COMPUTER PROGRAMMING
(2024)
Article
Computer Science, Software Engineering
Xiangyu Mu, Lei Liu, Peng Zhang, Jingyao Li, Hui Li
Summary: The aim of this study is to reduce the size of the test case set required to detect the commutativity problem of the reduce function. By determining the pattern of the function and selecting corresponding test cases, the proposed test case generation strategy can achieve the same accuracy with a smaller test case set. The strategy is shown to be effective and to have a high recall rate.
SCIENCE OF COMPUTER PROGRAMMING
(2024)
Article
Computer Science, Software Engineering
Padmalata Nistala, Asha Rajbhoj, Vinay Kulkarni, Sapphire Noronha, Ankit Joshi
Summary: This paper presents an automated proposal development approach using a combination of model-based and AI-enabled techniques, and discusses the successful deployment and user feedback of the system.
SCIENCE OF COMPUTER PROGRAMMING
(2024)
Article
Computer Science, Software Engineering
Jacco O. G. Krijnen, Manuel M. T. Chakravarty, Gabriele Keller, Wouter Swierstra
Summary: Compiler correctness is a long-standing problem, and it becomes more significant with the rise of smart contracts on blockchains. A translation certification framework can address the trust issue for low-level code on the blockchain, allowing users to have confidence in the compilation process of smart contracts.
SCIENCE OF COMPUTER PROGRAMMING
(2024)
Article
Computer Science, Software Engineering
Phillip James, Faron Moller, Filippos Pantekis
Summary: OnTrack is a tool that supports railway verification workflows using model-driven engineering frameworks. It encapsulates formal methods so that railway engineers can interact with verification procedures.
SCIENCE OF COMPUTER PROGRAMMING
(2024)
Article
Computer Science, Software Engineering
Oleg Kiselyov
Summary: Heterogeneous metaprogramming systems leverage higher-level host languages to generate lower-level object language code, enabling faster production of high-performance code with correctness guarantees. This paper presents two systems with OCaml as the host language and C as the object language, discussing their implementation and applications.
SCIENCE OF COMPUTER PROGRAMMING
(2024)
Article
Computer Science, Software Engineering
Conor Reynolds, Rosemary Monahan
Summary: This paper provides a detailed approach to formalize a fragment of the theory of institutions in the Coq proof assistant. The approach is illustrated and evaluated by instantiating the framework with specific institution examples.
SCIENCE OF COMPUTER PROGRAMMING
(2024)
Article
Computer Science, Software Engineering
Herbert Rausch Fernandes, Giovanni Freitas Gomes, Antonio Carlos Pinheiro de Oliveira, Sergio Vale Aguiar Campos
Summary: Alzheimer's disease is a common form of dementia with no effective drug treatment available. In this study, a statistical model checking approach was used to analyze protein and drug interactions and evaluate the effects of different drugs on the components contributing to Alzheimer's disease. The results showed that rapamycin could slow down the biological process causing neuronal death, while LY294002 and NVP-BEZ235 may increase tau phosphorylation. These findings provide important insights for the scientific community and raise awareness about potential side effects of PI3K inhibitor drugs.
SCIENCE OF COMPUTER PROGRAMMING
(2024)
Article
Computer Science, Software Engineering
Erwan Mahe, Christophe Gaston, Pascale Le Gall
Summary: This paper presents an Interaction Language to encode Sequence Diagrams (SD) and associates it with three different formal semantics. This allows for direct formal verification of SD, while preserving traceability of SD concepts and executed actions, and addressing the translation of problematic operators.
SCIENCE OF COMPUTER PROGRAMMING
(2024)
Article
Computer Science, Software Engineering
Joan Giner-Miguelez, Abel Gomez, Jordi Cabot
Summary: Datasets are crucial for training and evaluating machine learning models, but they can also lead to undesirable behaviors like biased predictions. To tackle this issue, the machine learning community suggests adopting consistent guidelines for dataset descriptions. However, these guidelines rely on natural language descriptions, which hinder automated computation and analysis. To overcome this, we present DescribeML, a language engineering tool that provides precise, structured descriptions of machine learning datasets, including their composition, provenance, and social concerns.
SCIENCE OF COMPUTER PROGRAMMING
(2024)
Article
Computer Science, Software Engineering
Andrey Sadovykh, Bilal Said, Dragos Truscan, Hugo Bruneliere
Summary: In this paper, the authors report on their 7 years of practical experience with an iterative Model-based Requirements Engineering (MBRE) approach and language in five large European collaborative projects. They demonstrate through significant data sets that this model-based approach provides interesting benefits in terms of scalability, heterogeneity, adaptability, traceability, automation, consistency and quality, and usefulness or usability. Concrete examples from these projects are provided to illustrate the application of the MBRE approach and language, and the authors discuss the general benefits and limitations of using such an approach, as well as the lessons learned over the years.
SCIENCE OF COMPUTER PROGRAMMING
(2024)
Article
Computer Science, Software Engineering
Alfa Yohannis, Dimitris Kolovos, Antonio Garcia-Dominguez
Summary: Picto Web is a multi-tenant web-based tool that allows exploration of complex models by transforming them into various transient web-based views using rule-based transformations. It uses a lazy view computation approach to efficiently support large models and complex transformations, and includes monitoring and push notification facilities that automatically recompute views and deliver the updates to clients.
SCIENCE OF COMPUTER PROGRAMMING
(2024)
Article
Computer Science, Software Engineering
Enes Yigitbas, Maximilian Schmidt, Antonio Bucchiarone, Sebastian Gottschalk, Gregor Engels
Summary: UML has become a popular modeling language used in computer science courses, and various interactive learning applications have been developed to improve student engagement and learning outcomes. However, these applications have not successfully created immersive environments for students. Therefore, this study introduces GaMoVR, a VR-based and gamified learning environment, which provides an interactive and fun learning experience for students learning about UML modeling.
SCIENCE OF COMPUTER PROGRAMMING
(2024)
Article
Computer Science, Software Engineering
Yaxin Zhao, Lina Gong, Wenhua Yang, Yu Zhou
Summary: Accessible design aims to enable as many people as possible to access software products and services. This study investigates the interaction between accessibility issues and other factors affecting software performance. By analyzing a large number of accessibility issues, the study reveals the characteristics of these issues and their relationship with software quality attributes.
SCIENCE OF COMPUTER PROGRAMMING
(2024)