Article
Computer Science, Information Systems
Toon Boeckling, Guy De Tre, Antoon Bronselaer
Summary: This paper proposes and studies a type of tuple-level constraint called selection rules, explores their concepts and properties, and investigates rule implication with selection rules. Compared to HoloClean, repair strategies with selection rules show better performance and accuracy in terms of error detection and correction.
Article
Mathematics
Hunduma Legesse Geleta, Oluma Ararso Alemu
Summary: Finding the regions containing all the zeros of complex-valued harmonic polynomials is a relatively new research area. In this article, we provide the inclusion regions of all the zeros of these polynomials in general, and specifically, we bound the zeros of certain harmonic trinomials within a certain annular region.
JOURNAL OF MATHEMATICS
(2022)
Article
Computer Science, Information Systems
Toon Boeckling, Guy De Tre, Antoon Bronselaer
Summary: This paper studies novel techniques to enhance the performance of edit rule implication, an essential subtask when repairing data inconsistencies. The authors draw attention to the use of nominal and ordinal edit rules and propose enhanced algorithms for both. Evaluation results show promising improvements, with the ordinal algorithm performing the best.
INFORMATION SCIENCES
(2022)
Article
Computer Science, Information Systems
Nasir Mahmood, Yaser Hafeez, Khalid Iqbal, Shariq Hussain, Muhammad Aqib, Muhammad Jamal, Oh-Young Song
Summary: This study proposes a fault prediction approach using data-mining technique to find good predictors for high-quality software, and experimental results show promising outcomes, which can be utilized by practitioners and developers for defect prediction.
CMC-COMPUTERS MATERIALS & CONTINUA
(2021)
Article
Computer Science, Information Systems
Lichuan Ma, Qingqi Pei, Lu Zhou, Haojin Zhu, Licheng Wang, Yusheng Ji
Summary: The study proposed a federated data cleaning protocol, FedClean, for edge intelligence scenarios to achieve data cleaning without compromising data privacy. By generating Boolean shares of data and privately computing AVF scores, abnormal data entries are filtered out through a bitonic sorting network.
IEEE INTERNET OF THINGS JOURNAL
(2021)
Article
Computer Science, Artificial Intelligence
Henning Koehler, Sebastian Link
Summary: Classical data cleaning methods often ignore the uncertainty of the data and constraints. We propose a non-invasive qualitative approach to uncertainty, which improves the effectiveness of data cleaning.
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING
(2022)
Article
Mathematics, Interdisciplinary Applications
Cui-Bin Ji, Gui-Jiang Duan, Jun-Yan Zhou, Wei-Jie Xuan
Summary: With the advancement of digital manufacturing technology, data-driven quality management is rapidly developing to address the issues of insufficient quality data acquisition and poor data quality of complex equipment. A data integration and cleaning method based on digital total quality management is proposed to provide the foundation for designing a digital total quality management system for complex equipment.
Article
Green & Sustainable Science & Technology
Aleksandar Mitrasinovic, Milos Tomic
Summary: This study showed that using a cleaner master alloy led to smaller grains and lower undercooling values in the final structure, as well as a smaller difference in released heat compared to specimens treated with commercial master alloys.
INTERNATIONAL JOURNAL OF PRECISION ENGINEERING AND MANUFACTURING-GREEN TECHNOLOGY
(2022)
Article
Chemistry, Multidisciplinary
Jieh-Ren Chang, You-Shyang Chen, Chien-Ku Lin, Ming-Fu Cheng
Summary: Based on more than 8000 items of SSD error data, this study uses association rule algorithm to propose three improvement strategies for production control, including speeding up error judgment, formulating quality strategy, and customer service strategy.
APPLIED SCIENCES-BASEL
(2021)
Article
Computer Science, Information Systems
Zhuqi Miao, Meghan D. Sealey, Shrieraam Sathyanarayanan, Dursun Delen, Lan Zhu, Scott Shepherd
Summary: This study proposes a data preparation framework for guiding and validating the cleaning of electronic health record (EHR) data for secondary analysis. The framework includes three core themes: workflow, assessment and cleaning methods, and cleaning evaluation scheme. A case study using data from a large EHR database demonstrates the effectiveness of the framework in organizing and standardizing phases and processes within an EHR data preparation workflow. The cleaning evaluation scheme is particularly effective in validating EHR cleaning methods for handling complex issues in patient demographics, longitudinal EHR attributes, and filtering/imputation cleaning methods.
INFORMATION SYSTEMS
(2023)
Article
Ecology
Bruno R. Ribeiro, Santiago Jose Elias Velazco, Karlo Guidoni-Martins, Geiziane Tessarolo, Lucas Jardim, Steven P. Bachman, Rafael Loyola
Summary: The increase in online and openly accessible biodiversity databases provides a valuable resource for research and policy, but errors in primary species occurrence data can lead to misleading information. This study introduces an R package, bdc, that addresses quality issues and improves the fitness-for-use of biodiversity datasets by integrating several aspects of data cleaning.
METHODS IN ECOLOGY AND EVOLUTION
(2022)
Article
Computer Science, Artificial Intelligence
Alvaro Valencia-Parra, Luisa Parody, Angel Jesus Varela-Vaca, Ismael Caballero, Maria Teresa Gomez-Lopez
Summary: In order to succeed in business processes, organizations must focus on the quality and usability of data, with emphasis on obtaining recommendations for data usability before usage. Using DMN for data quality assessment and automated generation of recommendations can enhance decision-making processes for organizations when it comes to data usability.
DECISION SUPPORT SYSTEMS
(2021)
Article
Agriculture, Multidisciplinary
Katrien Devolder
Summary: The paper discusses the ethical concerns and objections to genome editing in livestock, despite its potential to address urgent global issues. While some see it as a technological fix, others worry about unintended consequences and complicity in factory farming. The author suggests considering wider obligations and potential impacts beyond narrow problem-solving.
JOURNAL OF AGRICULTURAL & ENVIRONMENTAL ETHICS
(2021)
Article
Engineering, Civil
Yangping Yao, Xing Zhang, Wenjie Cui
Summary: Intelligent compaction (IC) is gaining attention in construction engineering for earthwork compaction. However, outliers in IC data sets can lead to misinterpretation and erroneous quality assessment. This study proposes a method combining density-based local outlier factor (LOF) and inverse distance weighted (IDW) to clean measured data sets. Results show that the proposed LOF-IDW method exhibits better performance in outlier diagnosis, data rehabilitation, and reducing variation, providing reliable support for quality assessment and decision-making in IC.
TRANSPORTATION GEOTECHNICS
(2023)
Article
Mathematics, Applied
Pinakadhar Baliarsingh
Summary: The concept of difference operators based on fractional-order is widely used in various fields such as linear algebra, approximation theory, and the theory of fractional calculus. This paper focuses on studying the convergence of difference sequence and analyzing the consistency and validity of related formulas. Basic results involving convergence, linearity, exponent rule, topological properties, Leibniz, and chain rules for fractional derivatives have been investigated and demonstrated with illustrative examples.
MATHEMATICAL METHODS IN THE APPLIED SCIENCES
(2021)
Article
Computer Science, Hardware & Architecture
Yang Cao, Wenfei Fan, Shuai Ma
Article
Computer Science, Information Systems
Yang Cao, Wenfei Fan, Floris Geerts, Ping Lu
ACM TRANSACTIONS ON DATABASE SYSTEMS
(2018)
Article
Computer Science, Information Systems
Wenfei Fan, Yang Cao, Jingbo Xu, Wenyuan Yu, Yinghui Wu, Chao Tian, Jiaxin Jiang, Bohan Zhang
Article
Computer Science, Information Systems
Wenfei Fan, Wenyuan Yu, Jingbo Xu, Jingren Zhou, Xiaojian Luo, Qiang Yin, Ping Lu, Yang Cao, Ruiqi Xu
ACM TRANSACTIONS ON DATABASE SYSTEMS
(2018)
Article
Computer Science, Information Systems
Wenfei Fan, Ping Lu
ACM TRANSACTIONS ON DATABASE SYSTEMS
(2019)
Article
Multidisciplinary Sciences
Wenfei Fan
PROCEEDINGS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES
(2019)
Article
Automation & Control Systems
Yang Cao, Wen-Fei Fan, Teng-Fei Yuan
INTERNATIONAL JOURNAL OF AUTOMATION AND COMPUTING
(2020)
Article
Computer Science, Information Systems
Wenfei Fan, Chunming Hu, Muyang Liu, Ping Lu, Qiang Yin, Jingren Zhou
PROCEEDINGS OF THE VLDB ENDOWMENT
(2019)
Article
Computer Science, Information Systems
Yang Cao, Wenfei Fan, Tengfei Yuan
PROCEEDINGS OF THE VLDB ENDOWMENT
(2019)
Article
Computer Science, Information Systems
Wenfei Fan, Ping Lu, Chao Tian, Jingren Zhou
PROCEEDINGS OF THE VLDB ENDOWMENT
(2019)
Proceedings Paper
Computer Science, Information Systems
Wenfei Fan, Xueli Liu, Yingjie Cao
2018 IEEE 34TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE)
(2018)
Proceedings Paper
Computer Science, Information Systems
Wenfei Fan, Xueli Liu, Ping Lu, Chao Tian
SIGMOD'18: PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA
(2018)
Proceedings Paper
Computer Science, Information Systems
Wenfei Fan, Chunming Hu, Xueli Liu, Ping Lu
SIGMOD'18: PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA
(2018)
Proceedings Paper
Computer Science, Information Systems
Wenfei Fan, Ping Lu, Xiaojian Luo, Jingbo Xu, Qiang Yin, Wenyuan Yu, Ruiqi Xu
SIGMOD'18: PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA
(2018)
Proceedings Paper
Computer Science, Information Systems
Wenfei Fan, Chunming Hu, Chao Tian
SIGMOD'17: PROCEEDINGS OF THE 2017 ACM INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA
(2017)