☆ 3.8 Article

Data Linkage Using Probabilistic Decision Rules: A Primer

BIRTH DEFECTS RESEARCH PART A-CLINICAL AND MOLECULAR TERATOLOGY (2008)

期刊

BIRTH DEFECTS RESEARCH PART A-CLINICAL AND MOLECULAR TERATOLOGY

卷 82, 期 11, 页码 812-821

出版社

WILEY-BLACKWELL

DOI: 10.1002/bdra.20510

关键词

data linkage; probabilistic linkage; record matching

类别

Developmental Biology Toxicology

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Electronic data linkage is increasingly being used by researchers and health professionals in the birth defects field as a tool for enhancing both research and service/care. However, in many cases, a common pre-existing ID number does not exist across different datasets, and common identifiers, such as names or dates of birth, which could be used to match records, may be known to contain errors or even legitimate differences over time. In such situations, probabilistic matching, which does not require that all identifying fields exactly agree in order for one to conclude that two records belong to the same individual, can be a valuable tool for improving data linkage. However, probabilistic matching is computationally complex and demanding, and not well understood by many who may wish to apply it in their work. Therefore, the purpose of this article is to provide an overview of one approach to probabilistic matching, including the step-by-step procedures involved in the calculation of indices corresponding to the likelihood that two records are a correct match. In addition, the use of multiple iterative protocols, in which several different matching strategies are used in order to maximize the number of linked records, is discussed. Finally, issues related to deduplication and verification of internal-consistency in the linked data set are also reviewed. Birth Defects Research (Part A) 82:812-821, 2008. (c) 2008 Wiley-Liss, Inc.

Data Linkage Using Probabilistic Decision Rules: A Primer

期刊

BIRTH DEFECTS RESEARCH PART A-CLINICAL AND MOLECULAR TERATOLOGY

出版社

WILEY-BLACKWELL

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Data Linkage Using Probabilistic Decision Rules: A Primer

期刊

BIRTH DEFECTS RESEARCH PART A-CLINICAL AND MOLECULAR TERATOLOGY

出版社

WILEY-BLACKWELL

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文