4.6 Article

Data extraction for epidemiological research (DExtER): a novel tool for automated clinical epidemiology studies

期刊

EUROPEAN JOURNAL OF EPIDEMIOLOGY
卷 36, 期 2, 页码 165-178

出版社

SPRINGER
DOI: 10.1007/s10654-020-00677-6

关键词

Epidemiology; Computer science; Extract; Transform; Load; Observational study; Research methods

资金

  1. Health data research UK
  2. EPSRC [EP/L000296/1] Funding Source: UKRI
  3. MRC [MR/S003878/1] Funding Source: UKRI

向作者/读者索取更多资源

The paper introduces a new software program DExtER that aids in the extraction and processing of primary care electronic health records for high quality epidemiological studies. The tool, with a user-friendly interface, offers the ability to obtain data extracts specific to each research question and study design.
The use of primary care electronic health records for research is abundant. The benefits gained from utilising such records lies in their size, longitudinal data collection and data quality. However, the use of such data to undertake high quality epidemiological studies, can lead to significant challenges particularly in dealing with misclassification, variation in coding and the significant effort required to pre-process the data in a meaningful format for statistical analysis. In this paper, we describe a methodology to aid with the extraction and processing of such databases, delivered by a novel software programme; the Data extraction for epidemiological research (DExtER). The basis of DExtER relies on principles of extract, transform and load processes. The tool initially provides the ability for the healthcare dataset to be extracted, then transformed in a format whereby data is normalised, converted and reformatted. DExtER has a user interface designed to obtain data extracts specific to each research question and observational study design. There are facilities to input the requirements for; eligible study period, definition of exposed and unexposed groups, outcome measures and important baseline covariates. To date the tool has been utilised and validated in a multitude of settings. There have been over 35 peer-reviewed publications using the tool, and DExtER has been implemented as a validated public health surveillance tool for obtaining accurate statistics on epidemiology of key morbidities. Future direction of this work will be the application of the framework to linked as well as international datasets and the development of standardised methods for conducting electronic pre-processing and extraction from datasets for research purposes.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据