☆ 4.5 Article

Applying machine classifiers to update searches: Analysis from two case studies

RESEARCH SYNTHESIS METHODS (2022)

Journal

RESEARCH SYNTHESIS METHODS

Volume 13, Issue 1, Pages 121-133

Publisher

WILEY

DOI: 10.1002/jrsm.1537

Keywords

information retrieval; supervised machine learning; systematic reviews as topic; update search

Funding

National Institute for Health Research (NIHR) Policy Research Programme (PRP) for the Department of Health and Social Care, England

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

The study shows that using machine classifiers can reduce the workload of manually screening citation records and save significant time. Classifier performance varies depending on the training data used, but high recall and substantial screening reduction can be achieved. Machine classifiers have great potential in handling update searches of public health research.

Manual screening of citation records could be reduced by using machine classifiers to remove records of very low relevance. This seems particularly feasible for update searches, where a machine classifier can be trained from past screening decisions. However, feasibility is unclear for broad topics. We evaluate the performance and implementation of machine classifiers for update searches of public health research using two case studies. The first study evaluates the impact of using different sets of training data on classifier performance, comparing recall and screening reduction with a manual screening 'gold standard'. The second study uses screening decisions from a review to train a classifier that is applied to rank the update search results. A stopping threshold was applied in the absence of a gold standard. Time spent screening titles and abstracts of different relevancy-ranked records was measured. Results: Study one: Classifier performance varies according to the training data used; all custom-built classifiers had a recall above 93% at the same threshold, achieving screening reductions between 41% and 74%. Study two: applying a classifier provided a solution for tackling a large volume of search results from the update search, and screening volume was reduced by 61%. A tentative estimate indicates over 25 h screening time was saved. In conclusion, custom-built machine classifiers are feasible for reducing screening workload from update searches across a range of public health interventions, with some limitation on recall. Key considerations include selecting a training dataset, agreeing stopping thresholds and processes to ensure smooth workflows.

Applying machine classifiers to update searches: Analysis from two case studies

Journal

RESEARCH SYNTHESIS METHODS

Publisher

WILEY

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Applying machine classifiers to update searches: Analysis from two case studies

Journal

RESEARCH SYNTHESIS METHODS

Publisher

WILEY

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper