Leakage and the reproducibility crisis in machine-learning-based science
Published 2023 View Full Article
- Home
- Publications
- Publication Search
- Publication Details
Title
Leakage and the reproducibility crisis in machine-learning-based science
Authors
Keywords
-
Journal
Patterns
Volume 4, Issue 9, Pages 100804
Publisher
Elsevier BV
Online
2023-08-04
DOI
10.1016/j.patter.2023.100804
References
Ask authors/readers for more resources
Related references
Note: Only part of the references are listed.- Reproducibility in machine learning for health research: Still a ways to go
- (2021) Matthew B. A. McDermott et al. Science Translational Medicine
- Inflated prediction accuracy of neuropsychiatric biomarkers caused by data leakage in feature selection
- (2021) Miseon Shim et al. Scientific Reports
- An Empirical Study of the Impact of Data Splitting Decisions on the Performance of AIOps Solutions
- (2021) Yingzhe Lyu et al. ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY
- Integrating explanation and prediction in computational social science
- (2021) Jake M. Hofman et al. NATURE
- A computational reproducibility study of PLOS ONE articles featuring longitudinal data analyses
- (2021) Heidi Seibold et al. PLoS One
- What Is Your Estimand? Defining the Target Quantity Connects Statistical Evidence to Theory
- (2021) Ian Lundberg et al. AMERICAN SOCIOLOGICAL REVIEW
- Datasheets for datasets
- (2021) Timnit Gebru et al. COMMUNICATIONS OF THE ACM
- Navigating the pitfalls of applying machine learning in genomics
- (2021) Sean Whalen et al. NATURE REVIEWS GENETICS
- Questionable Classification Accuracy Reported in “Designing a Sum of Squared Correlations Framework for Enhancing SSVEP-Based BCIs”
- (2020) Masaki Nakanishi et al. IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING
- Measuring the predictability of life outcomes with a scientific mass collaboration
- (2020) Matthew J. Salganik et al. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA
- Overly optimistic prediction results on imbalanced data: a case study of flaws and benefits when applying over-sampling
- (2020) Gilles Vandewiele et al. ARTIFICIAL INTELLIGENCE IN MEDICINE
- Tractography and machine learning: Current state and open challenges
- (2019) Philippe Poulin et al. MAGNETIC RESONANCE IMAGING
- A systematic review shows no performance benefit of machine learning over logistic regression for clinical prediction models
- (2019) Evangelia Christodoulou et al. JOURNAL OF CLINICAL EPIDEMIOLOGY
- The trim-and-fill method for publication bias
- (2019) Linyu Shi et al. MEDICINE
- No coding required: Companies make it easier than ever for scientists to use artificial intelligence
- (2019) Matthew Hutson SCIENCE
- Validating Hyperspectral Image Segmentation
- (2019) Jakub Nalepa et al. IEEE Geoscience and Remote Sensing Letters
- Establishment of Best Practices for Evidence for Prediction
- (2019) Russell A. Poldrack et al. JAMA Psychiatry
- An empirical analysis of journal policy effectiveness for computational reproducibility
- (2018) Victoria Stodden et al. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA
- Supervised Machine Learning for Population Genetics: A New Paradigm
- (2018) Daniel R. Schrider et al. TRENDS IN GENETICS
- The accuracy, fairness, and limits of predicting recidivism
- (2018) Julia Dressel et al. Science Advances
- Meta-analysis and the science of research synthesis
- (2018) Jessica Gurevitch et al. NATURE
- Easing the burden of code review
- (2018) NATURE METHODS
- Oy Vey! A Comment on “Machine Learning of Toxicological Big Data Enables Read-Across Structure Activity Relationships Outperforming Animal Test Reproducibility”
- (2018) Vinicius M Alves et al. TOXICOLOGICAL SCIENCES
- Variable generalization performance of a deep learning model to detect pneumonia in chest radiographs: A cross-sectional study
- (2018) John R. Zech et al. PLOS MEDICINE
- 50 Years of Data Science
- (2017) David Donoho JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS
- Prediction and explanation in social systems
- (2017) Jake M. Hofman et al. SCIENCE
- A checklist is associated with increased quality of reporting preclinical biomedical research: A systematic review
- (2017) SeungHye Han et al. PLoS One
- Machine learning, statistical learning and the future of biological research in psychiatry
- (2016) R. Iniesta et al. PSYCHOLOGICAL MEDICINE
- Joint use of over- and under-sampling techniques and cross-validation for the development and assessment of prediction models
- (2015) Rok Blagus et al. BMC BIOINFORMATICS
- Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD Statement
- (2015) Gary S Collins et al. BMC Medicine
- ImageNet Large Scale Visual Recognition Challenge
- (2015) Olga Russakovsky et al. INTERNATIONAL JOURNAL OF COMPUTER VISION
- The importance of prediction model validation and assessment in obesity and nutrition research
- (2015) A E Ivanescu et al. INTERNATIONAL JOURNAL OF OBESITY
- Opinion: Reproducible research can still be wrong: Adopting a prevention approach: Fig. 1.
- (2015) Jeffrey T. Leek et al. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA
- When Optimism Hurts: Inflated Predictions in Psychiatric Neuroimaging
- (2013) Robert Whelan et al. BIOLOGICAL PSYCHIATRY
- Leakage in data mining
- (2013) Shachar Kaufman et al. ACM Transactions on Knowledge Discovery from Data
- Reporting and Methods in Clinical Prediction Research: A Systematic Review
- (2012) Walter Bouwmeester et al. PLOS MEDICINE
- pROC: an open-source package for R and S+ to analyze and compare ROC curves
- (2011) Xavier Robin et al. BMC BIOINFORMATICS
Find Funding. Review Successful Grants.
Explore over 25,000 new funding opportunities and over 6,000,000 successful grants.
ExploreFind the ideal target journal for your manuscript
Explore over 38,000 international journals covering a vast array of academic fields.
Search