☆ 4.4 Article

Examining the Missing Completely at Random Mechanism in Incomplete Data Sets: A Multiple Testing Approach

STRUCTURAL EQUATION MODELING-A MULTIDISCIPLINARY JOURNAL (2012)

Journal

STRUCTURAL EQUATION MODELING-A MULTIDISCIPLINARY JOURNAL

Volume 19, Issue 3, Pages 399-408

Publisher

PSYCHOLOGY PRESS

DOI: 10.1080/10705511.2012.687660

Keywords

false discovery rate; incomplete data; missing completely at random; multiple testing

Categories

Mathematics, Interdisciplinary Applications Social Sciences, Mathematical Methods

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

A multiple testing procedure for examining implications of the missing completely at random (MCAR) mechanism in incomplete data sets is discussed. The approach uses the false discovery rate concept and is concerned with testing group differences on a set of variables. The method can be used for ascertaining violations of MCAR and disproving this mechanism in empirical behavioral and social research. The procedure can also be employed when locating violations of MCAR in observed measures is of interest. The outlined approach is illustrated with data from a cognitive intervention study.

Authors

I am an author on this paper

Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4

Not enough ratings

Secondary Ratings

Novelty

-

Significance

-

Scientific rigor

-

Rate this paper

Recommended

Article Statistics & Probability

OPTIMAL FALSE DISCOVERY RATE CONTROL FOR LARGE SCALE MULTIPLE TESTING WITH AUXILIARY INFORMATION

Hongyuan Cao, Jun Chen, Xianyang Zhang

Summary: The article introduces a method to improve the statistical power of large-scale multiple testing by utilizing auxiliary information in high-dimensional statistical inference. By using a framework based on a two-group mixture model and imposing structural relationship constraints and an optimal rejection rule to control the false discovery rate, the method's power is enhanced. The advantages of the proposed method are verified through empirical and theoretical analysis.

ANNALS OF STATISTICS (2022)

Add to Collection

Article Statistics & Probability

Covariate Adaptive False Discovery Rate Control With Applications to Omics-Wide Multiple Testing

Xianyang Zhang, Jun Chen

Summary: This article introduces an FDR control procedure that can incorporate covariate information in large-scale inference problems. The proposed procedure is implemented using a fast algorithm and has been shown to have asymptotic validity even in cases of misspecified models and weakly dependent p-values. Extensive simulations demonstrate that the method improves upon existing approaches in terms of flexibility, robustness, power, and computational efficiency. The method is applied to omics datasets from genomics studies to identify features associated with clinical and biological phenotypes, and shows superiority, particularly in sparse signal scenarios.

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION (2022)

Add to Collection

Article Biochemical Research Methods

propeller: testing for differences in cell type proportions in single cell data

Belinda Phipson, Choon Boon Sim, Enzo R. Porrello, Alex W. Hewitt, Joseph Powell, Alicia Oshlack

Summary: Single cell RNA-Sequencing (scRNA-seq) is popular for profiling cell transcriptomes. propeller is a robust method leveraging biological replication to find significant differences in cell type proportions. It performs well in various scenarios.

BIOINFORMATICS (2022)

Add to Collection

Article Statistics & Probability

False Discovery Rates to Detect Signals from Incomplete Spatially Aggregated Data

Hsin-Cheng Huang, Noel Cressie, Andrew Zammit-Mangion, Guowen Huang

Summary: The study introduces a new EFDR-CS procedure for incomplete data defined on irregular small areas, using conditional simulation to estimate the signal and combining M p-values using copulas for hypothesis testing. This procedure is demonstrated through a simulation study and applications to real data in the Asia-Pacific region and Middle East, Afghanistan, and Pakistan.

JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS (2021)

Add to Collection

Article Automation & Control Systems

Integrative High Dimensional Multiple Testing with Heterogeneity under Data Sharing Constraints

Molei Liu, Yin Xia, Kelly Cho, Tianxi Cai

Summary: Identifying informative predictors in a high-dimensional regression model is crucial for association analysis and predictive modeling. Signal detection often fails in high-dimensional settings due to limited sample size, but meta-analyzing multiple studies can help improve power. Integrative analysis of high-dimensional data from different studies poses challenges, especially with data sharing constraints, but a new method called DSILT is proposed for signal detection without sharing individual-level data. The method incorporates proper estimation and debiasing procedures to construct test statistics for specific covariates, and a multiple testing procedure is developed to control false discovery rate and identify significant effects. Simulation studies show the proposed testing procedure performs well in controlling false discoveries and achieving power.

JOURNAL OF MACHINE LEARNING RESEARCH (2021)

Add to Collection

Article Management

False Discovery in A/B Testing

Ron Berman, Christophe Van den Bulte

Summary: The study reveals that up to 70% of significant results in website A/B testing are actually null effects, leading to high false discovery rates. Decision makers should be aware that one in five interventions achieving significance at a 5% confidence level may be ineffective in practice.

MANAGEMENT SCIENCE (2021)

Add to Collection

Article Multidisciplinary Sciences

MultipleTesting.com: A tool for life science researchers for multiple hypothesis testing correction

Otilia Menyhart, Boglarka Weltz, Balazs Gyorffy

Summary: Scientists across disciplines face the challenge of evaluating multiple hypotheses simultaneously, which requires consideration of statistical testing and confidence measures. Various strategies exist to address the issue of multiple hypothesis testing, with one approach being the use of multiple-testing correction methods.

PLOS ONE (2021)

Add to Collection

Article Mathematics, Interdisciplinary Applications

Screening-Assisted Dynamic Multiple Testing with False Discovery Rate Control

Iram Mushtaq, Qin Zhou, Xuemin Zi

Summary: In the era of big data, it is crucial to make timely and accurate decisions due to the arrival of high-dimensional data in streams. Identifying individuals with deviant behavior from the norm has become particularly important. The authors propose a large-scale dynamic testing system based on false discovery rate (FDR) control in order to detect as many irregular behavioral patterns as possible. By leveraging the sequential feature of datastreams, they develop a screening-assisted procedure that filters and tests streams in a sequential manner. The proposed method is shown to be accurate and powerful through simulation studies and a real-data example.

JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY (2023)

Add to Collection

Article Health Care Sciences & Services

Multiple imputation with missing data indicators

Lauren J. Beesley, Irina Bondarenko, Michael R. Elliot, Allison W. Kurian, Steven J. Katz, Jeremy M. G. Taylor

Summary: This paper describes how to generalize the sequential regression multiple imputation procedure to handle non-random missingness when missingness may depend on other variables. The method reduces bias in the final analysis compared to standard techniques, using approximation strategies involving inclusion of an offset in the imputation model.

STATISTICAL METHODS IN MEDICAL RESEARCH (2021)

Add to Collection

Article Automation & Control Systems

Asynchronous Online Testing of Multiple Hypotheses

Tijana Zrnic, Aaditya Ramdas, Michael Jordan

Summary: This study focuses on controlling the false discovery rate in asynchronous online testing, proposing a general framework that addresses dependency issues and improves existing algorithms. The use of conflict sets is highlighted as a way to better manage dependencies among test statistics.

JOURNAL OF MACHINE LEARNING RESEARCH (2021)

Add to Collection

Article Biochemical Research Methods

Querying multiple sets of P-values through composed hypothesis testing

Tristan Mary-Huard, Sarmistha Das, Indranil Mukhopadhyay, Stephane Robin

Summary: This study introduces the concept of composed hypothesis and rephrases the problem of testing complex hypotheses as a classification task, demonstrating that finding items for which the composed null hypothesis is rejected boils down to fitting a mixture model and classifying the items according to their posterior probabilities. The study showcases the efficiency and usefulness of the developed method in simulations and on two different applications, providing valuable biological insight.

BIOINFORMATICS (2022)

Add to Collection

Article Multidisciplinary Sciences

Weighted multiple testing procedures in genome-wide association studies

Ludivine Obry, Cyril Dalmasso

Summary: In this study, we evaluated recent weighted multiple testing procedures for genome wide association studies (GWAS) through a simulation study. We also introduced a new efficient procedure called wBHa, which prioritizes the detection of genetic variants with low minor allel frequencies while maximizing overall detection power. Our results demonstrated that wBHa outperformed other procedures in detecting rare variants while maintaining good overall power.

PEERJ (2023)

Add to Collection

Article Biochemical Research Methods

Bridging the False Discovery Gap

Arya Ebadi, Jack Freestone, William S. Noble, Uri Keich

Summary: Controlling the false discovery rate (FDR) in proteomics experiments using target decoy competition (TDC) only controls the average proportion of false discoveries. However, the actual proportion of false discoveries (FDP) may exceed the specified FDR threshold. We demonstrate this using real data and present two methods, FDP Stepdown and TDC Uniform Band, which help bridge the gap between controlling the expected FDR and the empirical FDP.

JOURNAL OF PROTEOME RESEARCH (2023)

Add to Collection

Article Biochemical Research Methods

Precursor deconvolution error estimation: The missing puzzle piece in false discovery rate in top-down proteomics

Kyowon Jeong, Philipp T. Kaulich, Wonhyeuk Jung, Jihyung Kim, Andreas Tholey, Oliver Kohlbacher

Summary: Top-down proteomics provides more comprehensive proteoform-level information, but reliable data analysis remains challenging. The conventional FDR estimation method may not work at the proteoform level, and the precursor deconvolution error rate should be taken into account.

PROTEOMICS (2023)

Add to Collection

Article Physics, Multidisciplinary

Multiple testing corrections in a climate complex network

Viola Meroni, Carlo De Michele

Summary: This study investigates the application of multiple testing corrections in complex network analysis. By comparing four different methods, it is found that false discovery rate correction is a better option.

PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS (2022)

Add to Collection

No Data Available

No Data Available

© Peeref 2019-2024. All rights reserved.