4.3 Article

Wald χ2 Test for Differential Item Functioning Detection with Polytomous Items in Multilevel Data

期刊

出版社

SAGE PUBLICATIONS INC
DOI: 10.1177/00131644231181688

关键词

differential item functioning; item response theory; Wald & chi;(2) test; multilevel data; measurement invariance

向作者/读者索取更多资源

Identifying items with differential item functioning (DIF) is crucial for equitable measurement, but detecting DIF items in multilevel data has not been fully addressed. This study presents a multilevel extension of a two-stage procedure for detecting both uniform and non-uniform DIF with polytomous items. The proposed approach utilizes the Lord's Wald χ2 test and the Metropolis-Hastings Robbins-Monro algorithm for accurate estimation and evaluation. Simulation results show that the proposed approach has great power and controls Type I error rate effectively. Limitations and future research directions are discussed.
Identifying items with differential item functioning (DIF) in an assessment is a crucial step for achieving equitable measurement. One critical issue that has not been fully addressed with existing studies is how DIF items can be detected when data are multilevel. In the present study, we introduced a Lord's Wald ? 2 test-based procedure for detecting both uniform and non-uniform DIF with polytomous items in the presence of the ubiquitous multilevel data structure. The proposed approach is a multilevel extension of a two-stage procedure, which identifies anchor items in its first stage and formally evaluates candidate items in the second stage. We applied the Metropolis-Hastings Robbins-Monro (MH-RM) algorithm to estimate multilevel polytomous item response theory (IRT) models and to obtain accurate covariance matrices. To evaluate the performance of the proposed approach, we conducted a preliminary simulation study that considered various conditions to mimic real-world scenarios. The simulation results indicated that the proposed approach has great power for identifying DIF items and well controls the Type I error rate. Limitations and future research directions were also discussed.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.3
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据