Impact of Differential Item Functioning on Subsequent Statistical Conclusions Based on Observed Test Score Data (Seccion METODOLOGICA)

Psicologica 2009, July-Dec, 30, 2

Publisher Description

Differential item functioning (DIF) has been widely studied in educational and psychological measurement; for recent reviews, see Camilli (2006) and Zumbo (2007). Previous research has focused primarily on defining DIF and on methods for detecting it. It is well accepted that the presence of DIF can degrade the validity of a test. Relatively little is known, however, about the impact of DIF on later statistical decisions when the observed test scores are used in data analyses and the corresponding hypothesis tests. For example, imagine that a researcher is investigating whether there are gender differences on a language proficiency test. What is the impact of gender-based DIF on the eventual statistical decision of whether the group means (male versus female) of the observed scores are equal? There is remarkably little research to help answer this question directly. DIF may be present in a test because (a) DIF analyses were not used as part of the item analyses, (b) true DIF items were missed, unbeknownst to the researcher, because DIF detection is itself a statistical decision procedure, or (c) items flagged as DIF were deliberately left in the test. Irrespective of how the DIF items got there, it is still unknown how they affect subsequent statistical results and conclusions, in particular the Type I error rate and effect size of hypothesis tests based on observed test scores.
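
The question posed in the description can be made concrete with a small Monte Carlo sketch. The code below is not the article's design: it assumes a Rasch (one-parameter logistic) model, two groups with identical ability distributions, uniform DIF implemented as a difficulty shift on some items for one group, and a large-sample z-test on the observed sum scores. All names and default values (`rejection_rate`, `dif_shift`, `n_dif_items`, the item difficulties) are hypothetical choices made for illustration.

```python
import math
import random
import statistics


def simulate_scores(rng, n_persons, difficulties):
    """Draw Rasch-model sum scores for one group (abilities ~ N(0, 1))."""
    scores = []
    for _ in range(n_persons):
        theta = rng.gauss(0.0, 1.0)
        # An item is answered correctly with probability logistic(theta - b).
        score = sum(
            rng.random() < 1.0 / (1.0 + math.exp(-(theta - b)))
            for b in difficulties
        )
        scores.append(score)
    return scores


def means_differ(x, y, crit=1.96):
    """Two-sided test of equal means (large-sample normal approximation)."""
    se = math.sqrt(statistics.variance(x) / len(x)
                   + statistics.variance(y) / len(y))
    return abs(statistics.mean(x) - statistics.mean(y)) / se > crit


def rejection_rate(n_dif_items, dif_shift, n_items=20, n_persons=200,
                   n_reps=200, seed=1):
    """Share of replications in which equal group means are rejected.

    Both groups have the same ability distribution, so every rejection
    is a Type I error with respect to true ability differences.
    """
    rng = random.Random(seed)
    # Assumed item difficulties, evenly spaced from -1.5 to 1.5.
    base = [-1.5 + 3.0 * i / (n_items - 1) for i in range(n_items)]
    # Uniform DIF: the first n_dif_items items are harder for the focal group.
    focal = [b + dif_shift if i < n_dif_items else b
             for i, b in enumerate(base)]
    rejections = sum(
        means_differ(simulate_scores(rng, n_persons, base),
                     simulate_scores(rng, n_persons, focal))
        for _ in range(n_reps)
    )
    return rejections / n_reps
```

Comparing `rejection_rate(0, 0.0)` with, say, `rejection_rate(6, 1.0)` contrasts a DIF-free test against one with six DIF items under these assumptions. The exact rates depend on the seed and on the assumed parameters, so they should be read as illustrative rather than as results from the article.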

Genre: Health & Well-Being
Released: 1 July 2009
Language: English (EN)
Length: 43 pages
Publisher: Universidad de Valencia
Size: 282 KB

More Books by Psicologica

Evaluacion de Las Dimensiones de Valencia, Activacion, Frecuencia Subjetiva de Uso y Relevancia Para la Ansiedad, La Depresion y la IRA de 238 Sustantivos en Una Muestra Universitaria.
2010
Actividad Electrofisiologica Durante El Procesamiento de Silabas y Prefijos.
2010
¿Perjudica Antonio Banderas a Javier Bardem?: La Competicion Semantica en Tareas de Nombrado de Personas.
2010
Efectos de Metodo en Las Escalas de Ryff: Un Estudio en Poblacion de Personas Mayores.
2010
Effects of Task and Category Membership on Representation Stability.
2011
Activacion Automatica de Las Dimensiones de Competencia y Sociabilidad en El Caso de Los Estereotipos de Genero (Seccion EXPERIMENTAL)
2008