Latent variable mixture models to test for differential item functioning: a population-based analysis

UBC Faculty Research and Publications

Latent variable mixture models to test for differential item functioning: a population-based analysis Wu, Xiuyun; Sawatzky, Richard; Hopman, Wilma; Mayo, Nancy; Sajobi, Tolulope T.; Liu, Juxin; Prior, Jerilynn C., 1943-; Papaioannou, Alexandra; Josse, Robert G.; Towheed, Tanveer; et al.

Abstract

Background: Comparisons of population health status using self-report measures such as the SF-36 rest on the assumption that the measured items have a common interpretation across sub-groups. However, self-report measures may be sensitive to differential item functioning (DIF), which occurs when sub-groups with the same underlying health status have a different probability of item response. This study tested for DIF on the SF-36 physical functioning (PF) and mental health (MH) sub-scales in population-based data using latent variable mixture models (LVMMs). Methods: Data were from the Canadian Multicentre Osteoporosis Study (CaMos), a prospective national cohort study. LVMMs were applied to the ten PF and five MH SF-36 items. A standard two-parameter graded response model with one latent class was compared to multi-class LVMMs. Multivariable logistic regression models with pseudo-class random draws characterized the latent classes on demographic and health variables. Results: The CaMos cohort consisted of 9423 respondents. A three-class LVMM fit the PF sub-scale, with class proportions of 0.59, 0.24, and 0.17. For the MH sub-scale, a two-class model fit the data, with class proportions of 0.69 and 0.31. For PF items, the probabilities of reporting greater limitations were consistently higher in classes 2 and 3 than class 1. For MH items, respondents in class 2 reported more health problems than in class 1. Differences in item thresholds and factor loadings between one-class and multi-class models were observed for both sub-scales. Demographic and health variables were associated with class membership. Conclusions: This study revealed DIF in population-based SF-36 data; the results suggest that PF and MH sub-scale scores may not be comparable across sub-groups defined by demographic and health status variables, although effects were frequently small to moderate in size. Evaluation of DIF should be a routine step when analysing population-based self-report data to ensure valid comparisons amongst sub-groups.

Item Metadata

Title	Latent variable mixture models to test for differential item functioning: a population-based analysis
Creator	Wu, Xiuyun; Sawatzky, Richard; Hopman, Wilma; Mayo, Nancy; Sajobi, Tolulope T.; Liu, Juxin; Prior, Jerilynn C., 1943-; Papaioannou, Alexandra; Josse, Robert G.; Towheed, Tanveer; Davison, K. S.; Lix, Lisa M.
Publisher	BioMed Central
Date Issued	2017-05-15
Description	Background: Comparisons of population health status using self-report measures such as the SF-36 rest on the assumption that the measured items have a common interpretation across sub-groups. However, self-report measures may be sensitive to differential item functioning (DIF), which occurs when sub-groups with the same underlying health status have a different probability of item response. This study tested for DIF on the SF-36 physical functioning (PF) and mental health (MH) sub-scales in population-based data using latent variable mixture models (LVMMs). Methods: Data were from the Canadian Multicentre Osteoporosis Study (CaMos), a prospective national cohort study. LVMMs were applied to the ten PF and five MH SF-36 items. A standard two-parameter graded response model with one latent class was compared to multi-class LVMMs. Multivariable logistic regression models with pseudo-class random draws characterized the latent classes on demographic and health variables. Results: The CaMos cohort consisted of 9423 respondents. A three-class LVMM fit the PF sub-scale, with class proportions of 0.59, 0.24, and 0.17. For the MH sub-scale, a two-class model fit the data, with class proportions of 0.69 and 0.31. For PF items, the probabilities of reporting greater limitations were consistently higher in classes 2 and 3 than class 1. For MH items, respondents in class 2 reported more health problems than in class 1. Differences in item thresholds and factor loadings between one-class and multi-class models were observed for both sub-scales. Demographic and health variables were associated with class membership. Conclusions: This study revealed DIF in population-based SF-36 data; the results suggest that PF and MH sub-scale scores may not be comparable across sub-groups defined by demographic and health status variables, although effects were frequently small to moderate in size. Evaluation of DIF should be a routine step when analysing population-based self-report data to ensure valid comparisons amongst sub-groups.
Subject	Latent class analysis; Item response theory; Mental health; Patient-reported outcome measures; Physical functioning; Population health
Genre	Article
Type	Text
Language	eng
Date Available	2017-05-17
Provider	Vancouver : University of British Columbia Library
Rights	Attribution 4.0 International (CC BY 4.0)
DOI	10.14288/1.0347535
URI	http://hdl.handle.net/2429/61728
Affiliation	Medicine, Faculty of; Non UBC; Endocrinology, Division of; Medicine, Department of
Citation	Health and Quality of Life Outcomes. 2017 May 15;15(1):102
Publisher DOI	10.1186/s12955-017-0674-0
Peer Review Status	Reviewed
Scholarly Level	Faculty
Copyright Holder	The Author(s).
Rights URI	http://creativecommons.org/licenses/by/4.0/
Aggregated Source Repository	DSpace

Open Collections

UBC Faculty Research and Publications