Use and misuse of predicted values in epidemiologic data analyses (TG4)

Description

Pamela A. Shaw, Paul Gustafson, Daniela Sotres-Alvarez, Victor Kipnis, and Laurence Freedman

In many epidemiologic settings, the principal exposure or outcome under study can only be measured imprecisely. To address this errors-in-variables problem, the analyst will sometimes adjust these variables, say through a calibration or prediction equation, and use the resulting predicted value in the analysis in place of the observed value. When a predicted quantity replaces an observed value in a data analysis, the impact of the uncertainty in the predicted quantity on the study results needs to be considered, but this is not always done in practice. Such predicted variables usually have Berkson error. In some settings, ignoring this uncertainty, or prediction error, can bias the parameter estimates, the standard errors, or both. We examine three common examples of how predicted values are used in an analysis in place of an error-prone variable: 1) to estimate the distribution of a variable, 2) to compare values of a variable between groups by using the predicted value in a two-group statistic (e.g. t-statistic) or as an outcome variable in a regression, and 3) to estimate the effect of an error-prone variable on an outcome, where the predicted quantity is used as the exposure variable in a regression. For each example, we present an overview of the potential consequences of using a predicted quantity in place of the true value without appropriate statistical adjustment. We further illustrate some concepts with data from a large population-based cohort, the Hispanic Community Health Study/Study of Latinos (HCHS/SOL).
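
Examples 1 and 3 above can be illustrated with a small simulation. The sketch below is not taken from the talk or from HCHS/SOL: the data-generating model, the coefficient values, the sample sizes, and the names `ols` and `simulate` are all assumptions chosen for illustration. It fits a linear calibration equation in a subsample where the true exposure is assumed to be known, predicts the exposure for everyone, and then compares (a) the variance of the predicted and true exposure and (b) the outcome-regression slope when the error-prone measurement or the predicted value is used as the exposure.

```python
import numpy as np


def ols(columns, y):
    """Least-squares fit of y on an intercept plus the given columns."""
    X = np.column_stack([np.ones(len(y))] + list(columns))
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    return coef


def simulate(n=20000, m=5000, seed=0):
    """Hypothetical setup: all parameter values below are assumptions."""
    rng = np.random.default_rng(seed)
    z = rng.normal(size=n)                              # error-free covariate
    x = 1.0 + 0.8 * z + rng.normal(scale=0.6, size=n)   # true exposure
    w = x + rng.normal(scale=1.0, size=n)               # error-prone measurement
    y = 2.0 + 0.5 * x + rng.normal(scale=1.0, size=n)   # outcome, true slope 0.5

    # Calibration equation fit in a subsample where x is assumed observed.
    sub = slice(0, m)
    a, b_w, b_z = ols([w[sub], z[sub]], x[sub])
    x_hat = a + b_w * w + b_z * z                       # predicted exposure

    return {
        "var_true": x.var(ddof=1),
        "var_pred": x_hat.var(ddof=1),
        "slope_naive": ols([w], y)[1],      # error-prone w used as exposure
        "slope_pred": ols([x_hat], y)[1],   # predicted value used as exposure
    }


result = simulate()
for name, value in result.items():
    print(f"{name}: {value:.3f}")
```

Under these assumed parameters, the predicted exposure should show a smaller variance than the true exposure, so treating its distribution as the distribution of the true variable understates the spread. The slope based on the error-prone measurement should be attenuated to roughly half the true value, while the slope based on the predicted value should land close to 0.5. The sketch does not address standard errors: a second-stage standard error that treats the predicted value as observed ignores the uncertainty in the estimated calibration coefficients, and one common remedy is to bootstrap both stages together.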

Item Metadata

Title	Use and misuse of predicted values in epidemiologic data analyses (TG4)
Creator	Shaw, Pamela
Publisher	Banff International Research Station for Mathematical Innovation and Discovery
Date Issued	2019-06-04T09:30
Extent	20.0 minutes
Subject	Mathematics; Statistics
Type	Moving Image
File Format	video/mp4
Language	eng
Notes	Author affiliation: University of Pennsylvania
Series	BIRS Workshop Lecture Videos (Banff, Alta)
Date Available	2020-09-12
Provider	Vancouver : University of British Columbia Library
Rights	Attribution-NonCommercial-NoDerivatives 4.0 International
DOI	10.14288/1.0394338
URI	http://hdl.handle.net/2429/75995
Affiliation	Non UBC
Peer Review Status	Unreviewed
Scholarly Level	Researcher
Rights URI	http://creativecommons.org/licenses/by-nc-nd/4.0/
Aggregated Source Repository	DSpace

Item Media

201906040930-Shaw_lrv.mp4 -- 87.39MB
