Some statistical models for the multivariate analysis of longitudinal data

UBC Theses and Dissertations

Featured Collection

UBC Theses and Dissertations

Some statistical models for the multivariate analysis of longitudinal data Song, Peter Xue-Kun

Abstract

This thesis develops some statistical models for the multivariate analysis of longitudinal data on the basis of the dispersion models of J0rgensen (1987a, 1996), consisting of three topics: multivariate dispersion models and their application to regression analysis, stationary time series models with non-normal margins and state space models with Markov latent processes. The goal of the thesis is to develop statistical models which can accommodate features of both trend and dependence for longitudinal data. This thesis focusses mainly on the following three types of longitudinal data, namely (1) many short time series, (2) a few long stationary time series and (3) a few long non-stationary time series with time-varying covariates. A class of multivariate dispersion models is proposed to deal with data of type (1), in a spirit similar to multivariate analysis based on the multivariate normal distribution. Under these multivariate parametric models, population-averaged models (Diggle et al., 1994) are revisited, where approximate inferences for regression parameters are presented, including the generalized estimating equation (GEE) of Liang and Zeger (1986) as a special case. The thesis also presents a class of stationary autoregressive moving-average (ARMA) models with exponential dispersion model margins for data of type (2). The class of ARMA models is defined as a special case of a class of stationary infinite order moving average processes constructed by means of the thinning operation of Joe (1996a). For analysis of type (3) data, two classes of state space models, including one with stationary latent processes and another with non-stationary latent processes, are proposed. To estimate regression parameters in both classes of models, we develop an algorithm for solving the so-called Kalman estimating equation (KEE), corresponding to a modified EM-algorithm where the E-step is approximated by the Kalman smoother that estimates the latent process via the best linear unbiased predictor (BLUP). Two simulation studies are conducted in the thesis based on Poisson-gamma models. One is for the comparison of the efficiency of the K E E approach and the Monte Carlo E M (MCEM) algorithm. The other simulation study is for the examination of the utility of the model diagnosis for detecting the misspecification of stationarity and non-stationarity for latent process. The thesis contains two data analyses. One data set consists of daily counts of emergency room visits for respiratory diseases to the hospital of Prince George, British Columbia, along with covariates of air pollution variables and meteorological variables. These data are analyzed through state space models to investigate the relationship between air pollution and respiratory morbidity. The other data set, consisting of the monthly number of poliomyelitis cases in the USA from 1970 to 1983, is analyzed based on the Poisson stationary-gamma model to study whether or not there is an evidence of a decreasing trend in the rate of polio infections in the USA.

Item Metadata

Title	Some statistical models for the multivariate analysis of longitudinal data
Creator	Song, Peter Xue-Kun
Publisher	University of British Columbia
Date Issued	1996
Description	This thesis develops some statistical models for the multivariate analysis of longitudinal data on the basis of the dispersion models of J0rgensen (1987a, 1996), consisting of three topics: multivariate dispersion models and their application to regression analysis, stationary time series models with non-normal margins and state space models with Markov latent processes. The goal of the thesis is to develop statistical models which can accommodate features of both trend and dependence for longitudinal data. This thesis focusses mainly on the following three types of longitudinal data, namely (1) many short time series, (2) a few long stationary time series and (3) a few long non-stationary time series with time-varying covariates. A class of multivariate dispersion models is proposed to deal with data of type (1), in a spirit similar to multivariate analysis based on the multivariate normal distribution. Under these multivariate parametric models, population-averaged models (Diggle et al., 1994) are revisited, where approximate inferences for regression parameters are presented, including the generalized estimating equation (GEE) of Liang and Zeger (1986) as a special case. The thesis also presents a class of stationary autoregressive moving-average (ARMA) models with exponential dispersion model margins for data of type (2). The class of ARMA models is defined as a special case of a class of stationary infinite order moving average processes constructed by means of the thinning operation of Joe (1996a). For analysis of type (3) data, two classes of state space models, including one with stationary latent processes and another with non-stationary latent processes, are proposed. To estimate regression parameters in both classes of models, we develop an algorithm for solving the so-called Kalman estimating equation (KEE), corresponding to a modified EM-algorithm where the E-step is approximated by the Kalman smoother that estimates the latent process via the best linear unbiased predictor (BLUP). Two simulation studies are conducted in the thesis based on Poisson-gamma models. One is for the comparison of the efficiency of the K E E approach and the Monte Carlo E M (MCEM) algorithm. The other simulation study is for the examination of the utility of the model diagnosis for detecting the misspecification of stationarity and non-stationarity for latent process. The thesis contains two data analyses. One data set consists of daily counts of emergency room visits for respiratory diseases to the hospital of Prince George, British Columbia, along with covariates of air pollution variables and meteorological variables. These data are analyzed through state space models to investigate the relationship between air pollution and respiratory morbidity. The other data set, consisting of the monthly number of poliomyelitis cases in the USA from 1970 to 1983, is analyzed based on the Poisson stationary-gamma model to study whether or not there is an evidence of a decreasing trend in the rate of polio infections in the USA.
Extent	8244231 bytes
Genre	Thesis/Dissertation
Type	Text
File Format	application/pdf
Language	eng
Date Available	2009-03-17
Provider	Vancouver : University of British Columbia Library
Rights	For non-commercial purposes only, such as research, private study and education. Additional conditions apply, see Terms of Use https://open.library.ubc.ca/terms_of_use.
DOI	10.14288/1.0087739
URI	http://hdl.handle.net/2429/6196
Degree	Doctor of Philosophy - PhD
Program	Statistics
Affiliation	Science, Faculty of; Statistics, Department of
Degree Grantor	University of British Columbia
Graduation Date	1996-11
Campus	UBCV
Scholarly Level	Graduate
Aggregated Source Repository	DSpace

Item Media

ubc_1996-148394.pdf -- 7.86MB

Item Citations and Data

Rights

For non-commercial purposes only, such as research, private study and education. Additional conditions apply, see Terms of Use https://open.library.ubc.ca/terms_of_use.

Open Collections

UBC Theses and Dissertations

Some statistical models for the multivariate analysis of longitudinal data Song, Peter Xue-Kun

Abstract

Item Metadata

Item Media

Item Citations and Data

Rights