UBC Theses and Dissertations

UBC Theses Logo

UBC Theses and Dissertations

Modelling peaks over thresholds in panel data : a grouped panel generalized Pareto regression model Liu, Zefan

Abstract

Extreme Value Theory (EVT) provides probabilistic tools to understand the behaviour of extreme events, making it widely applicable across various fields. When modelling the marginal distributions of the extremes in panel data, one may wish to balance the flexibility to capture the heterogeneity among margins and the efficiency of estimation through a combination of regression technique and assuming a latent group structure among subjects. This group structure facilitating information pooling may not be known a priori and needs to be estimated from data, which may then lead to potential physical interpretations. One existing approach addressing this modelling idea builds on the Block Maxima (BM) method in EVT, which can result in a loss of valuable information. Moreover, similar to the classic k-means clustering method, the current algorithm for estimating group structure is prone to converging to locally optimal solutions. We extend the current approach to a new framework called the grouped panel generalized Pareto regression model, which utilizes the Peaks Over Threshold (POT) method to model excesses over high thresholds, thereby leveraging extreme event information more exhaustively. To account for the conditional dependence structure within clusters of excesses, we introduce a dependence-window-based sandwich estimator for standard error estimation. Taking advantage of the POT method, we develop a new grouping algorithm inspired by hierarchical clustering, which relies on a pre-determined linkage and stopping rule. This algorithm estimates the latent number of groups, the group structure and associated parameters simultaneously, and it demonstrates improved performance in identifying the globally optimal structure and balancing the goodness of fit across subjects under reasonable conditions. The finite-sample performance of our methodology is carefully evaluated through simulation studies, and an application to the river flow data from 31 hydrological stations in Upper Danube river basin is used to illustrate the real-world applicability of our modelling strategy, where the estimation efficiency is notably improved and physically interpretable group structures are identified.

Item Media

Item Citations and Data

Rights

Attribution-NonCommercial-NoDerivatives 4.0 International