Clustering and modelling of phase variation for functional data

UBC Theses and Dissertations

Featured Collection

UBC Theses and Dissertations

Clustering and modelling of phase variation for functional data Fu, Shing

Abstract

Our work is motivated by an analysis of elephant seal dive profiles which we view as functional data, specifically, as depth as a function of time, with data recorded almost continuously by sensors attached to the animal. The objective is to group profiles by shape to better understand the corresponding behavioural states of the seals. Most existing approaches rely on multivariate clustering methods applied to ad hoc summaries of the dive profile. Instead, we view each profile as arising from a function that is a deformation of a base shape. The deformation is regarded as phase variation and is represented by a latent warping function with a finite mixture distribution. We first propose a curve registration model to explicitly model amplitude and phase variations of functional data, with phase variation represented by smooth time transformations called warping functions. Inference is conducted via the stochastic approximation expectation-maximization (SAEM) algorithm. Our simulation study shows that the SAEM algorithm is computationally more stable and efficient than existing approaches in the literature for inference of this class of curve registration model with flexible warping. We then propose two clustering approaches based on our curve registration model for functional data: 1) a simultaneous approach that smooths the noisy raw profiles and estimates the base shape, the warping functions and the cluster membership and via SAEM algorithms; and 2) a two-step approach that applies clustering algorithms on the estimated warping functions. In contrast to generic clustering algorithms in the literature, our methods treat the clustering structure as heterogeneity in phase variation. The proposed method is applied to the analysis of elephant seal dive profiles and an analysis of human growth curves. We are able to obtain more intuitive clusters by focusing the clustering effort on phase variation.

Item Metadata

Title	Clustering and modelling of phase variation for functional data
Creator	Fu, Shing
Publisher	University of British Columbia
Date Issued	2019
Description	Our work is motivated by an analysis of elephant seal dive profiles which we view as functional data, specifically, as depth as a function of time, with data recorded almost continuously by sensors attached to the animal. The objective is to group profiles by shape to better understand the corresponding behavioural states of the seals. Most existing approaches rely on multivariate clustering methods applied to ad hoc summaries of the dive profile. Instead, we view each profile as arising from a function that is a deformation of a base shape. The deformation is regarded as phase variation and is represented by a latent warping function with a finite mixture distribution. We first propose a curve registration model to explicitly model amplitude and phase variations of functional data, with phase variation represented by smooth time transformations called warping functions. Inference is conducted via the stochastic approximation expectation-maximization (SAEM) algorithm. Our simulation study shows that the SAEM algorithm is computationally more stable and efficient than existing approaches in the literature for inference of this class of curve registration model with flexible warping. We then propose two clustering approaches based on our curve registration model for functional data: 1) a simultaneous approach that smooths the noisy raw profiles and estimates the base shape, the warping functions and the cluster membership and via SAEM algorithms; and 2) a two-step approach that applies clustering algorithms on the estimated warping functions. In contrast to generic clustering algorithms in the literature, our methods treat the clustering structure as heterogeneity in phase variation. The proposed method is applied to the analysis of elephant seal dive profiles and an analysis of human growth curves. We are able to obtain more intuitive clusters by focusing the clustering effort on phase variation.
Genre	Thesis/Dissertation
Type	Text
Language	eng
Date Available	2019-02-25
Provider	Vancouver : University of British Columbia Library
Rights	Attribution-NonCommercial-NoDerivatives 4.0 International
DOI	10.14288/1.0376529
URI	http://hdl.handle.net/2429/68405
Degree (Theses)	Doctor of Philosophy - PhD
Program (Theses)	Statistics
Affiliation	Science, Faculty of; Statistics, Department of
Degree Grantor	University of British Columbia
Graduation Date	2009-05
Campus	UBCV
Scholarly Level	Graduate
Rights URI	http://creativecommons.org/licenses/by-nc-nd/4.0/
Aggregated Source Repository	DSpace

Open Collections

UBC Theses and Dissertations

UBC Theses and Dissertations

Clustering and modelling of phase variation for functional data Fu, Shing

Abstract

Item Metadata

Item Media

Item Citations and Data

Rights