Monotone optimal policies for quasivariational inequalities arising in optimal portfolio liquidation

UBC Theses and Dissertations

Featured Collection

UBC Theses and Dissertations

Monotone optimal policies for quasivariational inequalities arising in optimal portfolio liquidation Crawford, Daniel J.

Abstract

This thesis studies the Hamilton-Jacobi-Ballman quasivariational inequality (HJBQVI), the corresponding optimal value function, and discrete schemes useful for approximating this value function. Moreover, the structural properties of the optimal policy of particular discrete scheme is studied. The motivation is to find a convergent, approximating scheme for the otherwise complicated HJBQVI that has monotone policy structure that can be exploited in a stochastic gradient estimation scheme to approximate optimal policy function parameters. In order to motivate this approach, we consider the problem of optimal liquidation of a single risky asset portfolio as an impulse control problem. The model is defined over continuous time, state, and compact action sets, and the optimal liquidation value and strategy are found from the viscosity solution of a HJBQVI. It is shown that the optimal strategy is monotone in the number of shares owned and the time remaining to liquidation. This structural result is exploited to estimate the optimal policy via a reinforcement learning method based on the simultaneous perturbation stochastic approximation (SPSA) algorithm. The optimal policy can be estimated without knowledge of the parameters of the underlying model.

Item Metadata

Title	Monotone optimal policies for quasivariational inequalities arising in optimal portfolio liquidation
Creator	Crawford, Daniel J.
Publisher	University of British Columbia
Date Issued	2014
Description	This thesis studies the Hamilton-Jacobi-Ballman quasivariational inequality (HJBQVI), the corresponding optimal value function, and discrete schemes useful for approximating this value function. Moreover, the structural properties of the optimal policy of particular discrete scheme is studied. The motivation is to find a convergent, approximating scheme for the otherwise complicated HJBQVI that has monotone policy structure that can be exploited in a stochastic gradient estimation scheme to approximate optimal policy function parameters. In order to motivate this approach, we consider the problem of optimal liquidation of a single risky asset portfolio as an impulse control problem. The model is defined over continuous time, state, and compact action sets, and the optimal liquidation value and strategy are found from the viscosity solution of a HJBQVI. It is shown that the optimal strategy is monotone in the number of shares owned and the time remaining to liquidation. This structural result is exploited to estimate the optimal policy via a reinforcement learning method based on the simultaneous perturbation stochastic approximation (SPSA) algorithm. The optimal policy can be estimated without knowledge of the parameters of the underlying model.
Genre	Thesis/Dissertation
Type	Text
Language	eng
Date Available	2014-12-09
Provider	Vancouver : University of British Columbia Library
Rights	Attribution-NonCommercial-NoDerivs 2.5 Canada
DOI	10.14288/1.0167066
URI	http://hdl.handle.net/2429/51421
Degree (Theses)	Master of Applied Science - MASc
Program (Theses)	Electrical and Computer Engineering
Affiliation	Applied Science, Faculty of; Electrical and Computer Engineering, Department of
Degree Grantor	University of British Columbia
Graduation Date	2015-02
Campus	UBCV
Scholarly Level	Graduate
Rights URI	http://creativecommons.org/licenses/by-nc-nd/2.5/ca/
Aggregated Source Repository	DSpace

Open Collections

UBC Theses and Dissertations

UBC Theses and Dissertations

Monotone optimal policies for quasivariational inequalities arising in optimal portfolio liquidation Crawford, Daniel J.

Abstract

Item Metadata

Item Media

Item Citations and Data

Rights