- Library Home /
- Search Collections /
- Open Collections /
- Browse Collections /
- UBC Theses and Dissertations /
- Monotone optimal policies for quasivariational inequalities...
Open Collections
UBC Theses and Dissertations
UBC Theses and Dissertations
Monotone optimal policies for quasivariational inequalities arising in optimal portfolio liquidation Crawford, Daniel J.
Abstract
This thesis studies the Hamilton-Jacobi-Ballman quasivariational inequality (HJBQVI), the corresponding optimal value function, and discrete schemes useful for approximating this value function. Moreover, the structural properties of the optimal policy of particular discrete scheme is studied. The motivation is to find a convergent, approximating scheme for the otherwise complicated HJBQVI that has monotone policy structure that can be exploited in a stochastic gradient estimation scheme to approximate optimal policy function parameters. In order to motivate this approach, we consider the problem of optimal liquidation of a single risky asset portfolio as an impulse control problem. The model is defined over continuous time, state, and compact action sets, and the optimal liquidation value and strategy are found from the viscosity solution of a HJBQVI. It is shown that the optimal strategy is monotone in the number of shares owned and the time remaining to liquidation. This structural result is exploited to estimate the optimal policy via a reinforcement learning method based on the simultaneous perturbation stochastic approximation (SPSA) algorithm. The optimal policy can be estimated without knowledge of the parameters of the underlying model.
Item Metadata
Title |
Monotone optimal policies for quasivariational inequalities arising in optimal portfolio liquidation
|
Creator | |
Publisher |
University of British Columbia
|
Date Issued |
2014
|
Description |
This thesis studies the Hamilton-Jacobi-Ballman quasivariational inequality (HJBQVI), the corresponding optimal value function, and discrete schemes useful for approximating this value function. Moreover, the structural properties of the optimal policy of particular discrete scheme is studied. The motivation is to find a convergent, approximating scheme for the otherwise complicated HJBQVI that has monotone policy structure that can be exploited in a stochastic gradient estimation scheme to approximate optimal policy function parameters. In order to motivate this approach, we consider the problem of optimal liquidation of a single risky asset portfolio as an impulse control problem. The model is defined over continuous time, state, and compact action sets, and the optimal liquidation value and strategy are found from the viscosity solution of a HJBQVI. It is shown that the optimal strategy is monotone in the number of shares owned and the time remaining to liquidation. This structural result is exploited to estimate the optimal policy via a reinforcement learning method based on the simultaneous perturbation stochastic approximation (SPSA) algorithm. The optimal policy can be estimated without knowledge of the parameters of the underlying model.
|
Genre | |
Type | |
Language |
eng
|
Date Available |
2014-12-09
|
Provider |
Vancouver : University of British Columbia Library
|
Rights |
Attribution-NonCommercial-NoDerivs 2.5 Canada
|
DOI |
10.14288/1.0167066
|
URI | |
Degree | |
Program | |
Affiliation | |
Degree Grantor |
University of British Columbia
|
Graduation Date |
2015-02
|
Campus | |
Scholarly Level |
Graduate
|
Rights URI | |
Aggregated Source Repository |
DSpace
|
Item Media
Item Citations and Data
Rights
Attribution-NonCommercial-NoDerivs 2.5 Canada