Optimal planning with approximate model-based reinforcement learning

UBC Theses and Dissertations

Featured Collection

UBC Theses and Dissertations

Optimal planning with approximate model-based reinforcement learning Kao, Hai Feng

Abstract

Model-based reinforcement learning methods make efficient use of samples by building a model of the environment and planning with it. Compared to model-free methods, they usually take fewer samples to converge to the optimal policy. Despite that efficiency, model-based methods may not learn the optimal policy due to structural modeling assumptions. In this thesis, we show that by combining model- based methods with hierarchically optimal recursive Q-learning (HORDQ) under a hierarchical reinforcement learning framework, the proposed approach learns the optimal policy even when the assumptions of the model are not all satisfied. The effectiveness of our approach is demonstrated with the Bus domain and Infinite Mario – a Java implementation of Nintendo’s Super Mario Brothers.

Item Metadata

Title	Optimal planning with approximate model-based reinforcement learning
Creator	Kao, Hai Feng
Publisher	University of British Columbia
Date Issued	2011
Description	Model-based reinforcement learning methods make efficient use of samples by building a model of the environment and planning with it. Compared to model-free methods, they usually take fewer samples to converge to the optimal policy. Despite that efficiency, model-based methods may not learn the optimal policy due to structural modeling assumptions. In this thesis, we show that by combining model- based methods with hierarchically optimal recursive Q-learning (HORDQ) under a hierarchical reinforcement learning framework, the proposed approach learns the optimal policy even when the assumptions of the model are not all satisfied. The effectiveness of our approach is demonstrated with the Bus domain and Infinite Mario – a Java implementation of Nintendo’s Super Mario Brothers.
Genre	Thesis/Dissertation
Type	Text
Language	eng
Date Available	2012-01-04
Provider	Vancouver : University of British Columbia Library
Rights	Attribution-ShareAlike 3.0 Unported
DOI	10.14288/1.0052158
URI	http://hdl.handle.net/2429/39889
Degree	Master of Science - MSc
Program	Computer Science
Affiliation	Science, Faculty of; Computer Science, Department of
Degree Grantor	University of British Columbia
Graduation Date	2012-05
Campus	UBCV
Scholarly Level	Graduate
Rights URI	http://creativecommons.org/licenses/by-sa/3.0/
Aggregated Source Repository	DSpace

Open Collections

UBC Theses and Dissertations

UBC Theses and Dissertations

Optimal planning with approximate model-based reinforcement learning Kao, Hai Feng

Abstract

Item Metadata

Item Media

Item Citations and Data

Rights