- Library Home /
- Search Collections /
- Open Collections /
- Browse Collections /
- UBC Theses and Dissertations /
- Exploring partially observable Markov decision processes...
Open Collections
UBC Theses and Dissertations
UBC Theses and Dissertations
Exploring partially observable Markov decision processes by exploting structure and heuristic information Leung, Siu-Ki
Abstract
This thesis is about chance and choice, or decisions under uncertainty. The desire for creating an intelligent agent performing rewarding tasks in a realistic world urges for working models to do sequential decision making and planning. In responding to this grand wish, decision-theoretic planning (DTP) has evolved from decision theory and control theory, and has been applied to planning in artificial intelligence. Recent interest has been directed toward Markov Decision Processes (MDPs) introduced from operations research. While fruitful results have been tapped from research in fully observable MDPs, partially observable MDPs (POMDPs) are still too difficult to solve as observation uncertainties are incorporated. Abstraction and approximation techniques become the focus. This research attempts to enhance POMDPs by applying A l techniques. In particular, we transform the linear POMDP constructs into a structured representation based on binary decision trees and Bayesian Networks to achieve compactness. A handful of tree-oriented operations is then developed to perform structural belief updates and value computation. Along ii with the structured representation, we explore the belief space with a heuristic online search approach, in which best-first search strategy with heuristic pruning is employed. Experimenting with a structured testbed domain reveals great potentials of exploiting structure and heuristics to empower POMDPs for more practical applications.
Item Metadata
Title |
Exploring partially observable Markov decision processes by exploting structure and heuristic information
|
Creator | |
Publisher |
University of British Columbia
|
Date Issued |
1996
|
Description |
This thesis is about chance and choice, or decisions under uncertainty. The
desire for creating an intelligent agent performing rewarding tasks in a realistic
world urges for working models to do sequential decision making and planning.
In responding to this grand wish, decision-theoretic planning (DTP)
has evolved from decision theory and control theory, and has been applied to
planning in artificial intelligence. Recent interest has been directed toward
Markov Decision Processes (MDPs) introduced from operations research.
While fruitful results have been tapped from research in fully observable
MDPs, partially observable MDPs (POMDPs) are still too difficult to solve
as observation uncertainties are incorporated. Abstraction and approximation
techniques become the focus.
This research attempts to enhance POMDPs by applying A l techniques.
In particular, we transform the linear POMDP constructs into a structured
representation based on binary decision trees and Bayesian Networks to
achieve compactness. A handful of tree-oriented operations is then developed
to perform structural belief updates and value computation. Along
ii with the structured representation, we explore the belief space with a heuristic
online search approach, in which best-first search strategy with heuristic
pruning is employed.
Experimenting with a structured testbed domain reveals great potentials
of exploiting structure and heuristics to empower POMDPs for more practical
applications.
|
Extent |
4266139 bytes
|
Genre | |
Type | |
File Format |
application/pdf
|
Language |
eng
|
Date Available |
2009-03-09
|
Provider |
Vancouver : University of British Columbia Library
|
Rights |
For non-commercial purposes only, such as research, private study and education. Additional conditions apply, see Terms of Use https://open.library.ubc.ca/terms_of_use.
|
DOI |
10.14288/1.0051460
|
URI | |
Degree | |
Program | |
Affiliation | |
Degree Grantor |
University of British Columbia
|
Graduation Date |
1997-05
|
Campus | |
Scholarly Level |
Graduate
|
Aggregated Source Repository |
DSpace
|
Item Media
Item Citations and Data
Rights
For non-commercial purposes only, such as research, private study and education. Additional conditions apply, see Terms of Use https://open.library.ubc.ca/terms_of_use.