Reinforcement learning in complex environments with locally trained naïve agents

UBC Theses and Dissertations

Gupta, Kashish

Abstract

Reinforcement learning has long been advertised as the approach capable of intelligently mimicking and understanding human learning and behavior. While the impact of the field's advances is not to be underrated, its application and extension to large, complex and highly dynamic environments remain inefficient, inaccurate or unsolved. As the complexity of objectives increases, the trend in reinforcement learning research is to tackle it with more computational power and training samples. The inspiration for the proposed methodology is derived from human learning, where increasingly complex objectives are learned with limited computational power by re-purposing previously learned skills. An intuitive and elegant concept is presented which exploits abstract symmetries present in training environments to train a naïve agent and bypass some of the core challenges of reinforcement learning training. The naïve agent, trained in a local environment, can then be used in numerous ways to improve training in high-dimensional state-space environments. The proposed solution is combined with heuristic-based planning, learning from demonstration and state-space abstraction methods to demonstrate the efficacy and ease of adaptability of the proposed concept across a range of domains. The proposed method provides a structured approach to the training process and improves the trained agent's generalization capabilities and training sample efficiency. While the presented concept benefits from additional information about the state-space structure, it is notably different from traditional reinforcement learning approaches, which augment the training process with expensive domain-specific knowledge to improve sample efficiency. Experiments and analyses are presented for challenging navigation and control environments, solved with an augmented off-policy family of reinforcement learning methods.

Item Metadata

Title	Reinforcement learning in complex environments with locally trained naïve agents
Creator	Gupta, Kashish
Supervisor	Najjaran, Homayoun
Publisher	University of British Columbia
Date Issued	2021
Genre	Thesis/Dissertation
Type	Text
Language	eng
Date Available	2021-11-12
Provider	Vancouver : University of British Columbia Library
Rights	Attribution-NonCommercial-NoDerivatives 4.0 International
DOI	10.14288/1.0403370
URI	http://hdl.handle.net/2429/80187
Degree (Theses)	Doctor of Philosophy - PhD
Program (Theses)	Electrical Engineering
Affiliation	Applied Science, Faculty of; Engineering, School of (Okanagan)
Degree Grantor	University of British Columbia
Graduation Date	2022-02
Campus	UBCO
Scholarly Level	Graduate
Rights URI	http://creativecommons.org/licenses/by-nc-nd/4.0/
Aggregated Source Repository	DSpace
