QTRB: TEAM-BASED REGION BUILDING USING Q-LEARNING TO DERIVE POLICY ON PROGRAMS PARAMETERIZED BY LOCAL REWARD SIGNAL

Sealy, Noah

dc.contributor.author	Sealy, Noah
dc.date.accessioned	2023-04-26T14:35:24Z
dc.date.available	2023-04-26T14:35:24Z
dc.date.issued	2023-04-24
dc.identifier.uri	http://hdl.handle.net/10222/82533
dc.description.abstract	While attempting to solve 2-dimensional grid world maze tasks, it was observed that genetic programming is limited by its random initialization and no use of local reward. This thesis proposes a hybrid algorithm called QTRB, team-based region building with q-learning, which attempts to integrate genetic programming and reinforcement learning to use local reward during evolution. During evolution, QTRB constructs programs based directly on local environmental reward; programs are then passed to a reinforcement learning agent to learn on as a model. QTRB was tested to solve variously sized 2-dimensional maze tasks, hypothesizing that policy can be derived from an agent learning from this model. The results suggest that QTRB can derive policy on the given tasks, with fewer direct environment queries than traditional q-learning as the task size scales.	en_US
dc.language.iso	en	en_US
dc.subject	genetic programming	en_US
dc.subject	reinforcement learning	en_US
dc.subject	local reinforcement	en_US
dc.subject	hybrid algorithms	en_US
dc.subject	qtrb	en_US
dc.title	QTRB: TEAM-BASED REGION BUILDING USING Q-LEARNING TO DERIVE POLICY ON PROGRAMS PARAMETERIZED BY LOCAL REWARD SIGNAL	en_US
dc.type	Thesis	en_US
dc.date.defence	2023-04-11
dc.contributor.department	Faculty of Computer Science	en_US
dc.contributor.degree	Master of Computer Science	en_US
dc.contributor.external-examiner	n/a	en_US
dc.contributor.graduate-coordinator	Dr. Michael McAllister	en_US
dc.contributor.thesis-reader	Dr. Vlado Keselj	en_US
dc.contributor.thesis-reader	Dr. Garnett Wilson	en_US
dc.contributor.thesis-reader	Dr. Dirk Arnold	en_US
dc.contributor.thesis-supervisor	Dr. Malcolm Heywood	en_US
dc.contributor.ethics-approval	Not Applicable	en_US
dc.contributor.manuscripts	Not Applicable	en_US
dc.contributor.copyright-release	Not Applicable	en_US

Find Full text

Files in this item

Name:: NoahSealy2023.pdf
Size:: 14.81Mb
Format:: PDF
Description:: Main thesis

View/Open

This item appears in the following Collection(s)

Faculty of Graduate Studies Online Theses

Show simple item record