dc.description.abstract |
A real-world environment is often only partially observable to agents, either because of noisy sensors or because of incomplete perception. Autonomous strategy planning under uncertainty poses two major challenges: the first is the autonomous segmentation of the state space for a given task, and the second is the emergence of complex behaviors that deal with each state segment. This thesis proposes three new approaches, namely ARKAQ-Learning, KAFAQ-Learning and KBVI, that handle both challenges by combining various techniques. ARKAQ makes use of ART2-A Networks augmented with Kalman filters and Q-Learning. KAFAQ is a finite state automaton using Kalman filters and Q-Learning. KBVI uses Monte Carlo methods and introduces a new technique for calculating Q-values in continuous domains. All are online algorithms with relatively low space and time complexity. The algorithms were run on several well-known Partially Observable Markov Decision Process problems, where representing the value function is more difficult than in the discrete case because the inputs are continuous distributions. The algorithms were able to reveal the hidden states, mapping non-Markovian observations to internal belief states, and to construct an approximately optimal policy on the internal belief state space. |
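For context, a minimal sketch of the standard POMDP machinery the abstract refers to, not the thesis's specific ARKAQ, KAFAQ or KBVI formulations: the Bayesian belief update that maps an observation $o$ received after action $a$ to a new belief state, and the Q-learning rule applied over belief states $b$ (with learning rate $\alpha$ and discount $\gamma$):

$b'(s') = \dfrac{O(o \mid s', a) \sum_{s} T(s' \mid s, a)\, b(s)}{\Pr(o \mid b, a)}$

$Q(b, a) \leftarrow Q(b, a) + \alpha \left[ r + \gamma \max_{a'} Q(b', a') - Q(b, a) \right]$

The proposed algorithms address the difficulty that $b$ is a continuous distribution rather than a discrete state, so $Q$ cannot be stored in a simple table over hidden states.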
|