Arşiv ve Dokümantasyon Merkezi
Dijital Arşivi

Autonomous strategy planning under uncertainty

Basit öğe kaydını göster

dc.contributor Graduate Program in Computer Engineering.
dc.contributor.advisor Akın, H. Levent.
dc.contributor.author Sardağ, Alp.
dc.date.accessioned 2023-03-16T10:05:45Z
dc.date.available 2023-03-16T10:05:45Z
dc.date.issued 2006.
dc.identifier.other CMPE 2006 S27
dc.identifier.uri http://digitalarchive.boun.edu.tr/handle/123456789/12472
dc.description.abstract A real world environment is often partially observable for agents either because of noisy sensors or incomplete perception. Autonomous strategy planning under uncer- tainty has two major challenges. The 歔rst one is autonomous segmentation of the state space for a given task, and the second, emerging complex behaviors, that deal with each state segment. This thesis proposes three new approaches, namely ARKAQ-Learning, KAFAQ-Learning and KBVI, that handle both challenges by utilizing combinations of various techniques. ARKAQ makes use of ART2-A Networks augmented with Kalman Filters and Q-Learning. KAFAQ is a 歔nite state automaton using Kalman 歔lters and Q-Learning. KBVI uses Monte Carlo methods and introduces a new technique to calculate Q-values for continuous domains. All are online algorithms with relatively low space and time complexity. The algorithms were run for some well-known Partially Observable Markov Decision Process problems, where the problem of representing the value function is more di±cult than the discrete case because inputs are continuous distributions. The algorithms could reveal the hidden states, mapping non-Markovian observations to internal belief states, and also could construct an approximate optimal policy on the internal belief state space.
dc.format.extent 30cm.
dc.publisher Thesis (M.S.)-Bogazici University. Institute for Graduate Studies in Science and Engineering, 2006.
dc.relation Includes appendices.
dc.relation Includes appendices.
dc.subject.lcsh Reinforcement learning (Machine learning)
dc.subject.lcsh Markov processes.
dc.title Autonomous strategy planning under uncertainty
dc.format.pages xvi, 135 leaves;


Bu öğenin dosyaları

Bu öğe aşağıdaki koleksiyon(lar)da görünmektedir.

Basit öğe kaydını göster

Dijital Arşivde Ara


Göz at

Hesabım