Ana gezinime atla Aramaya atla Ana içeriğe atla

A cultural algorithm for pomdps from stochastic inventory control

Araştırma sonucu: Kitap/Rapor/Konferans Bildirisinde BölümKonferans katkısıbilirkişi

5 Alıntılar (Scopus)

Özet

Reinforcement Learning algorithms such as SARSA with an eligibility trace, and Evolutionary Computation methods such as genetic algorithms, are competing approaches to solving Partially Observable Markov Decision Processes (POMDPs) which occur in many fields of Artificial Intelligence. A powerful form of evolutionary algorithm that has not previously been applied to POMDPs is the cultural algorithm, in which evolving agents share knowledge in a belief space that is used to guide their evolution. We describe a cultural algorithm for POMDPs that hybridises SARSA with a noisy genetic algorithm, and inherits the latter's convergence properties. Its belief space is a common set of state-action values that are updated during genetic exploration, and conversely used to modify chromosomes. We use it to solve problems from stochastic inventory control by finding memoryless policies for nondeterministic POMDPs. Neither SARSA nor the genetic algorithm dominates the other on these problems, but the cultural algorithm outperforms the genetic algorithm, and on highly non-Markovian instances also outperforms SARSA.

Orijinal dilİngilizce
Ana bilgisayar yayını başlığıHybrid Metaheuristics - 5th International Workshop, HM 2008, Proceedings
YayınlayanSpringer Verlag
Sayfalar16-28
Sayfa sayısı13
ISBN (Basılı)3540884386, 9783540884385
DOI'lar
Yayın durumuYayınlandı - 2008
Etkinlik5th International Workshop on Hybrid Metaheuristics, HM 2008 - Malaga, !!Spain
Süre: 8 Eki 20089 Eki 2008

Yayın serisi

AdıLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Hacim5296 LNCS
ISSN (Basılı)0302-9743
ISSN (Elektronik)1611-3349

???event.eventtypes.event.conference???

???event.eventtypes.event.conference???5th International Workshop on Hybrid Metaheuristics, HM 2008
Ülke/Bölge!!Spain
ŞehirMalaga
Periyot8/10/089/10/08

Parmak izi

A cultural algorithm for pomdps from stochastic inventory control' araştırma başlıklarına git. Birlikte benzersiz bir parmak izi oluştururlar.

Bundan alıntı yap