Passive sensory processing is often insufficient to guide biological organisms in complex environments. Rather, behaviourally relevant information can be accessed by performing so-called epistemic actions that explicitly aim at unveiling hidden information. However, it is still unclear how an autonomous agent can learn epistemic actions and how it can use them adaptively. In this work, we propose a definition of epistemic actions for POMDPs that derive from their characterizations in cognitive science and classical planning literature. We give theoretical insights about how partial observability and epistemic actions can affect the learning process and performance in the extreme conditions of model-free and memory-free reinforcement learning where hidden information cannot be represented. We finally investigate these concepts using an integrated eye-arm neural architecture for robot control, which can use its effectors to execute epistemic actions and can exploit the actively gathered information to efficiently accomplish a seek-and-reach task.

Ognibene, D., Catenacci Volpi, N., Pezzulo, G., Baldassarre, G. (2013). Learning Epistemic Actions in Model-Free Memory-Free Reinforcement Learning: experiments with a neuro-robotic model. In Biomimetic and Biohybrid Systems. Living Machines 2013 (pp.191-203). Springer [10.1007/978-3-642-39802-5_17].

Learning Epistemic Actions in Model-Free Memory-Free Reinforcement Learning: experiments with a neuro-robotic model

Ognibene D
Primo
;
2013

Abstract

Passive sensory processing is often insufficient to guide biological organisms in complex environments. Rather, behaviourally relevant information can be accessed by performing so-called epistemic actions that explicitly aim at unveiling hidden information. However, it is still unclear how an autonomous agent can learn epistemic actions and how it can use them adaptively. In this work, we propose a definition of epistemic actions for POMDPs that derive from their characterizations in cognitive science and classical planning literature. We give theoretical insights about how partial observability and epistemic actions can affect the learning process and performance in the extreme conditions of model-free and memory-free reinforcement learning where hidden information cannot be represented. We finally investigate these concepts using an integrated eye-arm neural architecture for robot control, which can use its effectors to execute epistemic actions and can exploit the actively gathered information to efficiently accomplish a seek-and-reach task.
paper
Active Vision; Reinforcement Learning; POMDP; Partial Observability; Planning; Epistemic Actions; Exploration
English
International Conference on Biomimetic and Biohybrid Systems: Living Machines, LM 2013
2013
Lepora N.F., Mura A., Krapp H.G., Verschure P.F.M.J., Prescott T.J.
Biomimetic and Biohybrid Systems. Living Machines 2013
978-3-642-39801-8
2013
8064 LNAI
191
203
http://link.springer.com/chapter/10.1007/978-3-642-39802-5_17
reserved
Ognibene, D., Catenacci Volpi, N., Pezzulo, G., Baldassarre, G. (2013). Learning Epistemic Actions in Model-Free Memory-Free Reinforcement Learning: experiments with a neuro-robotic model. In Biomimetic and Biohybrid Systems. Living Machines 2013 (pp.191-203). Springer [10.1007/978-3-642-39802-5_17].
File in questo prodotto:
File Dimensione Formato  
Ognibene-2013-Living Machines-VoR.pdf

Solo gestori archivio

Descrizione: Intervento a convegno
Tipologia di allegato: Publisher’s Version (Version of Record, VoR)
Licenza: Tutti i diritti riservati
Dimensione 583.35 kB
Formato Adobe PDF
583.35 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/301913
Citazioni
  • Scopus 6
  • ???jsp.display-item.citation.isi??? ND
Social impact