When monkeys tackle novel complex behavioral tasks by trial-and-error they select actions from repertoires of sensorimotor primitives that allow them to search solutions in a space which is coarser than the space of fine movements. Neuroscientific findings suggested that upper-limb sensorimotor primitives might be encoded, in terms of the final goal-postures they pursue, in premotor cortex. A previous work by the authors reproduced these results in a model based on the idea that cortical pathways learn sensorimotor primitives while basal ganglia learn to assemble and trigger them to pursue complex reward-based goals. This paper extends that model in several directions: a) it uses a Kohonen network to create a neural map with population encoding of postural primitives; b) it proposes an actor-critic reinforcement learning algorithm capable of learning to select those primitives in a biologically plausible fashion (i.e., through a dynamic competition between postures); c) it proposes a procedure to pre-train the actor to select promising primitives when tackling novel reinforcement learning tasks. Some tests (obtained with a task used for studying monkeys engaged in learning reaching-action sequences) show that the model is computationally sound and capable of learning to select sensorimotor primitives from the postures’ continuous space on the basis of their population encoding.

Ognibene, D., Rega, A., Baldassarre, G. (2006). A model of reaching that integrates reinforcement learning and population encoding of postures. In From Animals to Animats 9th International Conference on Simulation of Adaptive Behavior, SAB 2006, Rome, Italy, September 25-29, 2006. Proceedings (pp.381-393). Springer [10.1007/11840541_32].

A model of reaching that integrates reinforcement learning and population encoding of postures

Ognibene D;
2006

Abstract

When monkeys tackle novel complex behavioral tasks by trial-and-error they select actions from repertoires of sensorimotor primitives that allow them to search solutions in a space which is coarser than the space of fine movements. Neuroscientific findings suggested that upper-limb sensorimotor primitives might be encoded, in terms of the final goal-postures they pursue, in premotor cortex. A previous work by the authors reproduced these results in a model based on the idea that cortical pathways learn sensorimotor primitives while basal ganglia learn to assemble and trigger them to pursue complex reward-based goals. This paper extends that model in several directions: a) it uses a Kohonen network to create a neural map with population encoding of postural primitives; b) it proposes an actor-critic reinforcement learning algorithm capable of learning to select those primitives in a biologically plausible fashion (i.e., through a dynamic competition between postures); c) it proposes a procedure to pre-train the actor to select promising primitives when tackling novel reinforcement learning tasks. Some tests (obtained with a task used for studying monkeys engaged in learning reaching-action sequences) show that the model is computationally sound and capable of learning to select sensorimotor primitives from the postures’ continuous space on the basis of their population encoding.
paper
Premotor Cortex; Adulthood Phase; Posture Controller; Dynamic Competition; Accumulator Model; Reinforcement learning; Artificial Neural Networks;
English
International Conference on Simulation of Adaptive Behavior, SAB 2006
2006
Stefano Nolfi; Gianluca Baldassarre; Raffaele Calabretta; John C. T. Hallam; Davide Marocco; Jean-Arcady Meyer; Orazio Miglino; Domenico Parisi;
From Animals to Animats 9th International Conference on Simulation of Adaptive Behavior, SAB 2006, Rome, Italy, September 25-29, 2006. Proceedings
978-3-540-38608-7
2006
2
381
393
https://link.springer.com/chapter/10.1007/11840541_32
none
Ognibene, D., Rega, A., Baldassarre, G. (2006). A model of reaching that integrates reinforcement learning and population encoding of postures. In From Animals to Animats 9th International Conference on Simulation of Adaptive Behavior, SAB 2006, Rome, Italy, September 25-29, 2006. Proceedings (pp.381-393). Springer [10.1007/11840541_32].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/301945
Citazioni
  • Scopus 13
  • ???jsp.display-item.citation.isi??? 10
Social impact