Abstract
Biological brains still far exceed artificial intelligence systems, both in terms of control capabilities and power consumption. Spiking neural networks (SNNs) are a promising model, inspired by neuroscience and functionally closer to the way neurons process information. While recent advancements in neuromorphic hardware allow energy efficient synthesis of spiking networks, the training of such networks remains an open problem. In this work we focus on reinforcement learning with sparse and delayed rewards. The proposed architecture has four distinct layers and addresses the limitation of previous models in terms of scalability with input dimensions. Our SNN is evaluated on classical reinforcement learning and control tasks and is compared to two common RL algorithms: Q-learning and deep Q-network (DQN). Experiments demonstrate that the proposed network outperforms Q-learning on a task with six-dimensional observation space and compares favorably to the evaluated DQN configurations in terms of stability and memory requirements.
| Original language | English |
|---|---|
| Title of host publication | 2020 International Joint Conference on Neural Networks, IJCNN 2020 - Proceedings |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| ISBN (Electronic) | 9781728169262 |
| DOIs | |
| Publication status | Published - Jul 2020 |
| Externally published | Yes |
| Event | 2020 International Joint Conference on Neural Networks, IJCNN 2020 - Virtual, Glasgow, United Kingdom Duration: 19 Jul 2020 → 24 Jul 2020 |
Publication series
| Name | Proceedings of the International Joint Conference on Neural Networks |
|---|
Conference
| Conference | 2020 International Joint Conference on Neural Networks, IJCNN 2020 |
|---|---|
| Country/Territory | United Kingdom |
| City | Virtual, Glasgow |
| Period | 19/07/20 → 24/07/20 |
Bibliographical note
Publisher Copyright:© 2020 IEEE.
Keywords
- Reinforcement learning
- reward-modulated STDP
- spiking neural networks