This repository shows one sample implementation of using a Recurrent Neural Network (RNN) architecture, specifically the Long Short Term Memory RNN (LSTM) in Reinforcement Learning (RL). The LSTM is used in RL to demonstrate one way of employing memory to the learning agent in order to efficiently remember the important signals from the environment and successfully complete the task while maximizing the reward. The main source of this notebook is the paper by Bram Bakker in 2002 entitled Reinforcement Learning with Long Short-Term memory. However, the main focus is on the incorporation of LSTM in Q-learning in the T-maze task. Some important details will be explained as much as possible but not everything will be revealed
-
Notifications
You must be signed in to change notification settings - Fork 0
This repository shows one sample implementation of using a Recurrent Neural Network (RNN) architecture, specifically the Long Short Term Memory RNN (LSTM) in Reinforcement Learning (RL). The LSTM is used in RL to demonstrate one way of employing memory to the learning agent in order to efficiently remember the important signals from the environm…
dcbiton/olfactory-search-rl-lstm
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
This repository shows one sample implementation of using a Recurrent Neural Network (RNN) architecture, specifically the Long Short Term Memory RNN (LSTM) in Reinforcement Learning (RL). The LSTM is used in RL to demonstrate one way of employing memory to the learning agent in order to efficiently remember the important signals from the environm…
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published