Had a lot of fun recreating the 13-year-old DeepMind paper on playing Atari with deep RL entirely from scratch!
Here, an agent learns from raw video frames alone to match human experts in 29/49 different games using the same hyperparameters! The most interesting thing on revisiting it is how the authors like to motivate experience replay and DQN from a neuroscientific/bio perspective, even though--