I made this small html file that does neural-qlearning. It runs in CPU (no GPU), no experience-replay, and only 2 actions (left vs right). It barely works. Time to turn theory into practice as Ive got a 3d isosurface based game world to bring to life.
Did a little neural qlearning in a single html file and tiny neuralnet. No experience replay. Learns by gradient of squared error of the qlearning equation considering 3 game states, here, actionA(here), actionB(here), qscore estimates of each. Balances double cartpole a little