Population-based neuroevolution and policy-gradient / value-based RL in one canvas: GA, ES, adaptive ES, REINFORCE, A2C, DQN, and a clipped PPO-style update. Organic rendering, human play, and a split-screen duel against the learned champion — designed to read like a serious research instrument, not a toy.
The policy never sees the raw grid. It receives normalized ray distances to walls and body, food direction, heading, and auxiliary scalars — encouraging generalization over memorization.
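The observation layout described above can be sketched as follows. This is a minimal, illustrative reconstruction, not the demo's actual code: the grid size, the 8-direction ray fan, and the choice of auxiliary scalars (`hunger`, normalized length) are assumptions, chosen so the vector comes out 24-dimensional to match the 24→32→4 MLP mentioned below.

```python
GRID = 12  # hypothetical board size; the demo's actual grid is not specified here

DIRS = [(1,0),(1,1),(0,1),(-1,1),(-1,0),(-1,-1),(0,-1),(1,-1)]

def cast_ray(head, d, body):
    # Walk from the head along direction d until a wall is hit; record the
    # first body segment seen on the way. Distances are normalized by GRID.
    x, y = head
    dist_body = 1.0  # 1.0 means "no body segment along this ray"
    steps = 0
    while True:
        x += d[0]; y += d[1]; steps += 1
        if not (0 <= x < GRID and 0 <= y < GRID):
            return steps / GRID, dist_body
        if (x, y) in body and dist_body == 1.0:
            dist_body = steps / GRID

def observe(head, body, food, heading, hunger, length):
    obs = []
    for d in DIRS:                      # 8 rays x (wall, body) = 16 values
        obs.extend(cast_ray(head, d, body))
    obs += [(food[0] - head[0]) / GRID, # food direction as signed offsets = 2
            (food[1] - head[1]) / GRID]
    obs += [1.0 if i == heading else 0.0 for i in range(4)]  # heading one-hot = 4
    obs += [hunger, length / (GRID * GRID)]                  # auxiliary scalars = 2
    return obs                          # 24-dim input for a 24->32->4 MLP
```

Because every entry is a normalized distance or direction rather than a cell index, the same policy transfers across board positions — the point of the "generalization over memorization" claim.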
Fitness blends score², survival, and food-distance shaping. GA: tournament selection + crossover + Gaussian mutation. ES: elite parents + mutation-only offspring. Adaptive ES adjusts σ using a 1/5-style success heuristic on generation-to-generation improvement.
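The evolutionary operators above can be sketched on flat weight vectors. This is a hedged sketch, not the repository's implementation: the tournament size `k=3`, uniform crossover, the 1/5-rule target of 0.2, and the adaptation factor 1.2 are conventional illustrative choices, not values taken from the source.

```python
import random

def tournament(pop, fits, k=3):
    # Tournament selection: best of k randomly sampled individuals (k assumed).
    best = max(random.sample(range(len(pop)), k), key=lambda i: fits[i])
    return pop[best]

def crossover(a, b):
    # Uniform crossover over flat weight vectors: each gene from either parent.
    return [ai if random.random() < 0.5 else bi for ai, bi in zip(a, b)]

def mutate(w, sigma):
    # Gaussian mutation with step size sigma.
    return [wi + random.gauss(0.0, sigma) for wi in w]

def adapt_sigma(sigma, success_rate, target=0.2, factor=1.2):
    # 1/5-style rule: if generation-to-generation improvements are frequent,
    # widen the search (larger sigma); if rare, narrow it.
    return sigma * factor if success_rate > target else sigma / factor
```

The 1/5 heuristic is the self-tuning knob: it keeps mutation large while the population is still finding better snakes, then anneals automatically once improvement stalls.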
Shared MLP 24→32→4 (+ value head or target Q-network). A2C: TD(0) advantage with joint policy–value backprop. DQN: experience replay + periodic target sync. PPO-style: clipped importance ratio on on-policy updates. REINFORCE: Monte Carlo returns with a learned baseline.
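Two of the update rules named above fit in a few lines each. The sketch below shows the standard per-transition forms — the A2C TD(0) advantage and the PPO clipped surrogate — with conventional defaults (γ = 0.99, clip ε = 0.2) that are assumptions, not values confirmed by the source.

```python
import math

def td0_advantage(r, v_s, v_next, gamma=0.99, done=False):
    # A2C's TD(0) advantage: r + gamma * V(s') - V(s),
    # with bootstrapping cut off at episode termination.
    target = r + (0.0 if done else gamma * v_next)
    return target - v_s

def clipped_surrogate(logp_new, logp_old, advantage, eps=0.2):
    # PPO-style clipped objective for one transition: take the pessimistic
    # minimum of the unclipped and the ratio-clipped surrogate.
    ratio = math.exp(logp_new - logp_old)
    clipped = max(min(ratio, 1.0 + eps), 1.0 - eps)  # clamp to [1-eps, 1+eps]
    return min(ratio * advantage, clipped * advantage)
```

The `min` is the whole trick: when the new policy already moved far in the advantageous direction, the clipped term caps the gradient incentive, which is what keeps the on-policy updates stable without a trust-region solver.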
No server, no dataset download, no hidden API keys. What you see training is what runs in your tab — suitable for reproducible demos and open review.
Tip: switch the Optimization mode to compare evolution vs RL without changing the environment.