We Had a Good Run, Dueling DDQN and I
Summary (for those who need to get back to scrolling)
This post continues an ongoing series documenting my attempt to train a chess engine from scratch. Here, I focus on why supervised pre-training of value-based RL agents (DDQN / Dueling DDQN) led to...
knightmareprotocol.hashnode.dev · 8 min read