The Reward Signal you live by
Feb 16 · 2 min read · I first heard of the contextual bandit algorithm a couple of years back as an undergrad. I never gave much thought to it. Recently, I started working on reinforcement learning for the thrill of picking up my HRI research again and shaking off the dus...
Join discussion




