Two Small Extensions To Contextual Bandits
I have spoken quite a bit about how real system is a multi turn problem, but a lot of causal approaches to recommender systems use a single turn contextual bandit. This simplification occurs in part because the multi turn problem is so very complicat...
ml4interact.hashnode.dev2 min read