Multi Armed Bandits - What, Why and How ?
Bandit algorithms are a method of solving a typical tradeoff known as the Exploration-Exploitation tradeoff. In this system, a learning model needs to repeatedly make a set of decisions in a limited knowledge discrete environment and make sure that t...
chandrakanth-talks-ml.hashnode.dev3 min read