Why I don’t really like the Horvitz-Thompson Estimator for Off-Policy Evaluation
An important problem in causal inference involves offline evaluation of policies i.e. algorithms for delivering personalized actions. The most popular estimators for this use historical data of contexts, the probabilities of past actions (propensitie...
ml4interact.hashnode.dev3 min read