Demirer, Mert; Syrgkanis, Vasilis; Lewis, Gregory; … - 2019
We consider off-policy evaluation and optimization with continuous action spaces. We focus on observational data where the data collection policy is unknown and needs to be estimated. We take a semi-parametric approach where the value function takes a known parametric form in the treatment, but...