NeurIPS 2020

Confounding-Robust Policy Evaluation in Infinite-Horizon Reinforcement Learning


Meta Review

Overall, the reviewers found the paper technically sound, novel, and significant. Personally, I find it quite exciting since it's the first to consider the problem of partial identification in settings with an infinite horizon. My suggestion to improve the paper is to take into account the reviewers' issues and recommendations. After all, my recommendation is "accept."