NeurIPS 2020

Efficient Exploration of Reward Functions in Inverse Reinforcement Learning via Bayesian Optimization


Meta Review

Three out of four knowledgeable referees support acceptance based on the paper's contributions, and I also recommend acceptance. I believe R4's concerns about the theoretical aspects were addressed in the rebuttal. In the revised version of the paper, please present your additional experiments addressing the concerns of reviewers R3 and R4, your comments regarding runtime (R1), and your comments regarding the rho-projection (R1, R3, R4). Furthermore, R3 has some remaining theoretical concerns that were not clarified in the rebuttal; please elaborate on these in the revised paper.