NIPS Proceedingsβ

Ofir Nachum

4 Papers

  • DualDICE: Behavior-Agnostic Estimation of Discounted Stationary Distribution Corrections (2019)
  • A Lyapunov-based Approach to Safe Reinforcement Learning (2018)
  • Data-Efficient Hierarchical Reinforcement Learning (2018)
  • Bridging the Gap Between Value and Policy Based Reinforcement Learning (2017)