The standard reference for reinforcement learning, covering MDPs, dynamic programming, TD learning, Q-learning and policy gradient methods. Good condition. Pickup in Zürich only.