second reinforcement learning session 5451617