part one of reinforcement learning 6164852