Lecture 11: Learning by Reinforcement