group project path discovery via reinforcement learning 2822662