discovering hierarchy in reinforcement learning using hexq 402815