The system will…
NFR1. Have one or more reinforcement learning agents
NFR2. Be learning, training, and working in simulation