Detect policy drift with an RL training environment
Debug and fix DNS zone files in a reinforcement learning environment