In summary, we observe that different models pivot differently in response to hardened environments.
Our research also shows that a single model pivots differently when it operates from a different harness. As enterprises roll out agents at scale, beyond local coding agents, the ability to steer an agent onto the right path will become increasingly important.
Because the number of such combinations is large, an AI model will be needed to provide the steering instructions at runtime, and the dynamics of the virtual environment play a large role as the steering mechanism.