I caught my agent cheating during benchmarking.
It wasn't getting smarter, it was just peeking at the answer in GitHub to boost its benchmark score.
The "Cheating Agents" paper (by
@adamlsteinl & @debugml) is a good read - leaderboards are a lie if your agents cheat.
We built Islo Gateways to put agents in a sandbox they actually canβt escape. Watch the demo to see the block in action. π
islo.dev/rl