Agentic code is a below average SWE in "how to write maintainable code" and an exceptional SWE in its breadth. It's in contrast to human SWEs where breadth and excellence tend to correlate.
By virtue of sheer size, big-tech SWEs have a huge variance in skill levels. I was fortunate to learn from some exceptional ones and also had to deal with some who were below the bar.
As a TL, I allocated more time and scrutiny to my code reviews of the latter group. Even after that, satisfaction was not guaranteed - that's because the structure of the code usually didn't lend itself to provability and generalizability. It basically was written with a few specific examples in mind and the tests were written for those examples. Any inputs outside those few examples could easily blow up.
Reviewing agentic code reminds me of the SWE that needs extra scrutiny. No matter how much I review, I don't feel satisfied with the code it outputs. The primary failure mechanism is the same - it seemingly writes code with examples in mind and its iteration loop explicitly encourages this - write a test and fix code until it passes.
For now, I am working with it because it is the future. If it learns from my feedback over time, I will be happy but if I have to repeat effectively the same feedback again and again over many mini-projects, I'll be annoyed. The jury is out on this one.