What if a model could actively prune its own context as it is working with only the things it thinks is useful to carry forward, like some kind of dynamic context. For example I saw my agent grep for the word "ask" and notice that it gave too many results so it needed to try something else. That would also be a good time to sanitize that with a note. Maybe it would be more output tokens, but the input would stay in the smart zone a lot longer. Is anyone working on something like this?