Continuing with the RLM, this time using ypi to turn a Pi agent into an RLM agent. Gemma 4 12B QAT is capable but a bit slow, so I switched to Qwen 3.6 35B. Next, I need to tweak ypi to provide better visibility into task/model distribution and usage metrics.
Just asked the Gemma 4 12B QAT model to write a long-running RLM-based agent that delegates only complex tasks to GPT 5.4, using the task itself as the test case. And off it went. No idea how it'll get past its 256K context limit, if at all.