The next wave of AI will not be won by better prompts. It will be won by systems that learn from experience.
Today, Prime Intellect Lab is out of beta, open for you to start training your own models.
The era of self-improving agents is here.
By performing SFT on tool outputs and RL on the assistant tokens, we can efficiently teach the model the environment dynamics. This happens on-policy: the LLM models the environment not in a vacuum but in response to its own actions.
We show strong results in the under-resourced programming language Forth and evaluate generalization to unrelated environments.
We also characterize what aspects of an environment lead to overfitting when using ECHO, how model behavior is impacted, and much more.
True agents model the world.
Current training provides no separation between agent and environment: pre-training only trains world modeling, RL only agentic actions. We combine both using ECHO by @DimitrisPapail and @VaishShrivas.
this is the biggest wake-up call to protect and nourish open source AI
if you don't build out sovereign and independent models infra closed labs will patronize you to an insulting degree
mythos will be bad ON PURPOSE on ai "frontier llm research" tasks, this is very very sad for the research community
also the fact that this is un purpose not visible to the user is crazy
This is why Prime Intellect must exist.
We must diffuse the tools of recursive self-improving AI, otherwise Anthropic will build the singleton and concentrate power until they run the world government.
mythos will be bad ON PURPOSE on ai "frontier llm research" tasks, this is very very sad for the research community
also the fact that this is un purpose not visible to the user is crazy
We're excited to join the NVIDIA Nemotron Coalition 💚
Frontier open models matter for the whole ecosystem.
We're bringing the RL infrastructure and environments we've built over the last year to help scale agentic capabilities.
primeintellect.ai/blog/nemot…
Our contribution: the post‑training & RL environments layer.
2,500 open RL environments, the verifiers framework, Prime Sandbox, and NeMo Gym integration — all natively integrated into NVIDIA's ecosystem, incl. Nemotron.
NVIDIA Nemotron 3 Ultra is here
We have Day‑0 support for Nemotron 3 Ultra in prime-rl and Lab.
Specialize Nemotron 3 Ultra for your use case.
primeintellect.ai/blog/nemot…
We're excited to join the NVIDIA Nemotron Coalition 💚
Frontier open models matter for the whole ecosystem.
We're bringing the RL infrastructure and environments we've built over the last year to help scale agentic capabilities.
primeintellect.ai/blog/nemot…
NVIDIA Nemotron 3 Ultra is here
We have Day‑0 support for Nemotron 3 Ultra in prime-rl and Lab.
Specialize Nemotron 3 Ultra for your use case.
primeintellect.ai/blog/nemot…
Every company is becoming an AI company, and the winners will own the loop: ship, learn, train, repeat.
NVIDIA Nemotron 3 Ultra gives the open ecosystem a frontier model built for agents. We give every team the stack to make it theirs.