I really don't understand the point of not having like the full benchmark tested and rely on "Overall Performance" or "E-Commerce" aggregated data
model looks super cool, but idk I just feel rugged when labs do such things
also why everyone keeps having 3.5 on their benchmark even tho 3.6 exists
Computer-use agents are moving from the cloud to your local machine. Fast.
When we launched Holo3 two months ago, the production feedback was clear: digital agents need to be blazing fast, cost-effective, and versatile.
Today, we're dropping Holo 3.1, engineered to run anywhere, instantly.
Massive token throughput. Low latency. Ready for your local workflow!