🚀 Narada AI sets a new state-of-the-art on the WebArena benchmark!
We’re excited to announce that the Narada Operator has achieved a 64.2% accuracy on the WebArena benchmark, setting a new SOTA result. This surpasses the performance of OpenAI Operator, IBM CUGA, and others in WebArena, a challenging benchmark focused on complex, long-horizon web workflows.
🔍 What makes this meaningful?
WebArena simulates complex, long web workflows, not just one-click demos. It involves multi-step tasks like managing ecommerce platforms, moderating forums, and navigating across multiple websites.
💡How did we achieve this?
The team behind Narada includes researchers with a strong track record, including papers at ICML 2024 (LLMCompiler) and ICML 2025 (Plan-and-Act). These efforts informed our approach to design a novel task decomposition approach, along with real time error correction and execution recovery, allowing Narada Operator to maintain high reliability over long, complex tasks.
🔐 What’s next?
Narada is purpose-built for agentic process automation in the enterprise. While we're excited to share our WebArena benchmark results, our focus remains narrow and deep: enabling high-reliability automation for specific, mission-critical enterprise workflows, not building general-purpose copilots or broad computer-use agents. Just as importantly, we deliver this with enterprise-grade security and compliance: HIPAA, GDPR, and CCPA compliant, and SOC 2 Type II certified.
💼 Have an enterprise use case in mind?
Book a demo with our team:
narada.ai/book-demo
🔗 More details available in our blog:
narada.ai/blog/narada-ai-web…
#AgenticProcessAutomation#AI
#Automation #EnterpriseAI #AgenticAI #NaradaAI #WebArena #LLM #ProcessAutomation #ComputerUseAgents #CUA #WebAgents #StartupNews #TechMilestone