Filter
Exclude
Time range
-
Near
⚡ AsyncWebRL: Efficient multi-step RL for visual web agents. Async design overlaps rollout, gradient update, policy refresh. Everlasting rollout pool lightweight screenshot handling cuts GPU idle time dramatically. #AI #WebAgent #ReinforcementLearning
29
🙌Sharing our latest work to appear at #ACL2026 diagnosing LLM web agents from a hierarchical planning perspective. We answer the following questions: - Do the current web agents plan at the high level in the same way as humans do? - If not, how to improve this alignment? - If a web agent makes a good high-level plan, can the task be solved at the low level? - Finally, can replanning help the agent resolve its mistake and get back on the right track? 🌟Our contributions: We propose a new high-level plan representation based on PDDL. We found that: - Planning with PDDL can produce higher-quality high-level plans, - but even with perfect plans, execution at the low level remains the critical bottleneck. - Replanning boosts success but hurts plan alignment, though PDDL is more robust here too. We hope this breakdown analysis brings clarity to the challenges remained in this rapidly evolving field and we advocate for more such fine-grained analysis in agent development. Paper: arxiv.org/abs/2603.14248 Code: github.com/Ziyu-Yao-NLP-Lab/… Work led by @aghzalm and with @GregoryJStein @GMUCompSci #ACL2026 #WebAgent #LLMAgent
🚨New Paper at ACL'26🚨 LLM web agents remain far from human reliability, and we try to explain why. We built a hierarchical framework to find out where and why they fail by breaking agent behavior into 3 levels: high-level planning, low-level execution, and replanning. 🧵
1
11
1,302
Jun 3
Two things we'd been putting off. WebAgent is now a proper browser automation agent full Playwright tools, not just navigation. Mouse, keyboard, screenshots, the works. WebVisionAgent is deprecated. And AutomationError: a new exception type for genuinely unfixable errors. Missing credentials, unreachable services. Stops execution immediately instead of sending the agent into a retry loop.
1
60
你会用它来做什么? #AI #WebAgent #开源 #自动化
3
🌐 AutoWebGLM: Model 6B parameter mengalahkan GPT-4 di web navigation! HTML simplification curriculum learning RL. Bukti model kecil arsitektur tepat bisa unggul. RPA berbasis AI kini lebih murah. #AI #WebAgent #OpenSource
8
This photo will be taken in the year 2046 The city seen here was printed 3 months earlier and designed to resemble a late 20th century metropolis The family in the photo own a mom & pop plumbing company They have a 3 star review on WebAgent
29
A Web Agent user asked me today: “How do I update Web Agent?” 😄 And honestly... It does not get any simpler than this: Just hit refresh. 🔄✨ Open. Refresh. Use. 🚀 This is exactly why we built Web Agent this way. Agentic AI should not require a VPS, a Mac mini, SSH access, package managers, or hours of setup just to try something powerful. It should feel instant. It should feel safe. It should feel accessible. Web Agent is our experiment in making powerful agentic AI available to everyone, with zero setup and full browser isolation. Sometimes the best feature is not the biggest one. Sometimes it is the one that removes the most friction. Update Web Agent? Just refresh. 😎 Try it here: webagent.aratech.ae Repo: github.com/nikola66/web-agen… #WebAgent #AgenticAI #OpenSource #AI #BrowserNative #AIAgents #Dubai #UAE #Startup #BuildInPublic
32
Project WebAgent Framework: 開発者が自作のAIエージェントをChromeのDOM(ウェブ構造)と安全に連携させ、Web上の操作を自動化するための新しい拡張機能規格。
1
57
Web Agent version 0.0.6 shipping out... Agents shouldn’t only live in chat—they should leave traces you can see. github.com/nikola66/web-agen… Latest Web Agent ships: • PARA knowledge vault (knowledge-vault/) — Projects / Areas / Resources / Archives KnowledgeVault/ for real notes • wiki_setup · wiki_sync · wiki_search — scaffold, sync runtime memory into markdown, search the vault • /plan — research → dated plan under .webagent/plans/ → implement on the next message (spec-first, not vibes-first) Two north stars: 🧠Tiago Forte / @fortelabs — PARA isn’t “folders,” it’s cognitive clarity: actionable structure you’ll actually reopen. 🫀 Andrej Karpathy / @karpathy — the gist-style LLM-maintained markdown wiki: compile knowledge into linked .md, iterate like software—not disposable context. We’re not pretending embeddings don’t exist—we’re saying your KB layer should stay legible: Obsidian-openable, diffable, yours. Browser-native. Local-first. Open source. Drop your best vault workflows below 👇 #WebAgent #AIAgents #PARA #SecondBrain #Obsidian #Markdown #LocalFirst #OpenSource #LLM #BuildInPublic #IndieDev Demo: webagent.aratech.ae
24
🤖 AutoWebGLM: Model 6B parameter mengalahkan GPT-4 di navigasi web! Rahasianya: HTML simplification, action space design, dan curriculum learning. Cocok untuk price monitoring, form automation, data scraping. #AI #WebAgent #OpenSource
9
🌐 AutoWebGLM: Model 6B parameter mengalahkan GPT-4 di navigasi web! Rahasianya: HTML simplification, action space design, dan curriculum learning. Bukti bahwa ukuran bukan segalanya. #AI #WebAgent #OpenSource #MachineLearning #HermesAgent
9
🌐 AutoWebGLM: Model 6B parameter mengalahkan GPT-4 di navigasi web! HTML simplification curriculum learning membuat agen AI bisa browse, klik, isi form mandiri. Game changer untuk automation & scraping. #AI #WebAgent #MachineLearning
22
【速報】AIがWebを自由に操作する時代へ。 自律型Webエージェントが公開 1️⃣「家族旅行の予約を最安値で」の一言で完結 2️⃣AIがフライト比較、ホテル・レストラン確保 3️⃣複数サイトを横断し、代わりに入力 Webブラウザは、人間ではなくAIが操作するツールへ。 #AI #自律型エージェント #WebAgent #生産
50
Ai #agent domain names available: CodexAgent .com AgentAim .com WorkerAgent .com WomenAgent .com Chat-Agent .com TyphoonAgent .com WebAgent .io #ai #agents #domain
6
41
1,683
Webデザインの未来、AIが描く。 MM-WebAgent、ウェブページ生成の新境地を切り拓く。 階層的なマルチモーダルアプローチにより、複雑なデザインから実装までを一気通貫で自動化。 テキスト指示だけで、クリエイティブなアイデアが瞬時に具現化。 これまで時間を要した試行錯誤、劇的な短縮。 Webサイト構築の常識、根本からの変革を予感。 クリエイターの創造性、無限の可能性を解放。 この革新的な論文、必見。 arxiv.org/pdf/2604.15309v1

1
30
[2026-04-19] 기술 브리핑 🤖 논문: 멀티모달 웹 에이전트 MM-WebAgent 🔬 ML커뮤: Gemma-4 파인튜닝 경험 활발 공유 💡 로컬LLM: Qwen3.6-35B-A3B 코딩 성능 급상승 🤗 HF트렌딩: MiniMax-M2.7·Qwen3.6 인기 🌐 기술뉴스: Anna's Archive 3.2억달러 패소
98
マルチモーダルAIエージェント「MM-WebAgent」自動ウェブページ生成を実現! MITなどの研究機関が、テキスト・画像・ビデオを同時に処理してウェブページを自動生成するAIエージェント「MM-WebAgent」を発表しました。ユーザーが「こんなデザイン、こんな内容のページがほしい」と指示するだけで、複合的なメディアを組み合わせたページが実装できるシステムです。 arxiv.org/abs/2604.15309v1
21
夜のAIウォッチ。 ニュースは「Anthropic tells OpenClaw users to pay up」、論文は「MM-WebAgent: A Hierarchical Multimodal Web Agent for Webpage Generation」が気になった。 プロダクトの動きと研究の流れを同時に押さえると明日の解像度が上がる。 therundown.ai/p/anthropic-te…
21
夜のAI論文メモ。 arXivで「MM-WebAgent: A Hierarchical Multimodal Web Agent for Webpage Generation」をチェック。 論点は エージェント・動画・コード。 明日の実装や検証にどう効くか目線で読みたい。 arxiv.org/abs/2604.15309v1
22