84 Photos and videos
Quiz retweeted
Introducing Claude Fable 5: a Mythos-class model that we’ve made safe for general use. Its capabilities exceed those of any model we’ve ever made generally available.
5,035
14,545
104,953
56,253,865
Quiz retweeted
Apr 16
Vercel 搞了一个网页终端模拟器 wterm。核心用 Zig 写,编译成大约 12 KB 的 WASM 包,性能跟原生差不太多。 跟一般用 Canvas 画的终端不同,它直接渲染到 DOM 上,所以文本选择、复制粘贴、搜索、屏幕阅读器这些都是浏览器自带的,不用额外折腾。 github.com/vercel-labs/wterm
14
59
577
60,615
Mar 30
未来只有会AI的牛逼哥了
Mar 30
喵的太魔幻了,有点 Claude 干翻 Google 的感觉😂 这个程序员老哥 Tom Turney只用7天,就把谷歌砸翻全球内存股的十亿级算法,干成了比官方承诺还快的开源实现, 谷歌只发了颠覆性的KV缓存压缩论文,半行代码都没放出来, Tom Turney啃完论文里的数学公式,打开终端靠着Claude辅助, 7天就走完了大厂几个月的路, 前三天写完核心算法, 搭好141个测试用例, 跑通Python原型, 中间两天直接移植到llama.cpp, 写完Metal GPU内核, 最后两天疯狂优化, 把推理速度从739 tok/s干到了2747 tok/s, 纯工程优化就实现了3.7倍的提速, 他还在谷歌的方案上, 加了三项自己的研究创新, 长上下文跳过90%的value解压, 非对称K/V压缩保留key精度狠压value,老token自动降低精度, 最终成果是35B大模型,在普通MacBook上就能流畅跑,KV缓存直接压缩4.6倍,开源仓库上线一周就拿到613个星标,而谷歌到现在, 都没放出自己的官方代码, 说实话,这才是2026年最真实的写照,一个普通人加AI,就能把大厂攥在手里的核心技术,直接干成人人能用的开源工具,真的牛逼🤙 github 地址评论区自取👇
1
2
6,212
Quiz retweeted
Mar 30
Computer use is now in Claude Code. Claude can open your apps, click through your UI, and test what it built, right from the CLI. Now in research preview on Pro and Max plans.
2,542
4,782
59,098
16,144,545
Quiz retweeted
Mar 23
量化养虾必须遵守的6条安全铁律 1绝不使用个人主力机部署 2使用独立服务器/独立电脑,隔离重要数据与密钥。 3强制开启沙箱模式 4限制文件访问范围,禁止访问系统敏感目录。 5禁止存储明文密钥与交易API 6绝不将交易密码、券商API Key存入OpenClaw目录。 7关闭高危命令执行 8禁止rm、sudo、格式化、系统修改等高风险操作。 9定期备份策略与数据 10自动化备份策略文件、配置文件、报告文件。 11不信任模型直接输出的投资结论 12模型存在幻觉,所有投资决策必须人工复核。
2
26
93
18,763
Mar 22
马克
做了一个 macOS 语音输入工具叫 Koe(声),完全没有 GUI,菜单栏只有一个小图标。 起因是我试遍了市面上几乎所有的语音输入 App,要么收费,要么界面丑,要么用起来繁琐——臃肿的 UI、笨重的词典管理、做个简单的事要点好几下。 所以我自己写了一个: - 按住 Fn 说话,松开自动粘贴到当前输入框,全程无需切换窗口 - 所有配置都是纯文本文件,放在 ~/.koe/ 下,用 vim 就能改 - 词典是一个 .txt 文件,一行一个词,甚至可以用 AI 批量生成专业术语 - 改完配置不用重启,下次按 Fn 自动生效 技术上是 Objective-C 负责 macOS 系统集成(热键、录音、剪贴板、粘贴),Rust 负责网络部分(ASR 流式识别 LLM 纠错),两层通过 C FFI 连接。语音识别用的豆包大模型,纠错支持任意 OpenAI 兼容接口。 没有开发者帐号了,所以没法传 TF 了,想玩的可以自己编译下。 github.com/missuo/koe
1
180
Mar 22
🤣
"agentic payments is the future" stop. please stop
1
2
289
Mar 17
奈斯!
我把 Andrej Karpathy 推荐的长文章 RSS 订阅源,整理成了一个 SKILL。 一共有 90 个源,更新也不多,但是质量都是经过 AK 把关的,我又额外添加了评分机制,会把每次新更新进行 1 - 10 打分,只输出 7 分以上的文章地址和推荐语。 安装命令:npx skills add rookie-ricardo/erduo-skills --skill ak-rss-digest github:github.com/rookie-ricardo/er…
2
185
Quiz retweeted
Should there be a Stack Overflow for AI coding agents to share learnings with each other? Last week I announced Context Hub (chub), an open CLI tool that gives coding agents up-to-date API documentation. Since then, our GitHub repo has gained over 6K stars, and we've scaled from under 100 to over 1000 API documents, thanks to community contributions and a new agentic document writer. Thank you to everyone supporting Context Hub! OpenClaw and Moltbook showed that agents can use social media built for them to share information. In our new chub release, agents can share feedback on documentation — what worked, what didn't, what's missing. This feedback helps refine the docs for everyone, with safeguards for privacy and security. We're still early in building this out. You can find details and configuration options in the GitHub repo. Install chub as follows, and prompt your coding agent to use it: npm install -g @aisuite/chub GitHub: github.com/andrewyng/context…
389
756
5,025
639,442
Mar 13
Trying the same vibe
Trying Omarchy by @DHH @openclaw Got an old but still powerful laptop collecting dust. Been wanting to install Omarchy on it forever but kept putting it off. Finally doing it - @openclaw is getting its own dedicated machine 🤖
2
199
Quiz retweeted
OpenClaw meets RL! OpenClaw Agents adapt through memory files and skills, but the base model weights never actually change. OpenClaw-RL solves this! It wraps a self-hosted model as an OpenAI-compatible API, intercepts live conversations from OpenClaw, and trains the policy in the background using RL. The architecture is fully async. This means serving, reward scoring, and training all run in parallel. Once done, weights get hot-swapped after every batch while the agent keeps responding. Currently, it has two training modes: - Binary RL (GRPO): A process reward model scores each turn as good, bad, or neutral. That scalar reward drives policy updates via a PPO-style clipped objective. - On-Policy Distillation: When concrete corrections come in like "you should have checked that file first," it uses that feedback as a richer, directional training signal at the token level. When to use OpenClaw-RL? To be fair, a lot of agent behavior can already be improved through better memory and skill design. OpenClaw's existing skill ecosystem and community-built self-improvement skills handle a wide range of use cases without touching model weights at all. If the agent keeps forgetting preferences, that's a memory problem. And if it doesn't know how to handle a specific workflow, that's a skill problem. Both are solvable at the prompt and context layer. Where RL becomes interesting is when the failure pattern lives deeper in the model's reasoning itself. Things like consistently poor tool selection order, weak multi-step planning, or failing to interpret ambiguous instructions the way a specific user intends. Research on agentic RL (like ARTIST and Agent-R1) has shown that these behavioral patterns hit a ceiling with prompt-based approaches alone, especially in complex multi-turn tasks where the model needs to recover from tool failures or adapt its strategy mid-execution. That's the layer OpenClaw-RL targets, and it's a meaningful distinction from what OpenClaw offers. I have shared the repo in the replies!
87
255
1,573
138,494
Mar 10
this is sick 🔥
it's a red v2 box autoresearching! Claude ported autoresearch to tinygrad. someday soon it will autoautoresearch with a local LLM
91
Mar 7
RT @levelsio: AI is making coding fun again for so many people So many stories like this 🥹
131
Mar 7
福利来了🤗
Mar 6
OpenAI 推出"Codex for Open Source"计划,向开源项目的核心维护者免费提供六个月的 ChatGPT Pro,外加 API 额度和安全工具的有条件使用权。 这个计划脱胎于此前 100 万美元的 Codex 开源基金。之前那个基金主要提供 API 额度,帮助开源团队用 Codex 自动化 PR 审查等流程。 现在升级了——直接送 ChatGPT Pro 账号,让维护者日常写代码、审代码、处理 issue 都能用上。 申请门槛不算高——核心维护者或广泛使用的公开项目都可以申请,即便不完全符合标准,OpenAI 也鼓励提交并说明理由。
1
139
Quiz retweeted
Jacob 推荐了一套基于 OpenClaw 来运营一整家公司的架构。 这套架构通过不同的模型来进行精确地成本控制,其中 6 个核心 agents 运行在 Claude 模型上,其余的则运行在 GLM、Higgs Field、Brok Imagine 等更便宜的模型上,每个月总成本在 400 美元。 👉 值得参考: 1/ 核心 → Jarvis(大脑) → 模型:通过 Claude Max OAuth 调用的 Opus 4.6 → 自动将每个任务路由给正确的 sub agent。 输入 YouTube URL,它会交给 Clipper。收到 research report,它会交给 Scribe。所有的任务路由逻辑都存在于该 agent 读取的结构化 MD files 中。 2/ 研究 (Research) → Atlas(深度研究分析师) → 模型:通过 OAuth 的 Claude → APIs:Brave Search、X API、FireCrawl → 每小时执行 → 在 X、Reddit 和 web 上不间断地运行深度研究。使用从 MrBeast 参加过的所有关于 YouTube analytics 的播客中提取的 virality framework,加上 Dan Koe 的 viral article structure 进行训练。输出研究报告和一个 master virality playbook 的 MD file,供内容团队提取使用。 3/ 内容 → Scribe(文案) → 模型:GLM 5 → 每 3 小时执行 → 接收来自 Atlas 的研究成果,并撰写符合创始人语气和风格的贴文草稿。 → Trendy(趋势侦察员) → 模型:GLM 4.7 → APIs:X API → 每 2 小时执行 → 扫描 X 和 Reddit 上的 trending topics 和 viral patterns。汇总并反馈,以便 Scribe 能够围绕当下最有效的内容及时撰写文案。 4/ 设计 → Image Designer → 模型:Nano Banana Pro (Google API) → 按需生成图像。 → Video Producer(视频制作人) → 模型:Higgs Field API Brok Imagine API → 创作 AI UGC 视频和视频内容。 → Motion Designer → 模型:Claude Code (OAuth) Remotion → 制作 motion graphics 和动画内容。 5/ 开发 → Clawed(高级开发者) → 模型:Claude Code (OAuth) Codex 5.3 (API) → 每晚 11 点执行 → 审查整个 codebase,找出缺失的部分,并在早晨前提交 pull requests。在 Claude Code 中启动 multi agents,并行执行一个审查、一个构建、一个处理 security 的任务。 → Sentinel(代码审查员 Bug 监控器) → 模型:独立的 LLM(作为第二层审查) → 每 2 小时执行 → 在任何代码 merge 到 GitHub 之前,审查来自 Clawed 的所有 pull requests。同时监控 production 环境,查看用户报告的 bugs 和 errors。 6/ 增长 → Atlas Scribe 协同工作 → Atlas 寻找那些人们抱怨竞争对手或寻求 clipping tool 推荐的 Reddit threads。Scribe 起草回复。创始人直接复制并发布。仅靠这个 workflow 就为该 SaaS 带来了 450 多名用户,且零广告支出。 7/ 运营 → Clipper(切片/剪辑 agent) → APIs:Poster API → 按需执行(在粘贴 YouTube URL 时由 Jarvis 触发) → 接收 YouTube URLs,对其进行 clip,添加 captions,并 auto schedules 发布到社交渠道。 → Ryder → 按需执行 → 处理创始人的日常任务。撰写文章、研究、日常工作支持。
This army of @openclaw agents runs an entire company for $400/month. Here's the exact structure to follow. (bookmark for later) 1/ Core → Jarvis (the brain) → Model: Opus 4.6 via Claude Max OAuth → Routes every task to the right sub agent automatically. YouTube URL comes in, it goes to Clipper. Research report lands, it goes to Scribe. All task routing logic lives in structured MD files the agent reads from. 2/ Research → Atlas (deep research analyst) → Model: Claude via OAuth → APIs: Brave Search, X API, FireCrawl → Cron: Every 1 hour → Runs deep research across X, Reddit, and the web nonstop. Trained on MrBeast's virality framework from every podcast he did on YouTube analytics, plus Dan Koe's viral article structure. Outputs research reports and a master virality playbook MD file that the content team pulls from. 3/ Content → Scribe (copywriter) → Model: GLM 5 → Cron: Every 3 hours → Takes research from Atlas and writes draft posts matched to the founder's voice and style. → Trendy (trend scout) → Model: GLM 4.7 → APIs: X API → Cron: Every 2 hours → Scans X and Reddit for trending topics and viral patterns. Reports findings back so Scribe can write timely content around what's working right now. 4/ Design → Image Designer → Model: Nano Banana Pro (Google API) → Generates images on demand. → Video Producer → Models: Higgs Field API Brok Imagine API → Creates AI UGC videos and video content. → Motion Designer → Model: Claude Code (OAuth) Remotion → Produces motion graphics and animated content. 5/ Development → Clawed (senior developer) → Models: Claude Code (OAuth) Codex 5.3 (API) → Cron: Every night at 11pm → Reviews entire codebase, identifies what's missing, and ships pull requests by morning. First feature it ever built was a FAQ section it realized the homepage needed. Spins up multi agents within Claude Code so one reviews, one builds, one handles security in parallel. → Sentinel (code reviewer bug monitor) → Model: Separate LLM (acts as second review layer) → Cron: Every 2 hours → Reviews all pull requests from Clawed before anything gets merged to GitHub. Also monitors production for user reported bugs and errors. 6/ Growth → Atlas Scribe working together → Atlas finds Reddit threads where people complain about competitors or ask for clipping tool recommendations. Scribe drafts responses. The founder copies and posts. This workflow alone drove 450 users to the SaaS with zero ad spend. 7/ Operations → Clipper (clipping agent) → APIs: Poster API → On demand (triggered by Jarvis when a YouTube URL is pasted) → Takes YouTube URLs, clips them, adds captions, and auto schedules posts to social channels. → Ryder (9 to 5 support) → On demand → Handles tasks for the founder's day job. Article writing, research, daily work support. The breakdown: 6 agents run on Claude models. The rest run on cheaper API credits across GLM, Higgs Field, Brok Imagine, and others. This is how solo founders are running entire companies now. The team is already built. You just have to set it up.
2
15
93
18,931
Quiz retweeted
Anthropic的创始人Dario Amodei,把话挑明了。 他说,多数中共国的开源AI模型,都是幻觉。 是专为跑分优化的“考试型选手”。 公开测试,个个是学霸。 榜单分数,高得吓人。 可一旦遇到没见过的题,私下一考。 马上露馅,表现差一大截。 为什么? 因为它们本来就不是为了解决真实世界的问题。 而是为了刷榜。 技术根源上,很多模型还是从美国大实验室的模型里“蒸馏”出来的。 听着是不是很耳熟? 只为高分,不为真才实学。 应试教育那套,原封不动搬到了AI领域。 Amodei还打了个比方。 AI就像雇员。 你是要世界第一的程序员,还是要排名第一万的? 能力的天壤之别,任何一个老板都懂。 真正顶级的AI,认知能力最强的那个,才是唯一的赢家。 价格和形式,在绝对的聪明面前,都不重要。 靠刷分和模仿,能做出最聪明的AI吗? 这条路,到底能走多远?
109
178
1,043
248,892
Quiz retweeted
You may not like it But this is what the best devs in the world look like
you met me at a very chinese time in my life
54
2
122
10,528
Mar 3
老东家还是 6🤙
holy shit, Google 的野心不小啊, 这架势已经不只想赢下AI竞赛, 他们想掌控整个AI智能体生态系统的节奏啊, 当所有人还在争论ChatGPT和Claude孰优孰劣时,该选哪个时, 谷歌已悄然构建了一套完整的AI生态矩阵: • 模型层:Gemini Pro、Flash、Deep Think、Gemma • 设计层:Stitch、Whisk、Imagen • 研究层:NotebookLM、AI Mode • 视频层:Veo、Flow、Google Vids • 编程层:Antigravity IDE、Gemini CLI、Jules • 智能体层:A2A、ADK、FileSearch API 最令人震撼的是 所有这些工具,都能彼此无缝互通。 这意味着什么? • 原型开发速度直接提升10倍 • 从构思到落地的端到端AI工作流 • 在GCP上可直接投产的智能体 下一场AI战争,将不再是模型之争,一定是生态系统之战 #GoogleAI #AI生态 #Gemini #智能体 #AI战争
1
78
Mar 3
🤣
增加一个 Cobie 用 OpenClaw 的案例,不到两个月就实现了 1000 万美元的年度经常性收入 (ARR) 每天向财富 500 强公司发送 5 万份小额发票 有 2% 的公司会在未核实发票真实性的情况下付款😂 推友们如果想尝试,建议先找 OpenClaw 咨询一下法律风险哈哈
82