johann.GPT

johann.GPT

267 Photos and videos

Tweets

johann.GPT

@ProgramerJohann

Mar 27

不要试图通过更长的 Prompt 来让模型变乖，需要在 Harness 层建立硬性规则。

290

johann.GPT

johann.GPT

@ProgramerJohann

Mar 22

微信官方通过插件支持连接 OpenClaw ，在我的-设置-插件中启用。

740

johann.GPT

johann.GPT

@ProgramerJohann

Mar 22

197

johann.GPT

johann.GPT

@ProgramerJohann

Feb 12

最近在研究记忆，逛 GitHub 发现了 @seclink 这个 Awesome AI Memory 的仓库，非常不错，聚合了最新的 Memory 的列表。

866

johann.GPT

johann.GPT

@ProgramerJohann

Feb 12

github.com/XiaomingX/awesome…

GitHub - XiaomingX/awesome-ai-memory: A list of AI memory projects（ AI长期记忆的开源和商业项目列表）

A list of AI memory projects（ AI长期记忆的开源和商业项目列表）. Contribute to XiaomingX/awesome-ai-memory development by creating an account on GitHub.

github.com

190

johann.GPT

johann.GPT

@ProgramerJohann

Jan 6

Claude Code 这个 Loading 的单词原来也是大模型生成的

323

johann.GPT

johann.GPT

@ProgramerJohann

Jan 4

分享一个我很喜欢的融合了低空经济类人形机器人 AI陪伴的项目，光尽职调查就看了一百多集

394

johann.GPT

johann.GPT

@ProgramerJohann

3 Dec 2025

我最爱用的 Bun 被 Anthropic 收购了。，用于驱动 Claude code 等工具。

607

johann.GPT

johann.GPT

@ProgramerJohann

7 Nov 2025

又来聊 grep 和语义相似性搜索了。 Cursor 给的 A/B 实验结论：启用语义搜索后，我们观察到代理代码的留存率提升了0.3%；在包含1,000个及以上文件的大型代码库中，这一提升达到2.6%。其实他也多次强调了，在 1000 个文件以上对更明显。现在 Cursor 是 grep 语义相似性多路召回的方式。话又说回来，这个其实在 RAG 是基操了～

733

johann.GPT

johann.GPT

@ProgramerJohann

7 Nov 2025

cursor.com/cn/blog/semsearch

通过语义搜索改进智能体 · Cursor

语义搜索显著提升编码智能体表现，准确率提高 12.5%，增强代码留存并减少用户不满意的请求。

cursor.com

230

johann.GPT

johann.GPT

@ProgramerJohann

6 Nov 2025

腾讯微信AI团队提出了一种名为“连续自回归语言模型”（CALM）的新架构。它颠覆了传统大模型逐字（token）预测的范式，改为一次性预测一个包含多个token信息的“语义向量”。这好比将模型思考的基本单位从“字”提升到了“想法”或“短句”，大大提高了生成效率。

441

johann.GPT

johann.GPT

@ProgramerJohann

6 Nov 2025

arxiv.org/abs/2510.27688

Continuous Autoregressive Language Models

The efficiency of large language models (LLMs) is fundamentally limited by their sequential, token-by-token generation process. We argue that overcoming this bottleneck requires a new design axis...

arxiv.org

210

johann.GPT

johann.GPT

@ProgramerJohann

5 Nov 2025

聊聊《代码执行与 MCP：构建更高效的智能体》简单总结就是：通过代码执行来处理大量数据，例如数据分析场景，LLM 接收到的参数只有，我给他的 csv 的路径，他去写代码来处理，而不是我们直接把整个 csv 的数据读到上下文去，然后给出今天有 50 个订单。技术上来说就是，通过代码执行，可以不用把 tool call result 放到 context 上，从而节省大量 context token。

1,296

johann.GPT

johann.GPT

@ProgramerJohann

5 Nov 2025

anthropic.com/engineering/co…

Code execution with MCP: building more efficient AI agents

Learn how code execution with the Model Context Protocol enables agents to handle more tools while using fewer tokens, reducing context overhead by up to 98.7%.

anthropic.com

248

johann.GPT

johann.GPT

@ProgramerJohann

3 Nov 2025

今天第一次读到 “Harness as a Service” ， Harness 原意是"马具、挽具"——套在马身上，让马能拉车干活的整套装备。类比到 AI 领域： LLM = 野马（有能力，但不受控） Harness = 马具（驯服、引导、让它干活的装备） Agent = 能干活的马车（可用的生产力工具） ----- 用 LLM 和用 Harness 的区别：没有 Harness: 你："帮我分析代码库" LLM："好的！[输出一段文字]" 你：😤 但文件没读、报告没存... 有 Harness: 你："帮我分析代码库" Agent： ✅ 自动读取文件 ✅ 调用分析工具 ✅ 生成并保存报告你：😊 完美！目前属 Harness 的有： Claude Code SDK、Deep Agents 等。 DeepAgents 这类 Harness 出现的原因，表面上是为了"快速验证想法"，但更深层的原因是：从 LLM API 到可用 Agent 之间存在巨大的工程鸿沟——包括上下文管理、工具调用、错误处理、状态管理、权限控制等 1500 行与业务无关的基础设施代码，需要 3-4 周才能搭建完成。而 AI 的使用范式正在从"对话"转向"自主执行任务"，每个公司都在重复造这些轮子。Harness 让开发者把 70% 的时间从"造基础设施"转向"写业务逻辑"（提示词、领域工具、知识），把上线时间从数周压缩到数天。所以 Harness 出现的本质是：填补 LLM 到 Agent 的鸿沟，让"快速迭代"成为可能。之所以会在此时出现，是因为技术成熟（工具调用、长上下文、强推理）共识形成商业动机对齐。

1,390

johann.GPT

johann.GPT

@ProgramerJohann

3 Nov 2025

vtrivedy.com/posts/claude-co…

The Claude Code SDK and the Birth of HaaS (Harness as a Service) | vtrivedy

As tasks require more autonomous behavior, we're seeing a shift from LLM APIs to Harness APIs. Explore how Claude Code's SDK enables rapid agent development through HaaS (Harness as a Service).

vtrivedy.com

203