jietang

jietang

65 Photos and videos

Tweets

jietang

@jietang

17h

GLM-5.2 is Fully Open, Frontier Intelligence Belongs to Everyone Today, the sudden restriction of certain frontier models is deeply regrettable. At a time when access to frontier models is abruptly cut off for non-technical reasons, we are even more convinced of one thing: science should be global. The path to AGI (Artificial General Intelligence) must never be enclosed by high walls. We have always believed that AGI should be the cornerstone for all of humanity to collaboratively explore the boundaries of intelligence and solve complex challenges, rather than a privilege monopolized by a few rules and subject to revocation at any moment. In the face of external blockades and restrictions, our attitude is one of radical openness. Frontier intelligence must remain open-source, accessible, and buildable, serving every dedicated developer. GLM-5.2 is Zhipu's most capable open-source model to date. It not only supports a truly usable 1M context window but also maintains a continuous lead in the independent completion of long-horizon tasks, providing solid foundational support for building complex agent applications. It also continues to be our main engine for creating the strongest domestic coding model. Tonight at 5:21—at this special moment—GLM-5.2 will officially be available to all GLM Coding Plan users (including Lite / Pro / Max). The API will also go live next week. A step closer to frontier intelligence for everyone. The future of AI is open, and it is for the people. ModelKey: GLM-5.2

201

552

5,600

612,874

Z.ai

jietang retweeted

Z.ai

@Zai_org

22h

Intelligence should be open, accessible, and ready to build with, empowering every developer, everywhere. GLM-5.2 is now available to all GLM Coding Plan users, including Lite, Pro, Max, and Team plans. docs.z.ai/devpack/latest-mod… As our new flagship model, GLM-5.2 delivers powerful coding capabilities, usable 1M-context support, and continued strengths in long-horizon tasks. API and Chatbot services will launch next week. The model will also be officially open-sourced next week under the MIT License. The future of AI is open, and it belongs to the people.

How to Switch Models - Overview - Z.AI DEVELOPER DOCUMENT

docs.z.ai

285

812

6,644

1,166,356

jietang

jietang

@jietang

May 22

GLM-5.1-highspeed is coming, 400 tokens per second. Very expensive, but bring a new possibility.

581

40,618

Z.ai

jietang retweeted

Z.ai

@Zai_org

May 20

x.com/i/article/205720692320…

127

879

183,302

jietang

jietang

@jietang

May 12

Recent thoughts: The Shift to Long-Horizon Tasks The most likely breakthrough this year will be in long-horizon tasks. We are moving toward a stage where Large Language Models (LLMs) learn to complete extended, complex missions by interacting with Agent environments. This is perhaps where the true value of LLMs lies. Take cybersecurity as an example: imagine a model that continuously hunts for software bugs and vulnerabilities. While it sounds like a search process, it’s actually the model learning the high-level intuition and methodology of a professional hacker. Unlike humans, AI can run 24/7 without fatigue. It could potentially find exploits at a much higher frequwill ency and claim bounties on platforms like HackerOne or BugCrowd. It sounds fun, but fundamentally, it's a revolution that displaces the hacker. If even hackers are being "disrupted," one can only imagine the impact on general programmers. From One-Person to None-Person Companies Building on long-horizon capabilities, Autonomous Agent Systems (AAS) will inevitably become the next frontier. Last year, we were discussing the rise of the "One Person Company" (OPC). I didn't expect us to move so quickly toward the "None Person Company" (NPC). It’s an ironic twist—we might all end up as NPCs in this new ecosystem. Engineering the Impossible: Memory and Learning To realize the vision above, we must solve three technical pillars: Memory, Continual Learning, and Self-Judging. I used to think these would require massive paradigm shifts and years of research. However, the pressure from both the technical and application sides is so intense that we are seeing these capabilities emerge through ingenious engineering "tricks": Memory: Long context windows (1M ) and RAG have significantly bridged the gap. Continual Learning: While true continual learning remains difficult, the release cycles are shrinking. Global models are updated monthly; domestic models are catching up. If we reach weekly updates by next year, it will effectively function as continual learning. Self-Judging: This remains the most elusive, yet models like Opus 4.7 are already demonstrating early self-correction and judgment capabilities. The Self-Evolving Endgame The most difficult—and most promising—path is Self-Evolution. The current wave is incredibly fierce. I suspect that models like Claude may have already achieved a baseline for self-training: writing their own code, cleaning their own data, generating synthetic data, and then training on it. It might "waste" some compute, but it saves the most precious resources: human labor and time. In the LLM era, speed is everything. Rapid iteration is what creates the cognitive gap between leaders and followers. Claude’s rumored 2-million-chip cluster for next year is likely dedicated to exactly this: autonomous model self-training. Technical Summary: 1M Context: Necessary baseline. Memory & Continual Learning: Prerequisites, likely solved first via "tricky" engineering. Harnessing Environments: The breakthrough point. Self-Judging: The tipping point. Full Self-Training: The endgame. Redefining AGI and the Industry If this is the road to AGI, then AGI’s definition should be the sum of all human collective intelligence, not just an individual’s intelligence. It must possess the creative capacity to produce something as profound as the "Theory of Relativity"—meeting the bar set by Hassabis. During this transition, every APP will need to be reconstructed as AI-native. In fact, we might move past the concept of APPs entirely. The most significant challenge will be the reconstruction of the operating system itself. In the future, you won’t see a traditional desktop; you will see an LLM OS, where applications are "generated on demand." This challenges the 80-year-old Von Neumann architecture and represents a total upheaval of the computer science industry. The Irreversible Wave From completing long-horizon tasks to fully autonomous operations, every sector—Security, Finance, Law, E-commerce—will be reshaped. Many friends have reached out lately, asking how to transform their enterprises to keep pace with AI. But few truly realize that this irreversible process has already begun. As this massive technical wave hits, we must be prepared to act, but we must also start thinking seriously about how to regulate it.

147

736

190,413

jietang

jietang

@jietang

May 12

coding is all you need

David Hendrickson

@TeksEdge

May 11

📰 A new coding Agent Index was just released by @ArtificialAnlys. This measures both the model and the harness. No open source harnesses included. OpenSource for Coding is now legit. 🔥 Claude Code GLM-5.1 (53) > Claude Code Sonnet 4.5 🔥 Claude Code GLM-5.1 (53) > Gemini CLI Gemini3.1 DeepSeek V4 Pro & Kimi K2.6 also hit 50

10,285

Z.ai

jietang retweeted

Z.ai

@Zai_org

Apr 29

Scaling laws push model capability forward. But whether that capability becomes reliable in production depends on how we handle Scaling Pain. z.ai/blog/scaling-pain In our latest blog, we share how we debugged GLM-5 serving at scale: reproducing rare garbled outputs, repetition, and rare-character generation; tracing and eliminating KV Cache race conditions; fixing HiCache synchronization issues; and introducing LayerSplit for up to 132% throughput improvement. We hope these lessons help the community avoid similar pitfalls and build more robust inference infrastructure.

885

87,039

jietang

jietang

@jietang

Apr 11

Is it driven by LLM?

Elon Musk

@elonmusk

Apr 9

Tesla driving itself around LA

4:14

11,472

jietang

jietang

@jietang

Apr 11

nice. IMO model

0xSero

@0xSero

Apr 10

GLM-5.1 shut my ZAI usage up so much, it's such a good model. huge leap for them even between this and GLM-5 It can run for hours without needing nudging, first open model IMO that hits this

7,676

jietang

jietang

@jietang

Apr 11

welcome to give it a try. hmmmm... indeed too many users and short of GPUs....

Arena.ai

@arena

Apr 10

GLM-5.1 by @Zai_org is now #3 in Code Arena - surpassing Gemini 3.1 and GPT-5.4, and now on par with Claude Sonnet 4.6. The first frontier level open model to break into the top 3. It’s a major 90 point jump over GLM-5, and 100 over Kimi K2.5 Thinking. Huge congrats to @Zai_org on pushing open model progress forward 🚀

337

38,309

jietang

jietang

@jietang

Apr 8

GLM 5.1 is coming huggingface.co/zai-org/GLM-5…. Coding is the cornersone and Long Horizon Task (LHT) is the new feature this time. focus more on 1. memory 2. evolving/continual learning 3. self judge/reflextion.

249

13,126

jietang

jietang

@jietang

Apr 8

looks super github.com/milla-jovovich/me…

GitHub - MemPalace/mempalace: The best-benchmarked open-source AI memory system. And it's free.

The best-benchmarked open-source AI memory system. And it's free. - MemPalace/mempalace

github.com

3,601

jietang

jietang

@jietang

Apr 7

toward long horizon tasks

Z.ai

@Zai_org

Apr 7

Introducing GLM-5.1: The Next Level of Open Source - Top-Tier Performance: #1 in open source and #3 globally across SWE-Bench Pro, Terminal-Bench, and NL2Repo. - Built for Long-Horizon Tasks: Runs autonomously for 8 hours, refining strategies through thousands of iterations. Blog: z.ai/blog/glm-5.1 Weights: huggingface.co/zai-org/GLM-5… API: docs.z.ai/guides/llm/glm-5.1 Coding Plan: z.ai/subscribe Coming to chat.z.ai in the next few days.

5,292

jietang

jietang

@jietang

Apr 2

Ai coding->vibe coding->agentic engineering harness engineering->autonomous organization

127

11,039

jietang

jietang

@jietang

Apr 2

more smart glm-5v. welcome to give it a try

Zaid

@zaidmukaddam

Apr 1

It's game over for Anthropic now

168

11,441

Z.ai

jietang retweeted

Z.ai

@Zai_org

Apr 1

Introducing GLM-5V-Turbo: Vision Coding Model - Native Multimodal Coding: Natively understands multimodal inputs including images, videos, design drafts, and document layouts. - Balanced Visual and Programming Capabilities: Achieves leading performance across core benchmarks for multimodal coding, tool use, and GUI Agents. - Deep Adaptation for Claude Code and Claw Scenarios: Works in deep synergy with Agents like Claude Code and OpenClaw. Try it now: chat.z.ai API: docs.z.ai/guides/vlm/glm-5v-… Coding Plan trial applications: docs.google.com/forms/d/e/1F…

1:18

251

649

5,740

1,960,151

jietang

jietang

@jietang

Mar 31

Thanks for mentioning GLM. We will soon release GLM 5.1 with better performances on many other benchmarks …

X Freeze

@XFreeze

Mar 31

Grok 4.20 Beta ranks #2 with 97% accuracy score on the 𝜏²-Bench for Telecom (Agentic Tool Use) It outperforms Claude Opus 4.6(max), GPT-5.4(xhigh), and Gemini 3.1 Pro, while closing in on GLM-5 scoring the top in agentic work flow Tool calling is the whole game for AI agents, and this is where Grok 4.20 takes over with state-of-the-art intelligence that fires up instantly, making it the fastest at tokens per sec in the industry

262

18,158

Z.ai

jietang retweeted

Z.ai

@Zai_org

Mar 27

GLM-5.1 is available to ALL GLM Coding Plan users! z.ai/subscribe

356

559

5,511

1,295,504

jietang

jietang

@jietang

Mar 23

maybe the next move is to trade with real money with GLM-5.

Zixuan Li

@ZixuanLi_

Mar 22

Whether driven by luck or analytical capabilities, GLM-5 is currently the only model outperforming the Human Baseline on predictionarena.ai. Anyone using GLM-5 for trading? Does it feel capable to you?

184

21,884

Zixuan Li

jietang retweeted

Zixuan Li

@ZixuanLi_

Mar 22

778

104,135