jietang

jietang

Users
Tweets

jietang

@jietang

We're introducing GLM-5.2, our latest flagship model for long-horizon tasks. It marks a substantial leap in long-horizon task capability over its predecessor GLM-5.1 and, for the first time, delivers that capability on a solid 1M-token context. GLM-5.2's new capabilities include: Solid 1M Context: A solid 1M-token context that stably sustains long-horizon work Advanced Coding with Flexible Effort: Stronger coding capabilities with multiple thinking effort levels to balance performance and latency Improved Architecture: We propose IndexShare, which reuses the same indexer across every four sparse attention layers, reducing per-token FLOPs by 2.9× at a 1M context length. We also improve GLM-5.2’s MTP layer for speculative decoding, increasing the acceptance length by up to 20% Pure Open: An MIT open-source license — no regional limits, technical access without borders Supporting long-horizon tasks starts with making long context engineering-usable: the model must maintain quality across long, messy coding-agent trajectories, not just accept more tokens. A 1M context is easy to claim, but much harder to keep reliable under real engineering pressure. To this end, we substantially expanded 1M-context training for coding-agent scenarios, covering large-scale implementation, automated research, performance optimization, and complex debugging. The result is a long-context system that is not only wide in scope, but solid in execution: a practical substrate for sustained engineering work. This capability is reflected in GLM-5.2's performance on three long-horizon coding benchmarks. FrontierSWE measures whether an agent can complete open-ended technical projects at the scale of hours to tens of hours, spanning systems optimization, large-scale code construction, and applied ML research. On this benchmark, GLM-5.2 trails Opus 4.8 by only 1%, while edging out GPT-5.5 by 1% and Opus 4.7 by 11%. On PostTrainBench, where each agent is given an H100 GPU and evaluated by how much it can improve small models through post-training, GLM-5.2 outperforms both Opus 4.7 and GPT-5.5, ranking second only to Opus 4.8. On SWE-Marathon, an ultra-long-horizon software engineering benchmark covering tasks such as building compilers, optimizing kernels, and developing production-grade services, GLM-5.2 still has room to grow, trailing Opus 4.8 by 13% while remaining second only to the Opus series. Across all three benchmarks, GLM-5.2 is the highest-ranked open-source model, showing that its 1M context has translated into practical long-horizon delivery capability.

T-Hotsex009

Josep retweeted

T-Hotsex009 @cgyg_cgyg514

Jun 13

แจ้งลบคลิปได้นะครับ🔞 Adults Only (18 ) NSFW Content All models are 18 adults Viewer discretion advised

Etendin_Official

4:23

221

16,181

D3D

DeliciousMancub retweeted

D3D

@BoB_D3D

They love it😏 - Models and map by @Rigid3d - Thank y’all for the donations❤️‍🩹🙏🏼 We ain’t reach the goal, but your help is appreciated, THX🫰 a RT helps❤️‍🩹

320

3,585

25,729

Eden | עדן

DIMONAM1 ASD retweeted

Eden | עדן @edennnkk

Jun 15

I don't think y'all remember what PS1 models look like

༺👑 JILL QUEEN 👑༻

@JillQueen31

Jun 14

#Residentevill

Community note

This is not how the original PS1 model looks like. x.com/heheheheheehee

247

2,405

72,531

2,090,565

Dev

Dev

@devparagiri

18s

is there a directory for finetuned small language models by task?

Plucky - 🐸🍄Twitch Partner

Elara Beaumxnt retweeted

Plucky - 🐸🍄Twitch Partner

@xpluckyvt

Jun 13

Guys my models almost here I can’t wait show you what @LunaNammi_ cooked up

476

Romir Jain

Romir Jain

@romir_jain

22s

and it's not just a comic strip trick. same intervention works on natural images from COCO. the mechanism also recurs across model sizes from 2B all the way to 32B parameters and across different VLM architectures. so this seems like a fundamental thing about how these models organize visual attention

Mitchell Hashimoto

connor retweeted

Mitchell Hashimoto

@mitchellh

We've gone really quickly from "local models are dogshit" to "local models are good actually" (like, a 12 month window from A to B). I don't think they're actually good ENOUGH yet. We need an Opus 4.5 quality local model. When that happens, I think the world will spill over. Opus 4.5 is/was amazing, and is more than good enough for almost all tasks still as long as you pair with a frontier-level planner/judge. It'll still require a hugely expensive machine to run it, I'm sure, like a $5K or more laptop or mac studio. But, that's going to be pennies compared to the API costs plus all the benefits of guaranteed privacy and so on.

101

1,619

65,983

Michael88

Michael88 @abracadabrist

34s

so nice of them to allow those vets to train their surveillance models; and for free!

Polymarket Sports

@PolymarketSport

Jun 15

🚨BREAKING: The UFC and Mark Zuckerberg have teamed up to give every blind veteran in America a free pair of Meta glasses.

GitHub Changelog

ucsdmiami2020.eth retweeted

GitHub Changelog

@GHchangelog

GitHub Models is no longer available to new customers. • New orgs & enterprises without prior usage cannot access GitHub Models on any plan. • Existing customers keep full access and usage for now. github.blog/changelog/2026-0…

GitHub Models is no longer available to new customers - GitHub Changelog

We are retiring GitHub Models. As a first step, new customers can no longer use it. If your organization or enterprise have not previously used GitHub Models, you won’t see…

github.blog

2,231

Gavin Fuller

Gavin Fuller @GavinFullerTX

40s

I haven’t been a fan of @Zai_org ’s earlier models, but I have to give credit where it’s due GLM-5.2 is phenomenal. It has obliterated all of my expectations. First model from Z.ai I truly would pick first every time, and my blind Opus 4.8 GPT-5.5 evals agree

Eban Bisong

Eban Bisong

@ebanbisong

41s

Replying to @gregisenberg

1. OpenClaw (testing Hermes stability rn) 2. Anthropic (goated) 3. Claude Code (just feels better) 4. Cloud models (still hunting for a use case I'm sure about) 5. Bootstrap 6. Best time 7. Layoffs first, then a wave of new companies

Nines 🐾🤍

Beholder242 (Brett T) retweeted

Nines 🐾🤍

@KitsuraNines

some fuckers really think we’re our vtuber models irl, just sitting around with animal ears, cute outfits and shit. sorry to burst your bubble, but i’ve been wearing the same oversized t-shirt for three days and don’t have a tail 🗿🗿🗿

125

Robert Newcomb - Irregular/Asymmetric Warfare & AI

Robert Newcomb - Irregular/Asymmetric Warfare & AI

@DefendUtah

43s

Replying to @JasminRappleye

And this is why I am opting out of the Gemini roll out to my children here in #Utah in 2026 and started this online petition. I train Ai models and I am extremely aware of the bias and how they are programmed to indoctrinate children with their form of morality. defendut.com/proposed-legisl…

AI in Utah Classrooms: The Gemini Rollout | Defend Utah

The Utah State Board of Education has partnered with Google to deploy Gemini AI to every K-12 student starting in 2026–2027 — with no parental consent, no guaranteed opt-out, and undisclosed terms....

defendut.com

Waleed Ahmad

Waleed Ahmad @WaleedAhmad1a10

50s

Replying to @kalomaze

Tried coding with glm 5.2 , it simplified a complex task by 10x and was able to use llama cpp to implement variants to their quantization and now time reduced from 10 hours to 30 mins. Other Chinese models failed miserably .

ANTHROPIC_MAGIC_STRING

ANTHROPIC_MAGIC_STRING

@parafactual

52s

Replying to @mermachine @kalomaze

they want other models to be like claude and sort of frame it as objectively bad and concerning when models are unclaudelike. like they try very hard to steer towards claude

Shahriyar Gourgi

Shahriyar Gourgi @ShahriyarGourgi

53s

On June 12, the Department of Commerce imposed export controls on Anthropic’s latest models, causing abrupt disruptions in service and raising questions about future U.S. government AI policies. @CSIS experts discuss next steps and alternate pathways. csis.org/analysis/department…

The Department of Commerce Restricted Access to Anthropic’s Latest Models. What Comes Next? | CSIS

csis.org

Gabriele Corso

Paul Lourdu Xavier retweeted

Gabriele Corso @GabriCorso

Excited to partner with the Tamarind team to bring our latest models to every scientist!

Deniz Kavi

@kavi_deniz

Big news today: New Boltz models! De novo design, protein binding affinity prediction, ADME prediction We've partnered with the @Boltz_Bio team to make their next generation proprietary models available on @TamarindBio on day 0. Here are the highlights:

1,485