Joined May 2025
58 Photos and videos
Pinned Tweet
Mar 17
Introducing FlashCompact - the first specialized model for context compaction 33k tokens/sec 200k → 50k in ~1.5s Fast, high quality compaction
94
133
2,173
225,696
Morph retweeted
another week another record
unfortunately, due to the gpu shortage, we’ve had to lay off some of our silicon and increase human headcount to 3 despite this setback, we continue to scale
1
1
21
1,474
Introducing the Morph Model Router. It chooses the best model for each task in under 50ms. Keep frontier performance while lowering latency and cost. Available today in our API
12
7
102
18,959
Not every agent step needs your most expensive model. It’s not “cheap vs smart.” There are many points on the cost-performance curve. Model intelligence is jagged. Routing lets you use the right model for the right task, improving quality while cutting cost 25-50%
1
7
900
May 29
Morph AutoRouter is now 5x faster know difficulty, ambiguity, and domain in under 50ms
2
1
9
2,677
May 29
Why these classes? We think they're useful for routing between GPT vs Opus vs open-source how 30ms? megakernel
1
1
545
Morph retweeted
the general applied standard intelligence compute company
morph is 2 people we spend 10x more on gpus than salary we’re hiring for the first sub-10-person billion-dollar company. join us
1
1
51
12,552
Morph retweeted
Join us in welcoming @morphllm Founder, @tejasybhakta to the AIE Miami lineup! Don't miss his talk 'Everything is Models' next week on the big stage! Get your tickets: ai.engineer/miami
3
18
1,979
warpgrep_github_search from @morphllm is probably the most unfathomoly unfair advantage you can have right now. 10x better than grep app Even beats Ctx7 tbh docs.morphllm.com/sdk/compon…
2
1
8
1,059
Mar 27
Our Claude Code plugin is here! - WarpGrep for state of the art fast code search - FlashCompact, our specialized fast compaction model end to end speedup on long claude code sessions: -37%, while saving claude tokens and improving accuracy
25
21
211
24,502
Morph retweeted
Agents don’t need bigger models. They need better tools. Morph trains coding subagents. Not for humans. For frontier models. Fast Apply edits at 10,000 tokens/sec. WarpGrep handles code and log search. Both keep the main model’s context clean Because when context gets too large, performance drops. Now Morph is pushing coding subagents even faster One newer model runs at 33,000 tokens/sec: docs.morphllm.com/sdk/compon… 🎙️ @tejasybhakta, Founder & CEO, @morphllm on @fondocom @thestartpod w/ @davj
2
10
32
7,135
Mar 17
Introducing FlashCompact - the first specialized model for context compaction 33k tokens/sec 200k → 50k in ~1.5s Fast, high quality compaction
94
133
2,173
225,696
Morph retweeted
Perfect compaction is a prerequisite for long-running agents. It’s the difference between a country of geniuses and a pile of clankers. #unLobotomizeClaude
Mar 17
Introducing FlashCompact - the first specialized model for context compaction 33k tokens/sec 200k → 50k in ~1.5s Fast, high quality compaction
3
1
20
2,304
Morph retweeted
Compaction should feel invisible It should be fast, accurate, and cheap some of our beta users were confused because they didn't notice compaction happening in their coding agent now mission accomplished
Mar 17
Introducing FlashCompact - the first specialized model for context compaction 33k tokens/sec 200k → 50k in ~1.5s Fast, high quality compaction
11
5
111
22,116
Mar 17
So, we trained a specialized model for compaction and made it really fast - outputting at 33,000 tok/sec We built on a custom PyTriton based stack on H200, using a similar inference stack as our FastApply model
2
1
135
9,085
Mar 17
We looked at 200 agent sessions and over 40 of the top coding agent harnesses Most context bloat comes from tool responses, not model generation. Result: → no performance drop → fewer tokens → fewer steps To push performance higher and perform long horizon tasks, agents need cleaner context. More details in the blog: morphllm.com/blog/compact-sd… Or try it in the playground: morphllm.com/dashboard/playg…
9
118
8,691