Morph

Morph

58 Photos and videos

Tweets

Pinned Tweet

Morph

@morphllm

Mar 17

Introducing FlashCompact - the first specialized model for context compaction 33k tokens/sec 200k → 50k in ~1.5s Fast, high quality compaction

0:13

133

2,173

225,696

Tejas Bhakta

Morph retweeted

Tejas Bhakta

@tejasybhakta

Jun 9

another week another record

Tejas Bhakta

@tejasybhakta

Jun 1

unfortunately, due to the gpu shortage, we’ve had to lay off some of our silicon and increase human headcount to 3 despite this setback, we continue to scale

1,474

Morph

Morph

@morphllm

Jun 3

Introducing the Morph Model Router. It chooses the best model for each task in under 50ms. Keep frontier performance while lowering latency and cost. Available today in our API

0:20

102

18,959

Morph

Morph

@morphllm

Jun 3

Not every agent step needs your most expensive model. It’s not “cheap vs smart.” There are many points on the cost-performance curve. Model intelligence is jagged. Routing lets you use the right model for the right task, improving quality while cutting cost 25-50%

900

Morph

Morph

@morphllm

Jun 3

Being model agnostic is better, faster, and cheaper Try the Morph Router API: docs.morphllm.com/sdk/compon… or dm us to self host

Model Router - Morph Documentation

Classifies prompt difficulty, ambiguity, and domain for automatic model selection

docs.morphllm.com

580

Morph

Morph

@morphllm

May 29

Morph AutoRouter is now 5x faster know difficulty, ambiguity, and domain in under 50ms

2,677

Morph

Morph

@morphllm

May 29

Why these classes? We think they're useful for routing between GPT vs Opus vs open-source how 30ms? megakernel

545

Morph

Morph

@morphllm

May 29

vibe test it here morphllm.com/dashboard/playg… integrate here docs.morphllm.com/sdk/compon…

330

Tejas Bhakta

Morph retweeted

Tejas Bhakta

@tejasybhakta

May 13

the general applied standard intelligence compute company

Tejas Bhakta

@tejasybhakta

Apr 29

morph is 2 people we spend 10x more on gpus than salary we’re hiring for the first sub-10-person billion-dollar company. join us

12,552

AI Engineer: Miami

Morph retweeted

AI Engineer: Miami

@AIEMiami

Apr 16

Join us in welcoming @morphllm Founder, @tejasybhakta to the AIE Miami lineup! Don't miss his talk 'Everything is Models' next week on the big stage! Get your tickets: ai.engineer/miami

1,979

𝗥𝗬𝗔𝗡 𝗟𝗘𝗘

Morph retweeted

𝗥𝗬𝗔𝗡 𝗟𝗘𝗘

@ryanleecode

Apr 15

warpgrep_github_search from @morphllm is probably the most unfathomoly unfair advantage you can have right now. 10x better than grep app Even beats Ctx7 tbh docs.morphllm.com/sdk/compon…

GitHub Search - Morph Documentation

Search public GitHub repositories without cloning

docs.morphllm.com

1,059

Morph

Morph

@morphllm

Mar 27

Our Claude Code plugin is here! - WarpGrep for state of the art fast code search - FlashCompact, our specialized fast compaction model end to end speedup on long claude code sessions: -37%, while saving claude tokens and improving accuracy

0:38

211

24,502

Morph

Morph

@morphllm

Mar 27

morphllm.com/mcp install the plugin mcp here

Morph MCP - Supercharge Your Coding Agent

One MCP. Plug into Cursor, Claude Code, or any agent. Faster edits, smarter retrieval, and better context.

morphllm.com

1,587

Fondo.com

Morph retweeted

Fondo.com

@Fondocom

Mar 27

Agents don’t need bigger models. They need better tools. Morph trains coding subagents. Not for humans. For frontier models. Fast Apply edits at 10,000 tokens/sec. WarpGrep handles code and log search. Both keep the main model’s context clean Because when context gets too large, performance drops. Now Morph is pushing coding subagents even faster One newer model runs at 33,000 tokens/sec: docs.morphllm.com/sdk/compon… 🎙️ @tejasybhakta, Founder & CEO, @morphllm on @fondocom @thestartpod w/ @davj

1:23

7,135

Morph

Morph

@morphllm

Mar 17

Introducing FlashCompact - the first specialized model for context compaction 33k tokens/sec 200k → 50k in ~1.5s Fast, high quality compaction

0:13

133

2,173

225,696

Morph

Morph

@morphllm

Mar 18

We just removed access control. Anyone can now try this from the API as well. docs.morphllm.com/sdk/compon…

Compact - Morph Documentation

Drop filler from chat history at 33,000 tok/s. No rewriting, no paraphrasing.

docs.morphllm.com

2,360

dhruv bhatia

Morph retweeted

dhruv bhatia

@dhruvbhatia0

Mar 17

Perfect compaction is a prerequisite for long-running agents. It’s the difference between a country of geniuses and a pile of clankers. #unLobotomizeClaude

Morph

@morphllm

Mar 17

Introducing FlashCompact - the first specialized model for context compaction 33k tokens/sec 200k → 50k in ~1.5s Fast, high quality compaction

0:13

2,304

Tejas Bhakta

Morph retweeted

Tejas Bhakta

@tejasybhakta

Mar 17

Compaction should feel invisible It should be fast, accurate, and cheap some of our beta users were confused because they didn't notice compaction happening in their coding agent now mission accomplished

Morph

@morphllm

Mar 17

Introducing FlashCompact - the first specialized model for context compaction 33k tokens/sec 200k → 50k in ~1.5s Fast, high quality compaction

0:13

111

22,116

Morph

Morph

@morphllm

Mar 17

So, we trained a specialized model for compaction and made it really fast - outputting at 33,000 tok/sec We built on a custom PyTriton based stack on H200, using a similar inference stack as our FastApply model

135

9,085

Morph

Morph

@morphllm

Mar 17

We looked at 200 agent sessions and over 40 of the top coding agent harnesses Most context bloat comes from tool responses, not model generation. Result: → no performance drop → fewer tokens → fewer steps To push performance higher and perform long horizon tasks, agents need cleaner context. More details in the blog: morphllm.com/blog/compact-sd… Or try it in the playground: morphllm.com/dashboard/playg…

Morph - Fast Models That Improve Coding Agents

General coding models for agent loops, plus specialized models for search (WarpGrep), edits (Fast Apply), and context (Compact). One OpenAI-compatible API.

morphllm.com

118

8,691