Joined September 2014
83 Photos and videos
Pinned Tweet
The goal is to democratize filmmaking
Jun 3
Grok @Imagine 1.5 Preview is here. Try it today in the API: x.ai/api/imagine
Hexiang (Frank) Hu retweeted
Lots to do together. Excited to be joining forces with @SpaceX to build useful AI.
SpaceX has exercised the option to acquire @cursor_ai in an all-stock transaction with the goal of building the world’s most useful AI models. For the past few months, SpaceXAI has been jointly training a model with Cursor, which will be released in Cursor and Grok Build soon. We look forward to working closely with the Cursor team to advance our frontier AI capabilities
wow😮
Wait, what? Rio 3.5 Open 397B, developed by the IT company of Rio de Janeiro's city government, is now SOTA open source and even outperforming Qwen 3.7? What is happening today? Never heard of them before.
❤️
I love the incredible people of SpaceX beyond words
Congrats on the release. I think the SD 2 image-to-video results are nerfed; in my experience it should be on par with Grok Imagine v1.5 (and better than the Grok Imagine v1 reported here).
Gemini Omni Flash is SOTA at image to video, text to video, and video editing : ) Excited to get this to developers in the API soon!
LoL my laptop is pretty good🤣
Since May, commanding a giant team of AI agents in @cursor has become my daily routine. In less than a month, they produced over 1 million lines of effective code: autonomously researching, implementing features, debugging, and shipping real work (about data). Insanely productive. The multi-agent AI coding revolution is a real game changer; great product🚀🚀!
Working with agents should feel like working with a colleague. You should be able to “speak to” them not just with text chats, but by gesturing at a screen together, talking live, etc.
The v0.9 image model release would not have been possible without @JiachenLi11 and @du_peichao75719🫡🫡🫡 (Some part of the v0.9 model is still alive, because it is too good 😆)
I can safely say we couldn't reach any of our goals without @du_peichao75719.
Hexiang (Frank) Hu retweeted
@EthanHe_42 worked with me and @imhaotian for half a year on Grok Imagine. I don't think he was intentionally overclaiming here, but the internet narrative quoting him as the "lead" was weird. It was really driven by @imhaotian most of the time. Good times, especially that intense three-month sprint at the start. Huge credit goes to @imhaotian and several other key people like @zeliu_ @jathushan @ZhibeiM @hexiang @JackCaiXun @chaitu, and later @jia_xuhui @YknZhu, along with many others, especially for the latest Grok Imagine 1.5.
In the @latentspacepod podcast, I shared my view on video generation, world models, LLMs, agents, continual learning and where the next frontier is.
1. Video models get most of their intelligence from language, not from video data.
2. Idea-to-code is fast now. The bottleneck is back to having enough compute to try every idea.
3. Iteration speed beats almost everything else in model development.
4. The next leap won't be a better video model. It'll be a video agent.
5. Diffusion will be the frontend of AGI, the LLM the backend. Generative UI will replace HTML/CSS: user intent straight to pixels.
6. Physical embodiment may become a tool a powerful AI picks up. Robotics may get solved by video-capable LLMs.
7. Continual learning may look like models that manage their own context, and even rewrite their own harness at test time.
Thanks @swyx and @vibhuuuus for having me 🙏 youtube.com/watch?v=jPtQlILf…
Hexiang (Frank) Hu retweeted
Jun 3
Meet Go. Gopuff's AI shopping genius, co-developed with SpaceXAI. Just say what you need. It's already on its way.
A recent update from the team, enjoy!
May 31
Grok-Imagine-Video-1.5-Preview (720p) has landed #1 in the Image-to-Video Arena! This is a massive 52 pt improvement over Grok-Imagine-Video (720p), surpassing the best video models Seedance-2.0 and HappyHorse. Congrats to @xAI and @elonmusk on this big achievement!
Imagine would improve a lot on real-world usefulness!
May 21
Prototyping game assets directly with Grok @imagine
Hexiang (Frank) Hu retweeted
Inline image & video on alpha release btw (next stable)
Grok Build from xAI comes with /imagine and /imagine-video commands out of the box to generate images and videos directly from the CLI. It is the first CLI tool that comes with native video and image generation. Result 👇
Hexiang (Frank) Hu retweeted
Together with SpaceXAI, we’re training a significantly larger model from scratch, using 10x more total compute. With Colossus 2’s million H100-equivalents and our combined data and training techniques, we expect this to be a major leap in model capability.
Hexiang (Frank) Hu retweeted
Grok Imagine is the new Leica. Left: Sony Alpha VII Right: Grok Imagine
Hexiang (Frank) Hu retweeted
Wow, Grok Imagine knows how to cast the shadow through the glass. It even casts the shadow on the door hinge. Picture on the left is original (real), right is Imagine.
Congrats on the release @linluqiu, awesome work!
Language is discrete. Language models don’t have to be. 🧚Introducing ELF🧚‍♀️: Embedded Language Flows—a class of diffusion models in continuous embedding space based on continuous-time Flow Matching 🧵
🧐🧐
Unitree CEO Wang Xingxing just unveiled a real-life mecha. Marketed as the world’s first mass-produced manned robot, this machine can transform into a quadrupedal civilian vehicle. The unit weighs roughly 500 kg (1,100 lb), including the pilot.
Hexiang (Frank) Hu retweeted
May 7
🚨 Grok Imagine Quality Mode API is now available on fal!
🖼️ High-quality image generation from text prompts
✏️ Edit existing images with natural instructions
🔍 Analyze visuals and reason about image content