I need a vacation

Joined November 2014
154 Photos and videos
Pinned Tweet
time for some vacation stuff
Yesterday was my last day at @LumaLabsAI. Over the last three years, I had the privilege of helping drive the company's transition from 3D AI to video generation and native multimodal foundation models. I am grateful to have worked alongside an extraordinary group of researchers, and I look forward to seeing the next chapter of the company's story unfold.
Over the weekend, I was using codex to update my homepage and a paper I wrote a year ago on the topic of diffusion LLMs (should be updated on Monday). tsong.me/blog/inference-time… While I did not want to make it too explicit back then, I argued that discrete diffusion LLMs were not the right thing to do, and that if diffusion ever works on LLMs, continuous dLLMs are the way to go. A year later, we are seeing a lot of cool papers in this space, and I hope the community can push for something practical and scalable.
I really think that autoregression versus diffusion is a false dichotomy -- they can easily co-exist (e.g., diffusion forcing). The real dichotomy is between discrete and continuous tokens.
Most researchers agree that autoregression is best when memory bandwidth is cheap and diffusion is best when FLOPS are cheap. They also admit the future of compute is all FLOPS because memory scaling is hard and scaling FLOPS is easy. So why not go all in on diffusion????
#3 on Image Edit. #3 on Text-to-Image. @arena The compute we did it with would surprise you. Proud of this team @LumaLabsAI! (A slightly more detailed report will come out soon.)
Exciting news: UNI-1.1-Max and UNI-1.1 debut, making @LumaLabsAI the #3 lab in the Image Arena across both Text-to-Image and Image Edit! These are versions released without agentic search.
Text-to-Image Arena
- UNI-1.1-Max: #6 overall (1193), 12 points over MAI-Image-2
- UNI-1.1: #7 overall (1190), 13 points over Reve-v1.5
Multi-Image Edit Arena
- UNI-1.1-Max: #7 overall (1315), 21 points over Seedream 4.5
- UNI-1.1: #8 overall (1298)
Single-Image Edit Arena
- UNI-1.1-Max: #7 overall (1337)
- UNI-1.1: #11 overall (1310), on par with Grok-Imagine-Image (20260207)
Congratulations to @LumaLabsAI on this solid performance!
Jiaming Song retweeted
Excited to share what we’ve been building at Meta Superintelligence Labs! We just released Muse Spark, our first AI model. It's a natively multimodal reasoning model and the first step on our path to personal superintelligence. We've overhauled our entire stack to support scaling, and this is just the beginning. ai.meta.com/blog/introducing…
This is such an amazing idea that I should never have given up….
Apr 1
personalization is back. most systems still fake it. we went back to our paper from two years ago, rebuilt it from the ground up, applied newer techniques, and replaced the base model with a flow-matching transformer. the result: our pipeline now beats all SOTA personalization approaches on FT-Arena. introducing Person2Person Diffusion 2. code and weights are out now (MIT, commercial use allowed). okaris.github.io/p2p-diffusi…
Jiaming Song retweeted
Most image models are good at one thing. Uni-1 has been good at everything we've thrown at it. Our team generated thousands of images leading up to Uni-1 launch. We embedded them all into a single map where visual similarity determines proximity. The result speaks for itself.
My dream come true: lobster cat!
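Finally, an image model that can go toe-to-toe with Nanobanana Pro. The Uni-1 image model that @LumaLabsAI released in the early hours of the morning can draw strip comics, and the specified Chinese text comes out almost error-free. Prompt in Alt! ▶ Detailed model description: lumalabs.ai/uni-1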
Jiaming Song retweeted
Mar 25
We are loving the energy around Uni-1! Quick note since we’re seeing questions: With Luma Agents, requests can route across models. If you want to make sure you’re using Uni-1, here’s how:
- Select Create Image → Uni-1
- Or, explicitly ask the agent to use Uni-1
- Check the model label on outputs to confirm
API access coming soon for more direct testing. Keep the feedback coming, and keep on creating → lumalabs.ai/uni-1.
A fixed text token length is a limitation of most existing text-to-image models. In Uni-1, we engineered around this, so that it can take at least 3500 tokens (even in markdown) if you want to promptmaxx. The difference is huge between Uni-1 (1st image) and Flux 2 (2nd image).
This has to be a prank.
Here is the prompt and thinking in case you are wondering.
Jiaming Song retweeted
🙏 Grateful and Proud beyond words to be part of the incredible team that built UNI-1 @LumaLabsAI! Intelligent, directable, cultured — and the manga generation? See these 👇 and try it FREE today here: lumalabs.ai/uni-1! Plus we are hiring!
Mar 23
Uni-1 is here! A new kind of model that thinks and generates pixels simultaneously. Less artificial. More intelligent.
Thanks to the amazing team that made this possible! We saw a significant improvement in the model even in the 2 weeks between the announcement and the launch. Now, to new heights!
Mar 23
Uni-1 is here! A new kind of model that thinks and generates pixels simultaneously. Less artificial. More intelligent.
Thanks! But I am not sure that is what is specifically being asked. Given Aaron’s past papers, I suspect he is asking for something for Wasserstein GANs? TVM does provide an implementation of the backward pass through “forward-mode autodiff”, but WGAN runs its backward pass through “reverse-mode autodiff”.
Replying to @SkyLi0n
This implementation came with the Terminal Velocity Matching paper (arxiv.org/abs/2511.19797): github.com/lumalabs/tvm/blob…
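To make the forward-mode versus reverse-mode distinction in this exchange concrete, here is a minimal JAX sketch. It is only an illustration under stated assumptions: the toy function `f`, the names `loss_through_jvp` and `loss_through_grad`, and the input values are all hypothetical, and none of it is taken from the TVM repository or from any WGAN codebase. The first loss takes a gradient through a Jacobian-vector product (forward mode); the second takes a gradient through a gradient, in the style of a gradient-penalty term (reverse mode).

```python
import jax
import jax.numpy as jnp


def f(w, x):
    # Hypothetical toy scalar function standing in for a network: f = sum(w * x^2).
    return jnp.sum(w * x ** 2)


def loss_through_jvp(w, x, v):
    # Forward mode: directional derivative of f with respect to x along v,
    # computed with a Jacobian-vector product, then squared.
    _, tangent = jax.jvp(lambda x_: f(w, x_), (x,), (v,))
    return tangent ** 2


def loss_through_grad(w, x):
    # Reverse mode: gradient of f with respect to x, then a
    # gradient-penalty-style term (||grad_x f|| - 1)^2.
    g = jax.grad(f, argnums=1)(w, x)
    return (jnp.linalg.norm(g) - 1.0) ** 2


w = jnp.array(0.5)
x = jnp.array([1.0, -2.0, 3.0])
v = jnp.ones_like(x)

# Backward pass through forward-mode autodiff (gradient of a JVP w.r.t. w).
dw_jvp = jax.grad(loss_through_jvp)(w, x, v)

# Backward pass through reverse-mode autodiff (gradient of a gradient w.r.t. w).
dw_grad = jax.grad(loss_through_grad)(w, x)

print(dw_jvp, dw_grad)
```

For this toy `f` both quantities have closed forms, 32w for the first and 4(14 - sqrt(14)) at w = 0.5 for the second, which makes the sketch easy to check by hand. A general-purpose autodiff library handles both compositions, whereas a hand-written kernel has to implement each backward pass separately, which may be why the two cases are not interchangeable in practice.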
Good luck!
We raised $165M at a $1.15B valuation to stop doing demos. 2026 is about 1) deployment and 2) research. We will start shipping Memo with our new frontier models in a few months. Our series-B is led by Coatue, with Thomas Laffont joining the board. ->🧵
Reimagined with Uni-1.
Ana de Armas Gemini Nano Banana Pro Prompt: { "image_generation_request": { "framing_and_composition": { "shot_type": "Medium shot", "orientation": "Vertical", "composition": "Mirror selfie, subject centered", "camera_in_shot": "Smartphone visible in reflection, held at chest height", "perspective": "Candid night out" }, "subject_identity": { "reference_name": "Ana de Armas (young)", "skin_tone": "Pale, porcelain", "facial_structure": { "jawline": "Sharp", "cheekbones": "High", "eyes": "Large, almond-shaped, green-hazel", "nose": "Refined, slim" }, "identity_fidelity": "100% face preservation, zero alterations to unique features" }, "expression_and_pose": { "expression": "Sultry, neutral, composed", "gaze": "Directly into the reflected camera lens", "head_pose": "Slightly tilted", "body_language": "Confident, heavy coat draped loosely to reveal shoulder", "hand_detail": "Right hand holding phone, index finger extended, red manicure" }, "styling_details": { "makeup": { "eyes": "Sharp black winged eyeliner, smoked-out lower lash line, heavy volume mascara", "eyebrows": "Well-defined, natural dark", "lips": "Soft matte rose-toned, slightly overlined", "contour": "Subtle, cheekbone-focused" }, "hair": { "color": "Dark espresso", "style": "Long, thick, wavy, voluminous", "texture": "Messy-chic, healthy sheen", "parting": "Slightly off-center" }, "outfit": { "base_layer": "Chocolate-brown metallic one-shoulder bodycon dress with ruching", "outerwear": "Heavy, brown faux-fur mink-style coat, worn off-shoulder" }, "accessories": { "jewelry": "Large, chunky vintage-style gold earrings", "nails": "Long, pointed acrylics, glossy cherry red", "phone_case": "Modern smartphone in a dark case" } }, "physical_attributes": { "silhouette": "Slim, toned hourglass", "details": "Defined collarbones, slender shoulder, narrow waist, feminine curves" }, "environment_and_lighting": { "setting": "Dark minimalist luxury interior (high-end club/lounge)", "background_elements": "Horizontal black tiled walls or polished dark wood panels", "lighting_type": "Hard mirror flash", "lighting_effects": "Central starburst flare, high-contrast highlights, deep shadows", "film_texture": "Subtle film grain, low-light mobile photography look" }, "technical_specifications": { "aesthetic": "Shot on iPhone, ultra-realistic", "resolution": "8k, high-resolution textures, raw photo quality", "artifacts": "Slight motion blur, natural digital noise", "focus": "Sharp focus on facial features despite flash flare" }, "mood_and_style": { "theme": "It-girl nightlife, edgy, glamorous, expensive", "atmosphere": "Intimate, performative, high-fashion vanity" }, "strict_constraints": { "face_integrity": "Do not alter eye shape, lip shape, or facial proportions", "lighting_retention": "Maintain flash flare in mirror", "prohibited_elements": "No smiling", "aspect_ratio": "3:4" } } }