Jiaming Song

154 Photos and videos

Tweets

Pinned Tweet

Jiaming Song

@baaadas

May 31

time for some vacation stuff

0:51

Jiaming Song

@baaadas

May 31

Yesterday was my last day at @LumaLabsAI. Over the last three years, I had the privilege of helping drive the company's transition from 3D AI to video generation and native multimodal foundation models. I am grateful to have worked alongside an extraordinary group of researchers, and I look forward to seeing the next chapter of the company's story unfold.

171

24,503

Jiaming Song

@baaadas

Jun 1

Over the weekend, I was using codex to update my homepage and a paper I wrote a year ago on the topic of diffusion LLMs (should be updated on Monday). tsong.me/blog/inference-time… While I did not want to make it too explicit back then, I have argued that discrete diffusion LLMs were not the right thing to do and that, if diffusion ever works on LLMs, continuous dLLMs are the way to go. A year later, we are seeing a lot of cool papers in this space, and I hope the community can push for something practical and scalable.

172

14,978

Jiaming Song

@baaadas

Jun 1

I really think that autoregression versus diffusion is a false dichotomy -- they can easily co-exist (e.g., diffusion forcing). The real dichotomy is between discrete and continuous tokens.

David

@DavidSHolz

May 27

Most researchers agree that autoregression is best when memory bandwidth is cheap and diffusion is best when FLOPS are cheap. They also admit the future of compute is all FLOPS because memory scaling is hard and scaling FLOPS is easy. So why not go all in on diffusion????

321

52,405

Jiaming Song

@baaadas

May 31

553

67,809

Jiaming Song

@baaadas

May 5

#3 on Image Edit. #3 on Text-to-Image. @arena The compute we did it with would surprise you. Proud of this team @LumaLabsAI ! (A slightly more detailed report will come out soon)

Arena.ai

@arena

May 5

Exciting news: UNI-1.1-Max and UNI-1.1 debut, making @LumaLabsAI the #3 lab in the Image Arena across both Text-to-Image and Image Edit! These are versions released without agentic search.

Text-to-Image Arena
- UNI-1.1-Max #6 overall (1193), 12 points over MAI-Image-2
- UNI-1.1 #7 overall (1190), 13 points over Reve-v1.5

Multi-Image Edit Arena
- UNI-1.1-Max #7 overall (1315), 21 points over Seedream 4.5
- UNI-1.1 #8 overall (1298)

Single-Image Edit Arena
- UNI-1.1-Max #7 overall (1337)
- UNI-1.1 #11 overall (1310), on par with Grok-Imagine-Image (20260207)

Congratulations to @LumaLabsAI on this solid performance!

154

27,866
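
To put the score gaps quoted in the Arena post in perspective, here is a minimal sketch that converts a gap into an expected head-to-head win rate. It assumes the Arena scores sit on an Elo-style logistic scale with a 400-point base-10 spread; the post does not state this, and `expected_win_rate` and its `scale` parameter are hypothetical names for illustration only.

```python
def expected_win_rate(score_gap: float, scale: float = 400.0) -> float:
    """Win probability of the higher-scored model under an Elo-style curve.

    ASSUMPTION: scores follow a base-10 logistic scale with a 400-point
    spread. This is not stated in the post itself.
    """
    return 1.0 / (1.0 + 10.0 ** (-score_gap / scale))


# Score gaps quoted in the Arena post.
gaps = {
    "UNI-1.1-Max vs MAI-Image-2 (Text-to-Image)": 12,
    "UNI-1.1 vs Reve-v1.5 (Text-to-Image)": 13,
    "UNI-1.1-Max vs Seedream 4.5 (Multi-Image Edit)": 21,
}

for matchup, gap in gaps.items():
    print(f"{matchup}: {expected_win_rate(gap):.1%}")
```

Under that assumption, gaps of 12 to 21 points correspond to win rates only a few percentage points above 50%, so neighbouring ranks are close.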

Jiaming Song

@baaadas

Apr 24

DeepSeek-V4 from the 🐐 @deepseek_ai released: huggingface.co/deepseek-ai/D…
DeepSeek-V4 Pro: 1.6T/49B
DeepSeek-V4 Lite: 284B/13B

DeepSeek_V4.pdf · deepseek-ai/DeepSeek-V4-Pro at main

huggingface.co

1,485
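
The size pairs in the post above can be turned into a ratio with a few lines of arithmetic. This sketch assumes the `X/Y` notation means total parameters over parameters active per token, as is common for mixture-of-experts models; the post does not spell this out, and `parse_params` and `active_fraction` are hypothetical helper names.

```python
def parse_params(text: str) -> float:
    """Convert a size string such as '1.6T' or '49B' into a raw count."""
    units = {"B": 1e9, "T": 1e12}
    return float(text[:-1]) * units[text[-1]]


def active_fraction(spec: str) -> float:
    """Return active / total for a 'total/active' spec.

    ASSUMPTION: the first number is the total parameter count and the
    second is the count active per token.
    """
    total, active = (parse_params(part) for part in spec.split("/"))
    return active / total


# Size pairs exactly as quoted in the post.
models = {"DeepSeek-V4 Pro": "1.6T/49B", "DeepSeek-V4 Lite": "284B/13B"}

for name, spec in models.items():
    print(f"{name}: {active_fraction(spec):.1%} active")
```

If that reading is right, both models would activate only a few percent of their parameters per token.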

Jiaming Song retweeted

Shengjia Zhao

@shengjia_zhao

Apr 8

Excited to share what we’ve been building at Meta Superintelligence Labs! We just released Muse Spark, our first AI model. It's a natively multimodal reasoning model and the first step on our path to personal superintelligence. We've overhauled our entire stack to support scaling, and this is just the beginning. ai.meta.com/blog/introducing…

172

1,668

236,833

Jiaming Song

@baaadas

Apr 2

This is such an amazing idea that I should never have given up on...

@okaris

Apr 1

personalization is back. most systems still fake it. we went back to our paper from two years ago, rebuilt it from the ground up, applied newer techniques, and replaced the base model with a flow-matching transformer. the result: our pipeline now beats all SOTA personalization approaches on FT-Arena. introducing Person2Person Diffusion 2. code and weights are out now (MIT, commercial use allowed). okaris.github.io/p2p-diffusi…

11,097

Jiaming Song retweeted

William Shen

@shenbokui

Mar 26

Most image models are good at one thing. Uni-1 has been good at everything we've thrown at it. Our team generated thousands of images leading up to Uni-1 launch. We embedded them all into a single map where visual similarity determines proximity. The result speaks for itself.

0:15

328

48,966

Jiaming Song

@baaadas

Mar 25

My dream come true: lobster cat!

Ring Hyacinth

@ring_hyacinth

Mar 24

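Finally, an image model that can go toe-to-toe with Nanobanana Pro. The Uni-1 image model that @LumaLabsAI released in the early hours can draw strip comics, and the Chinese text you specify comes out almost error-free. Prompt in Alt! ▶ Detailed model description: lumalabs.ai/uni-1

Task: Generate one page of a high-precision black-and-white vertical strip manga. Tell the awakening story of "Little Cat-Shrimp".

[Overall requirements]: Realistic manga style, precise human anatomy (even though it is a cat-shrimp), a Zen-like ink-wash feel (brush strokes), conveying emotion through the eyes and body language.

[Keep the core character consistent]: The protagonist is "Little Cat-Shrimp" (a cat's head and expressions, a shrimp/lobster body with large claws). Its expression must be very realistic, with Slam Dunk-style focus and awakening.

[Key panel sequence]:
(Top): Medium horizontal panel. Little Cat-Shrimp snores peacefully on a quiet beach.
Dialogue: "Hoo... the sun feels so nice today."
(Down): Short close-up panel. A huge wave instantly engulfs the entire frame.
Little Cat-Shrimp's body flips over, its expression terrified (realistic style).
Dialogue: "Wah! Help!" Precise human anatomy.
(Down): Extra-long vertical panel. Underwater. Little Cat-Shrimp struggles desperately underwater with its eyes closed, its body contorted, with lots of bubbles around it. Dialogue: "Huh? I'm not afraid of water?" Show the process of struggle and discovery.
(Down): Short close-up panel. Little Cat-Shrimp suddenly opens its eyes and stares intently at its own shrimp tail. Dialogue: "So... I can actually swim?" An expression of focus and awakening.
(Bottom): Extra-long vertical panel. Climax. Little Cat-Shrimp darts swiftly across the seabed, leaving a clear white water trail behind it, and the small fish nearby are all flung aside. It smiles with great focus.
Dialogue: "This speed! So awesome!"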

2,956

Jiaming Song retweeted

Luma

@LumaLabsAI

Mar 25

We are loving the energy around Uni-1! Quick note since we’re seeing questions: With Luma Agents, requests can route across models. If you want to make sure you’re using Uni-1, here’s how:
- Select Create Image → Uni-1
- Or, explicitly ask the agent to use Uni-1
- Check the model label on outputs to confirm

API access coming soon for more direct testing. Keep the feedback coming, and keep on creating → lumalabs.ai/uni-1.

0:23

254

19,795

Jiaming Song

@baaadas

Mar 24

A fixed text token length is a limitation of most existing text-to-image models. In Uni-1, we engineered around this, so that it can take at least 3500 tokens (even in markdown) if you want to promptmaxx. The difference between Uni-1 (1st image) and Flux 2 (2nd image) is huge.

Atelier SG

@_AtelierSG_

Mar 24

This has to be a prank.

7,457

Jiaming Song

@baaadas

Mar 24

Here is the prompt and thinking in case you are wondering.

1,493

Jiaming Song retweeted

Yiqing Liang

@YiqingLiang2

Mar 24

🙏 Grateful and Proud beyond words to be part of the incredible team that built UNI-1 @LumaLabsAI! Intelligent, directable, cultured — and the manga generation? See these 👇 and try it FREE today here: lumalabs.ai/uni-1! Plus we are hiring!

Luma

@LumaLabsAI

Mar 23

Uni-1 is here! A new kind of model that thinks and generates pixels simultaneously. Less artificial. More intelligent.

1:00

5,226

Jiaming Song

@baaadas

Mar 23

Thanks to the amazing team that made this possible! We saw a significant improvement in the model even in the 2 weeks between the announcement and the launch. Now, to new heights!

Luma

@LumaLabsAI

Mar 23

Uni-1 is here! A new kind of model that thinks and generates pixels simultaneously. Less artificial. More intelligent.

1:00

117

8,881

Jiaming Song

@baaadas

Mar 16

Thanks! But I am not sure that is what is specifically being asked. Given Aaron’s past papers, I suspect he is asking for something for Wasserstein GANs. TVM does come with an implementation of the backward pass on “forward-mode autodiff”, but WGAN does its backward pass on “reverse-mode autodiff”.

Sander Dieleman

@sedielem

Mar 15

Replying to @SkyLi0n

This implementation came with the Terminal Velocity Matching paper (arxiv.org/abs/2511.19797): github.com/lumalabs/tvm/blob…

5,383

Jiaming Song

@baaadas

Mar 12

Good luck!

Tony Zhao

@tonyzzhao

Mar 12

We raised $165M at a $1.15B valuation to stop doing demos. 2026 is about 1) deployment and 2) research. We will start shipping Memo with our new frontier models in a few months. Our series-B is led by Coatue, with Thomas Laffont joining the board. ->🧵

1:43

4,021

Jiaming Song

@baaadas

Mar 9

Reimagined with Uni-1.

Nobara @Nobarakia

Jan 15

Ana de Armas Gemini Nano Banana Pro Prompt:
{
  "image_generation_request": {
    "framing_and_composition": {
      "shot_type": "Medium shot",
      "orientation": "Vertical",
      "composition": "Mirror selfie, subject centered",
      "camera_in_shot": "Smartphone visible in reflection, held at chest height",
      "perspective": "Candid night out"
    },
    "subject_identity": {
      "reference_name": "Ana de Armas (young)",
      "skin_tone": "Pale, porcelain",
      "facial_structure": {
        "jawline": "Sharp",
        "cheekbones": "High",
        "eyes": "Large, almond-shaped, green-hazel",
        "nose": "Refined, slim"
      },
      "identity_fidelity": "100% face preservation, zero alterations to unique features"
    },
    "expression_and_pose": {
      "expression": "Sultry, neutral, composed",
      "gaze": "Directly into the reflected camera lens",
      "head_pose": "Slightly tilted",
      "body_language": "Confident, heavy coat draped loosely to reveal shoulder",
      "hand_detail": "Right hand holding phone, index finger extended, red manicure"
    },
    "styling_details": {
      "makeup": {
        "eyes": "Sharp black winged eyeliner, smoked-out lower lash line, heavy volume mascara",
        "eyebrows": "Well-defined, natural dark",
        "lips": "Soft matte rose-toned, slightly overlined",
        "contour": "Subtle, cheekbone-focused"
      },
      "hair": {
        "color": "Dark espresso",
        "style": "Long, thick, wavy, voluminous",
        "texture": "Messy-chic, healthy sheen",
        "parting": "Slightly off-center"
      },
      "outfit": {
        "base_layer": "Chocolate-brown metallic one-shoulder bodycon dress with ruching",
        "outerwear": "Heavy, brown faux-fur mink-style coat, worn off-shoulder"
      },
      "accessories": {
        "jewelry": "Large, chunky vintage-style gold earrings",
        "nails": "Long, pointed acrylics, glossy cherry red",
        "phone_case": "Modern smartphone in a dark case"
      }
    },
    "physical_attributes": {
      "silhouette": "Slim, toned hourglass",
      "details": "Defined collarbones, slender shoulder, narrow waist, feminine curves"
    },
    "environment_and_lighting": {
      "setting": "Dark minimalist luxury interior (high-end club/lounge)",
      "background_elements": "Horizontal black tiled walls or polished dark wood panels",
      "lighting_type": "Hard mirror flash",
      "lighting_effects": "Central starburst flare, high-contrast highlights, deep shadows",
      "film_texture": "Subtle film grain, low-light mobile photography look"
    },
    "technical_specifications": {
      "aesthetic": "Shot on iPhone, ultra-realistic",
      "resolution": "8k, high-resolution textures, raw photo quality",
      "artifacts": "Slight motion blur, natural digital noise",
      "focus": "Sharp focus on facial features despite flash flare"
    },
    "mood_and_style": {
      "theme": "It-girl nightlife, edgy, glamorous, expensive",
      "atmosphere": "Intimate, performative, high-fashion vanity"
    },
    "strict_constraints": {
      "face_integrity": "Do not alter eye shape, lip shape, or facial proportions",
      "lighting_retention": "Maintain flash flare in mirror",
      "prohibited_elements": "No smiling",
      "aspect_ratio": "3:4"
    }
  }
}

1,049