Tweets

Sujee Maniyam retweeted

dylan ツ

@demian_ai

Jun 4

America's best open model shipped today (550 billion parameters) and it's serving tokens on @nebiustf. Nemotron 3 Ultra is the most intelligent open-weights model the US has shipped. NVIDIA gave away the weights, the data, and the recipes. The intelligence is now a free, downloadable file. That is the headline, and it is real. But a free file is not a product. This model was co-designed for NVFP4 on Blackwell. The thing that makes it fast is the format and the hardware path, and that only pays off on a serving stack tuned to exploit it. Download the weights and run them naively, and you leave most of the model on the floor. The work that turns the file into fast, cheap tokens is cache-aware routing, disaggregated prefill and decode, speculative decoding shaped to your traffic, dedicated capacity, regional isolation. The conversion layer. Nobody clones that in an afternoon. That layer is what a token factory is. The model is the crude. The factory is the yield. Nemotron 3 Ultra is live on Nebius Token Factory today, tuned to run the way NVIDIA built it to run. The weights are everyone's. The throughput is the part you come here for. Jensen shipped the weights, we shipped the throughput.

Nebius Token Factory

@nebiustf

Jun 4

NVIDIA Nemotron™ 3 Ultra is now live on Nebius Token Factory. It’s built for long-running agents across coding, deep research and enterprise workflows. Nemotron 3 Ultra delivers frontier reasoning with up to 5x faster inference and up to 30% lower cost for agentic workloads. For builders, the next question is production: performance, reliability, economics and control. Run it today: tokenfactory.nebius.com/mode…

Sujee Maniyam

@sujee_dev

Jun 5

Speaking at @pydatalondon this week - Saturday June 6. Talk: "Using coding agents with open models" Coding agents like Cursor and Claude Code are genuinely changing how we build software. But most teams still run them on proprietary models by default - not always because open models aren't good enough, but because pairing the two takes a bit of work to get right. That is exactly what I’ll be showing. We’ll use open models running on @nebiustf, plug them into coding agents, and look at what actually matters in practice: Developer experience, Model behavior, Setup patterns, Best practices. Live demos - not just slides. And yes, attendees will get platform credits so you can try it yourself after the session. If you’re at PyData London, come say hi. Bring your questions, your coding-agent war stories, and your opinions on open models. 📍 Convene Sancroft, St. Paul's · Grand Hall 2 🗓️ June 6 · 16:15–17:00 🔗 pretalx.com/pydata-london-20… #pydata #pydatalondon

Sujee Maniyam retweeted

Eigen AI

@Eigen_AI_Labs

May 20

🚀 New #1 on the @ArtificialAnlys Kimi K2.6 leaderboard: Eigen AI at 265 tok/s — in collaboration with @Nebius Token Factory. @nebiustf On B200. Not GB300. Topping the chart without Blackwell Ultra isn't a silicon story — it's a serving stack story. Every layer of EigenInference is co-designed for trillion-param MoE. More coming 🔥 artificialanalysis.ai/models…

Sujee Maniyam

@sujee_dev

May 19

Your production logs aren't just logs... they are TRAINING DATA. See how the @nebiustf Data Lab feature lets you capture and curate production logs and use them for training. Links below:

Sujee Maniyam

@sujee_dev

May 19

Try it: tokenfactory.nebius.com/data… Docs: docs.tokenfactory.nebius.com…

Sujee Maniyam

@sujee_dev

Apr 29

I love it when a room full of hands goes up when I ask, “who’s using open models?” That was the vibe at the Nebius.Build.Berlin (@nebiusai) event yesterday. @demian_ai and I spoke about "Engineering open models for production at scale". We covered some of the work in Nebius Token Factory (@nebiustf): - KV cache management across GPU and CPU memory - speculative decoding (training draft models) - disaggregated prefill (decoupling prefill and decode to better utilize compute and memory) - strong support for post-training. Great questions, great discussions. Thanks to everyone who showed up and made it awesome. PS: Was a perfect sunny ☀️ spring day in Berlin PPS: got legit good chocolate 🍫 - top-tier swag

Sujee Maniyam

@sujee_dev

Apr 7

If you’re at HumanX conf in San Francisco ... Find the 'hidden' @nebiusai Speakeasy Reward: a delicious drink 🍹 Stop by Nebius booth #1127 Reward: Nebius Token Factory @nebiustf credits 🎁

Sujee Maniyam retweeted

Nebius

@nebiusai

Mar 31

Headed to HumanX? Be sure to join our Masterclass Session on April 8 at 4:30 PM (Hall D-10) to break down the full engineering stack behind production LLMs. 📌 Plus don't miss: • Live demos at our Booth #1127 • Meeting the team at our speakeasy See you there!

Sujee Maniyam

@sujee_dev

Mar 27

Remember when we were all reworking websites to actually function on smartphones? Now we’re doing it again - but this time, for agents 🤖 At our Nebius.Build.SF Hackathon, the Injestor team - Benjamin Shyong, Vishal Verma, Alex Shirazi - built a tool to make websites (originally made for humans) work better with agents - and took 1st place in a seriously competitive field (200 attendees, 70 projects). Their stack: - Models on Nebius Token Factory @nebiustf (they liked Nemotron3-Super for speed and good performance) - @tavilyai for pulling web content - And (my favorite) a Karpathy-style agent loop to iteratively optimize sites for agent use. Agents building sites for agents - pretty meta 😄 As part of the win, we invited them to join us at the @nebiusai booth at #nvidiagtc. It was great fun! Check out Benjamin's post for more details and a cool video: lnkd.in/gihJ85rt and injester.com/ Great geeking out with you all - excited to see where this goes next.

Sujee Maniyam retweeted

waqas

@waqasmakhdum

Mar 23

I'm hiring a (US-based) Hackathons Lead to turn hackathons into a repeatable growth engine at @Nebius. Goal is not just more hackathons, it’s to systemize, scale and deliver the best hackathon program in the industry. If that's you (or know someone) pls DM or tag them here

Sujee Maniyam

@sujee_dev

Mar 23

Your LLM logs = training data 👀 But most teams don’t use them effectively. See how Nebius Token Factory (@nebiustf) turns production logs into better models. 🗓️ March 26 🕐 10AM PDT / 12PM EDT / 6PM CET 👉 nebius.com/events/webinar-fr… Can’t make it live? No worries - register anyway and we’ll send you the recording and a 🎁.

Sujee Maniyam

@sujee_dev

Mar 20

Packed workshop on "synthetic data generation using Nvidia data designer" at Nvidia GTC, using open models (gpt-oss-120b, nemotron-3-super, kimi-2.5) powered by Nebius Token Factory (@nebiustf). Great work @johnnypgreco and team 👏

Sujee Maniyam

@sujee_dev

Mar 19

🐍 Snakes at the Nebius booth @ Nvidia GTC '26🐍 …not that kind of snakes 😄 It’s the classic Snake game - but each snake is controlled by an LLM in real time. Powered by Nebius Token Factory @nebiustf 📍 Last day of the conference 🎮 Try different models and watch them compete live Stop by the @nebiusai booth 👀 #Snakegame #LLMGames

Sujee Maniyam

@sujee_dev

Mar 17

Demoing Nebius Token Factory (@nebiustf) to a "potential future customer" 🤖 at #nvidiagtc 😁 @UFBots @nebiusai

Sujee Maniyam retweeted

Nebius

@nebiusai

Mar 17

“Nebius will take care of you,” — Jensen Huang, @NVIDIA’s Founder and CEO, in conversation with our CRO Marc Boroditsky at the Nebius booth here at #NVIDIAGTC. We will.

Sujee Maniyam

@sujee_dev

Mar 17

Come check out the cool robots (powered by Nebius) from @UFBots at the @nebiusai booth at #nvidiagtc

Sujee Maniyam retweeted

Evan

@StockMKTNewz

Mar 16

Nvidia $NVDA CEO Jensen Huang is in the Nebius $NBIS booth right now at GTC
