Developer (1st) Advocate (2nd) @ Nebius | AI ML Data Engineer | Open Source contributor | Technical Instructor | Author | Speaker

Joined February 2025
Sujee Maniyam retweeted
America's best open model shipped today (550 billion parameters) and it's serving tokens on @nebiustf. Nemotron 3 Ultra is the most intelligent open-weights model the US has shipped. NVIDIA gave away the weights, the data, and the recipes. The intelligence is now a free, downloadable file. That is the headline, and it is real. But a free file is not a product. This model was co-designed for NVFP4 on Blackwell. The thing that makes it fast is the format and the hardware path, and that only pays off on a serving stack tuned to exploit it. Download the weights and run them naively, and you leave most of the model on the floor. The work that turns the file into fast, cheap tokens is cache-aware routing, disaggregated prefill and decode, speculative decoding shaped to your traffic, dedicated capacity, regional isolation. The conversion layer. Nobody clones that in an afternoon. That layer is what a token factory is. The model is the crude. The factory is the yield. Nemotron 3 Ultra is live on Nebius Token Factory today, tuned to run the way NVIDIA built it to run. The weights are everyone's. The throughput is the part you come here for. Jensen shipped the weights, we shipped the throughput.
NVIDIA Nemotron™ 3 Ultra is now live on Nebius Token Factory. It’s built for long-running agents across coding, deep research and enterprise workflows. Nemotron 3 Ultra delivers frontier reasoning with up to 5x faster inference and up to 30% lower cost for agentic workloads. For builders, the next question is production: performance, reliability, economics and control. Run it today: tokenfactory.nebius.com/mode…
Speaking at @pydatalondon this week - Saturday June 6. Talk: "Using coding agents with open models" Coding agents like Cursor and Claude Code are genuinely changing how we build software. But most teams still run them on proprietary models by default - not always because open models aren't good enough, but because pairing the two takes a bit of work to get right. That is exactly what I’ll be showing. We’ll use open models running on @nebiustf , plug them into coding agents, and look at what actually matters in practice: Developer experience, Model behavior, Setup patterns, Best practices. Live demos - not just slides. And yes, attendees will get platform credits so you can try it yourself after the session. If you’re at PyData London, come say hi. Bring your questions, your coding-agent war stories, and your opinions on open models. 📍 Convene Sancroft, St. Paul's · Grand Hall 2 🗓️ June 6 · 16:15–17:00 🔗 pretalx.com/pydata-london-20… #pydata #pydatalondon

Sujee Maniyam retweeted
🚀 New #1 on the @ArtificialAnlys Kimi K2.6 leaderboard: Eigen AI at 265 tok/s — in collaboration with @Nebius Token Factory. @nebiustf On B200. Not GB300. Topping the chart without Blackwell Ultra isn't a silicon story — it's a serving stack story. Every layer of EigenInference is co-designed for trillion-param MoE. More coming 🔥 artificialanalysis.ai/models…
Your production logs aren't just logs... they are TRAINING DATA. See how the @nebiustf Data Lab feature lets you capture and curate production logs and use them for training. Links below:
I love it when a room full of hands goes up when I ask, “who’s using open models?” That was the vibe at the Nebius.Build.Berlin (@nebiusai) event yesterday. @demian_ai and I spoke about "Engineering open models for production at scale". We covered some of the work in Nebius Token Factory (@nebiustf ) - KV cache management across GPU and CPU memory - speculative decoding (training draft models) - disaggregated prefill (decoupling prefill and decode to better utilize compute and memory) - strong support for post-training. Great questions, great discussions. Thanks to everyone who showed up and made it awesome. PS: It was a perfect sunny ☀️ spring day in Berlin. PPS: got legit good chocolate 🍫 - top-tier swag
If you’re at HumanX conf in San Francisco ... Find the 'hidden' @nebiusai Speakeasy. Reward: a delicious drink 🍹 Stop by Nebius booth #1127. Reward: Nebius Token Factory @nebiustf credits 🎁
Sujee Maniyam retweeted
Mar 31
Headed to HumanX? Be sure to join our Masterclass Session on April 8 at 4:30 PM (Hall D-10) to break down the full engineering stack behind production LLMs. 📌 Plus don't miss: • Live demos at our Booth #1127 • Meeting the team at our speakeasy See you there!
Remember when we were all reworking websites to actually function on smartphones? Now we’re doing it again - but this time, for agents 🤖 At our Nebius.Build.SF Hackathon, the Injestor team - Benjamin Shyong, Vishal Verma, Alex Shirazi - built a tool to make websites (originally made for humans) work better with agents - and took 1st place in a seriously competitive field (200 attendees, 70 projects). Their stack: - Models on Nebius Token Factory @nebiustf (they liked Nemotron3-Super for speed and good performance) - @tavilyai for pulling web content - And (my favorite) a Karpathy-style agent loop to iteratively optimize sites for agent use. Agents building sites for agents - pretty meta 😄 As part of the win, we invited them to join us at the @nebiusai booth at #nvidiagtc. It was great fun! Check out Benjamin's post for more details and a cool video: lnkd.in/gihJ85rt and injester.com/ Great geeking out with you all - excited to see where this goes next.
Sujee Maniyam retweeted
I'm hiring a (US-based) Hackathons Lead to turn hackathons into a repeatable growth engine at @Nebius. The goal is not just more hackathons, it’s to systemize, scale and deliver the best hackathon program in the industry. If that's you (or you know someone), pls DM or tag them here
Your LLM logs = training data 👀 But most teams don’t use them effectively. See how Nebius Token Factory (@nebiustf) turns production logs into better models. 🗓️ March 26 🕐 10AM PDT / 12PM EDT / 6 PM CET 👉 nebius.com/events/webinar-fr… Can’t make it live? No worries - register anyway and we’ll send you the recording 🎁.
Packed workshop on "synthetic data generation using Nvidia data designer" at Nvidia GTC. Using open models (gpt-oss-120b, nemotron-3-super, kimi-2.5) powered by Nebius Token Factory (@nebiustf) Great work @johnnypgreco and team 👏
🐍 Snakes at the Nebius booth @ Nvidia GTC '26🐍 …not that kind of snakes 😄 It’s the classic Snake game - but each snake is controlled by an LLM in real time. Powered by Nebius Token Factory @nebiustf 📍 Last day of the conference 🎮 Try different models and watch them compete live Stop by the @nebiusai booth 👀 #Snakegame #LLMGames
Demoing Nebius Token Factory (@nebiustf ) to a "potential future customer" 🤖 at #nvidiagtc 😁 @UFBots @nebiusai
Sujee Maniyam retweeted
Mar 17
“Nebius will take care of you,” — Jensen Huang, @NVIDIA’s Founder and CEO, in conversation with our CRO Marc Boroditsky at the Nebius booth here at #NVIDIAGTC. We will.
Come check out cool robots (powered by Nebius) from @UFBots at the @nebiusai booth at #nvidiagtc
Sujee Maniyam retweeted
Nvidia $NVDA CEO Jensen Huang is in the Nebius $NBIS booth right now at GTC