Making the impossible - POSSIBLE

Joined June 2023
Brea Browne retweeted
Today, Nebius broke ground on its flagship AI factory campus in Independence, Missouri – the company's first gigawatt-scale digital infrastructure project in the US. The campus will support ~1,200 construction jobs and 130 permanent roles, alongside a community benefits plan covering local education and workforce development. State and local leaders, economic development partners, and community members joined company representatives to mark the occasion. Read the full release: nebius.com/newsroom/nebius-b…
Brea Browne retweeted
A portal is opening to your very own world... 🧵
Fantastic work, team - proud to share that Cogito v2.1 was trained on @nebiusai 🦾🦾🦾
4/ For Cogito v2.1, we fork off the open-licensed DeepSeek base model from November 2024. This is an obvious choice for a pretrained base model, as the DeepSeek architecture has an ecosystem of cheap inference built around it. We have built a frontier training stack as an early-stage startup because we can stand on the shoulders of open source champions like @huggingface, @togethercompute, @runpod and @nebiusai, as well as stellar contributions by @Microsoft, @Meta, @nvidia and a lot of other folks in open source. Over the last months, we have iterated on and refined our post-training strategies of self-play RL (called Iterated Distillation and Amplification - IDA) with Cogito v1 and v2. You will see high-quality responses from Cogito v2.1 that are a bit different from those of usual models - we increase the model’s intelligence prior and teach it how to think via process supervision. As a result, reasoning chains are significantly shorter. We also use less markdown and less verbosity. In short, we want to make the model great for API usage - faster, with fewer tokens and super high quality.
teaching models how to search, not just what to predict - brilliant work @drishanarora
It is intuitively easy to understand why self-play *can* work for LLMs, if we are able to provide a value function at intermediate steps (although not as clearly guaranteed as in two-player zero-sum games). In chess / Go / poker, we have a reward associated with every next move, but as Noam points out, natural language is messy. It is hard to define a value function at intermediate steps like tokens. As a result, in usual reinforcement learning (like RLVR), LLMs get a reward only at the end. They end up learning to 'meander' more for hard problems. In a way, we reward brute-forcing toward the right answer with more tokens as if it were the right approach. However, at @DeepCogito, we provide a signal for the thinking process itself. Conceptually, you can imagine this as post-hoc assigning a reward to better search trajectories. This teaches the model to develop a stronger intuition for 'how to search' while reasoning. In practice, the model ends up with significantly shorter reasoning chains for harder problems in a reasoning mode. Somewhat surprisingly, it also ends up being better in a non-thinking mode. One way to think about it is that since the model knows how to search better, it 'picks' the most likely trajectory better in the non-thinking mode.
Brea Browne retweeted
Guess how long after signing the pivotal deal it took our CEO to ask: “Why hasn’t this (relatively small) new customer received an answer yet?” Less than 1 hour. That’s the obsession we need to keep as we build a true multi-customer cloud. The MSFT deal is the fuel for growth.
AMAZING work @DeepCogito team!!! 🔥 You think, therefore you are - and I'm sure this is just the beginning 🧠
Today, we are releasing 4 hybrid reasoning models of sizes 70B, 109B MoE, 405B, 671B MoE under an open license. These are some of the strongest LLMs in the world, and serve as a proof of concept for a novel AI paradigm - iterative self-improvement (AI systems improving themselves). The largest 671B MoE model is amongst the strongest open models in the world. It matches or exceeds the performance of both the latest DeepSeek v3 and DeepSeek R1 models, and approaches closed frontier models like o3 and Claude 4 Opus.
Brea Browne retweeted
Anyone at ICML wanna see the future of world models? We're walking around with a laptop running our world model at 500 FPS (fully local). Would love to demo/chat with anyone interested
Brea Browne retweeted
We'll be at CVPR tomorrow! Interested in chatting about open science diffusion world models? Reach out!
Packing up to head to @NVIDIAGTC - what's the thing you can't go to a conference without? Besides caffeine of course 😅 #GTC25
Brea Browne retweeted
17 Mar 2025
We’re here at @NVIDIA #GTC25 in San Jose! Our first tech talk — on how we rebuilt our AI Cloud from scratch — takes place at 3:00 PM today. Join us in room SJCC 212A. Tomorrow, the expo halls open. Stop by booth 809 to meet our leaders and tech experts!
✨ Gartner IOCS 2024 is just around the corner! Connect with @weka to learn how to unlock new levels of performance, efficiency, and cost savings. 📅 Dec. 10–12, 2024 📍The Venetian, Las Vegas, NV 🔗 Book your meeting now! 👇 #GartnerIOCS sprou.tt/1ErLVsBnL7n

🌊 Ready to ride the GenAI wave? @weka President Jonathan Martin shares how businesses can act fast to harness #GenAI innovation—or risk being left behind. 🚀 Check out his insights in Fast Company NOW #AI #Leadership #Innovation #WEKA sprou.tt/1qm2KeqjYEo
Byte-Sized Joke Thursday! 🧀 Why did the legacy data platform lose the race? 🏁 Because it couldn’t match @weka’s speed! 🚀 WEKA is setting new benchmarks for #AI and #HPC workloads with record-breaking performance and unmatched efficiency. 💡 sprou.tt/17IyXRxA1hT
Headed to #SC24 in Atlanta?! Join @weka for a night of F-U-N and see Jimmy Eat World perform! Register now 👇🎟️
🚀 Ready to turbocharge your #AI game? Check out @weka's latest data platform appliances—WEKApod Nitro & WEKApod Prime! Built to fuel your AI innovation and keep your data moving at lightning speed. ⚡️ 🔗 weka.io/data-platform/enviro…
💀 Hey Deadheads! Ever wonder how Dead and Company’s unforgettable #DeadForever show at Sphere came to life? 🎶 Dive into the behind-the-scenes magic with @weka and Treatment Studio that made it all possible. #DeadAndCompany #TechAndMusic #WEKA sprou.tt/1SBTRUqnUWw
💡 Need some fresh #AI insights? @weka's 2024 Global #TrendsinAI webinar is now available on-demand! Packed with actionable strategies to help you master the evolving AI landscape. 🚀 Watch now! sprou.tt/1iFRBfFBWC7
🚀 Have you checked out WEKApod Nitro and WEKApod Prime? These game-changing @weka data platform appliances bring top-tier performance & cutting-edge specs to the table. 🔥 Dive into their features & see how WEKA is leveling up like never before! sprou.tt/1el8MKSvcGh
🚀 Introducing WEKApod Nitro & WEKApod Prime! 🔥 These new, turnkey data platform appliances combine @weka’s high-performance software with best-in-class hardware for a powerhouse experience! 💥 Learn more below! 👇 sprou.tt/1AyWk3CXWbu