bidhan

Pinned Tweet

bidhan

@bidhan

May 28

We're releasing Paris 2.0, which, to our knowledge, is the world's first decentralized-trained video generation model. We benchmarked it against a monolithic model trained on the same data and compute budget, and Paris 2.0 outperformed the monolithic model by ~2x on the FVD benchmark.

[Video, 0:42]

114

667

449,730

bidhan

@bidhan

15h

world cup where sovereign ai labs from countries compete live on their model benchmarks

146

bidhan retweeted

gabrielanderson.eth (📓💰)

@gabrielanderson

18h

This is a scary place to find ourselves in, and for many of us in decentralized systems and technologies, the exact thing that brought us into the industry. @AnthropicAI's export controls is the most anti-American thing I can possibly think of and is a dangerous precedent. Jake wrote a great take on this, and @GPC_xyz did our small part backing a great team in @bidhan and @bageldotcom. Decentralized compute and AI infra, along w/ sovereign OS AI is essential, and I hope a lot of people are waking up to that fact this AM. What a time to be alive...

Jake Brukhman

@jbrukh

19h

Unlike many investors in crypto, I did not pivot to AI in the last few years. However, since 2020, I built some of the deepest understanding in this industry on the intersection of AI and decentralized networks (crypto, web3). From the start, it was very clear that AI models are a centralizing force and the biggest target for government control. That point became market fact last night, with @AnthropicAI’s export control compliance. As an investor in decentralized AI, I know that d-networks are a counterbalance to this state of affairs. In particular, the starting point of sovereign, open, public, decentralized AI is the seemingly insurmountable compute problem. How are people supposed to source more industrial compute for frontier training than these huge trillion dollar companies? The answer is simple: there is enough commodity GPU compute in the world to compete on the frontier, but to make use of it we need new algorithms for training. That’s what a few companies like @gensynai @PrimeIntellect @bageldotcom @Pluralis @NousResearch @MacrocosmosAI @covenant_ai set out to research, while everyone on the planet told them it was impossible. The result is that it is not only possible, but it can be cheaper and nearly as efficient as the alternative process. The second major problem is economic sustainability. Open source models are great, however, they are not economically viable as they don’t have a business model. So far in decentralized AI, only @Pluralis has an answer — by breaking up the weights of the model among participants, we create a business model for tokenized AI models. This is the moment of truth — will AI become fully centralized and fall under censorship and unilateral government control? Or will the AI world realize the importance of public AI on open decentralized networks?

413

bidhan

@bidhan

Jun 12

Thanks for the shoutout alongside Perplexity, Pika, and ElevenLabs. @TheRundownAI 🙌

bidhan

@bidhan

May 28

[Video, 0:42]

306

bidhan

@bidhan

Jun 11

if you haven't seen this message yet, unfortunately you're not working on something interesting.

117

bidhan

@bidhan

Jun 10

the whole thesis of decentralized training is more raw FLOPs from consumer devices and less reliance on high memory. and diffusion is uniquely capable of utilizing more flops and less memory.

bidhan

@bidhan

Jun 10

Diffusion is taking over the local/owned compute category by storm. DiffusionGemma architecture is significantly better for running local models.

409

bidhan

@bidhan

Jun 10

Diffusion is taking over the local/owned compute category by storm. DiffusionGemma architecture is significantly better for running local models.

Google Gemma

@googlegemma

Jun 10

Meet DiffusionGemma! An experimental open model that explores a fast approach to text generation, released under an Apache 2.0 license. Moving beyond sequential, token-by-token processes to generate entire blocks of text simultaneously. Here’s what’s new with DiffusionGemma: 👇

[Video, 0:05]

2,431

bidhan

@bidhan

Jun 6

restocked shirts, come get before we run out! @cvpr

1,026

bidhan

@bidhan

Jun 5

.@cvpr come by the @bageldotcom booth before we run out of merch!

715

bidhan retweeted

Gin Jiang @CVPR

@ZhiyingJ

Jun 5

I'm presenting Heterogeneous Decentralized Diffusion Models tomorrow at #CVPR2026! We train diffusion experts on separate single GPUs with no gradient sync, mixing DDPM and Flow Matching objectives so that each expert fits whatever data shard it owns. The trick: a closed-form, training-free, schedule-aware ε→v conversion fuses the mismatched objectives into one velocity space at test time, with a lightweight router picking which experts denoise what. It beats homogeneous ensembles on both quality and diversity. We see this as a step toward making large-scale generative training genuinely decentralized - no data center, no interconnect, just contributors with single GPUs. Come talk to us about where this goes next, including scaling it to video and world models 👋 📄 Paper: arxiv.org/abs/2603.06741 📍 Poster session 1, tomorrow, 10:45–12:45, Exhibit Hall

641
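A minimal sketch of what an ε→v conversion can look like, for readers unfamiliar with the idea. This is not the paper's method: it uses the standard identity for a Gaussian path x_t = α(t)·x0 + σ(t)·ε, whose velocity is v = α'(t)·x0 + σ'(t)·ε. The cosine schedule, the function names `cosine_schedule`, `eps_to_velocity`, and `fuse`, and the router weights are all assumptions made for illustration. The velocity here is expressed along the expert's own noising path; aligning it with a different flow-matching path would need extra time and scale matching that this sketch does not attempt.

```python
import math

def cosine_schedule(t):
    """Hypothetical variance-preserving schedule for x_t = alpha*x0 + sigma*eps.
    Returns (alpha, sigma, d_alpha/dt, d_sigma/dt) for t in [0, 1)."""
    h = math.pi / 2
    return (math.cos(h * t), math.sin(h * t),
            -h * math.sin(h * t), h * math.cos(h * t))

def eps_to_velocity(x_t, eps_hat, t, schedule=cosine_schedule):
    """Turn a noise prediction into a velocity prediction, elementwise.
    Step 1: recover x0 from x_t = alpha*x0 + sigma*eps.
    Step 2: apply v = alpha'*x0 + sigma'*eps."""
    alpha, sigma, d_alpha, d_sigma = schedule(t)
    velocity = []
    for x, e in zip(x_t, eps_hat):
        x0_hat = (x - sigma * e) / alpha  # breaks down where alpha == 0 (t == 1 here)
        velocity.append(d_alpha * x0_hat + d_sigma * e)
    return velocity

def fuse(velocities, weights):
    """Router-weighted average of per-expert velocity predictions."""
    total = sum(weights)
    dim = len(velocities[0])
    return [sum(w * v[i] for w, v in zip(weights, velocities)) / total
            for i in range(dim)]

# Sanity check with a known (x0, eps) pair: the converted velocity should
# match the analytic velocity of the path.
x0, eps, t = [1.0, -2.0], [0.5, 0.25], 0.3
alpha, sigma, d_alpha, d_sigma = cosine_schedule(t)
x_t = [alpha * a + sigma * e for a, e in zip(x0, eps)]
v_from_eps = eps_to_velocity(x_t, eps, t)
v_exact = [d_alpha * a + d_sigma * e for a, e in zip(x0, eps)]
fused = fuse([v_from_eps, v_exact], weights=[0.7, 0.3])
```

Because the conversion is closed-form, it needs no retraining: an ε-predicting expert and a v-predicting expert can be averaged in the same space once both outputs are velocities.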

bidhan retweeted

Richard Hanania

@RichardHanania

Jun 4

India has sent 96 people to America who started billion dollar companies. No one else is even close. There's only about 5 million Indians in America. Almost one in 50,000 of them is a unicorn founder! What a holy, special, beautiful people. I will always fight for them.

1,114

2,609

14,220

1,005,840

bidhan

@bidhan

Jun 4

bagel labs news cvpr edition

342

bidhan

@bidhan

Jun 3

bidhan

@bidhan

Jun 3

heading to @CVPR with the Bagel Labs team. if you want some elite merch (we spent a month designing it), discuss world models, physical ai, distributed training -- I'm your guy 🫡

979

bidhan retweeted

Ankur Nagpal

@ankurnagpal

Jun 3

If you’re over the age of 30 and going to tech week parties Why

422

101,322

bidhan

@bidhan

Jun 3

heading to @CVPR with the Bagel Labs team. if you want some elite merch (we spent a month designing it), discuss world models, physical ai, distributed training -- I'm your guy 🫡

1,742

bidhan

@bidhan

May 31

what if I told you there’s a subsection of the AI industry which has 1000x the TAM of LLMs?

347

bidhan

@bidhan

May 30

a friend let me know that bagel labs was trending yesterday

bidhan

@bidhan

May 28

[Video, 0:42]

615

bidhan

@bidhan

May 28

come to the bagel labs paper session and booth at CVPR!

Gin Jiang @CVPR

@ZhiyingJ

May 28

we found that the decentralized diffusion framework works for video generative models too! Still an early attempt, and a lot of open research questions left to explore. Would love to dig into it more next week at CVPR :)

770

bidhan

@bidhan

May 28

The quality of the videos themselves is far from SOTA like VEO, Seedance, etc. But the point we wanted to prove is that the video generation objective, with its nuances like temporal coherence, time dimension, character consistency, etc., can be trained in a distributed way without shared clusters. And the mathematical evidence shows that this recipe will scale.

bidhan

@bidhan

May 28

[Video, 0:42]

2,182

bidhan retweeted

bagel.com

@bageldotcom

May 28

Today we're releasing Paris 2.0, to our knowledge the first decentralized-trained video generation model.

At Bagel Labs, we believe frontier models should not require homogeneous clusters of premium, supply constrained GPUs. Paris 1.0 proved this for image generation. Paris 2.0 extends the recipe to video generation and lays the substrate for global-scale world models.

To test the approach, we trained two models head-to-head in an iso-FLOP, iso-data comparison. One was a monolithic model trained conventionally, on a single premium GPU cluster. The other was Paris 2.0, trained across an extreme mix of GPU types, generations, and vendors distributed around the globe.

Against the monolithic model under matched data and compute, the results were:

FVD: 561.04 → 279.01 (a ~2x improvement)
CLIP text-video alignment and aesthetic score both improved.

To our knowledge, this is the first distributed training architecture to surpass its monolithic counterpart under matched data and compute.

Technical Report: arxiv.org/abs/2605.26064
Model Weights: huggingface.co/bageldotcom/p…

bidhan

@bidhan

May 28

[Video, 0:42]

5,675
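For readers checking the "~2x" figure in the Paris 2.0 announcement above, here is a small worked calculation using only the two FVD numbers quoted in the post. FVD (Fréchet Video Distance) is a distance, so lower is better. The function names are illustrative and not taken from the technical report.

```python
# FVD values quoted in the announcement; lower is better.
FVD_MONOLITHIC = 561.04
FVD_PARIS_2 = 279.01

def improvement_factor(baseline: float, candidate: float) -> float:
    """How many times smaller the candidate's score is than the baseline's."""
    return baseline / candidate

def relative_reduction(baseline: float, candidate: float) -> float:
    """Fraction of the baseline score that the candidate removed."""
    return (baseline - candidate) / baseline

factor = improvement_factor(FVD_MONOLITHIC, FVD_PARIS_2)
reduction = relative_reduction(FVD_MONOLITHIC, FVD_PARIS_2)
print(f"{factor:.2f}x lower FVD, a {reduction:.1%} reduction")
# prints "2.01x lower FVD, a 50.3% reduction"
```

So "~2x" here means the FVD was roughly halved, not that any quality score doubled.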

bidhan retweeted

mirian

@mirimayer

May 28

we're releasing Paris 2.0, the first video generation model trained across decentralized GPUs

instead of relying on one massive expensive cluster, Paris 2.0 was trained on a mix of GPUs distributed around the world - and it outperformed the traditional setup by ~2x

so proud of our team 🥯

bidhan

@bidhan

May 28

[Video, 0:42]

875