ORION LAUNCHES:
@macrocosmosai unveils Orion-100B at
@proofoftalk on the 1 year anniversary of
@IOTA_SN9 launch in this very building.
They asked: Can frontier-scale AI training be distributed?
On Monday, the team published Orion-100B, an early pretraining run designed to test exactly that. At Proof of Talk co-founders
@macrocrux and
@WSquires took the audience through the results.
Orion-100B represents the largest distributed LLM pretraining run conducted over the open internet to date. The run trained a 100-billion-parameter model architecture across geographically distributed infrastructure.
This was not an attempt to produce a finished frontier model. Orion-100B processed approximately 1.1 billion training tokens over a two-day period before being stopped. The objective was simpler, and arguably more important: demonstrate that training at 100B scale is possible without relying on a single, centralized cluster of GPUs.
For Macrocosmos, the result represents the culmination of more than a year of work.
The version of the story shared on stage was refreshingly unglamorous. The team launched Subnet 9 in 2025 targeting a 15B parameter model and quickly discovered that building distributed training systems in a permissionless environment is harder than it looks. The network struggled. Assumptions broke. The architecture was reworked.
So instead of pushing forwards, they scaled backwards. For months, the team trained smaller 1.5B parameter models, running more than 700 experiments in the process. By their own admission, it wasn't particularly exciting. But it allowed them to harden the networking layer, improve fault tolerance, increase throughput and gradually remove bottlenecks from the system.
Only then did they begin scaling again. 8B. 18B. Then 100B.
The result was Orion.
According to the figures presented, the run achieved average model FLOP utilisation of 30.8%, roughly 65% of the speed of equivalent co-located infrastructure, at a third of the cost. More importantly, the learning dynamics remained stable throughout the run, even as the system handled synchronisation and communication across dozens of distributed devices.
The technical depth in the room was high. Questions came in on heterogeneous compute stacking, minimum GPU yield thresholds, the reconstructability of sharded weights, architectural constraints on the ResBLM approach. Halfway through,
@const_reborn materialised in the audience and started grilling them on the economics and fault tolerance at scale. They took it in elegant stride.
The Macrocosmos thesis is that the future of AI training does not necessarily belong to ever-larger datacentres. If distributed systems can become sufficiently efficient, they could unlock vast pools of underutilised compute spread across the world.
Today's Orion run was intentionally conservative. The GPUs were distributed, but still professionally provisioned. The next stages of Project Orion will progressively introduce heterogeneous hardware, interruptible spot instances, permissionless participation and eventually consumer-grade devices.
Whether distributed training ultimately reshapes the economics of AI remains to be seen. But either way, Macrocosmos has moved the conversation beyond whether distributed training is possible and towards how far it can scale.
The presentation also provided useful context for
@bitstarterai's newly announced ML Track, which was unveiled by
@macrozack at the same roundtable event and is linked by a common theme - how Bittensor attracts and supports the next generation of machine learning teams.
Throughout the event,
bitstarter.ai outlined its approach to reducing the barriers to entry for researchers looking to build on the network, combining subnet funding, infrastructure support, compute resources, partnerships and incubation.
If Macrocosmos' journey demonstrates what is possible once a team is established on Bittensor,
bitstarter.ai's ambition is to help more teams make that journey in the first place.
For now, however, the spotlight belonged to Orion.
After more than 750 experiments, a year of iteration, and a successful 100B-scale demonstration, Macrocosmos has moved distributed training a little further out of the realm of theory and a little closer to reality.
And judging by the reaction in the room, plenty of people were paying attention.