Web-scale data collection on Bittensor’s subnet 13, powered by @MacrocosmosAI

Joined October 2025
Pinned Tweet
Introducing `dv` - a Rust CLI for querying real-time social data from X & Reddit. Powered by Bittensor SN13's decentralized miner network.

```
dv search x -k bitcoin -l 100
```

One command. Live data. No middleman. Open source. Built for agents. 🧵👇
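For anyone calling `dv` from a script or agent, a minimal Python sketch of wrapping the command might look like the following. It uses only the subcommand and flags that appear in the tweet (`search x`, `-k`, `-l`). Reading `-k` as the keyword and `-l` as the result limit is an assumption, and the helper names `build_dv_search_args` and `run_dv_search` are hypothetical. The output format is not described in the tweet, so stdout is returned unparsed.

```python
import shutil
import subprocess
from typing import List, Optional


def build_dv_search_args(platform: str, keyword: str, limit: int) -> List[str]:
    """Build the argv for `dv search` using only the flags shown in the tweet.

    Assumption: -k is the keyword and -l is the maximum number of results.
    """
    if limit <= 0:
        raise ValueError("limit must be a positive integer")
    return ["dv", "search", platform, "-k", keyword, "-l", str(limit)]


def run_dv_search(platform: str, keyword: str, limit: int) -> Optional[str]:
    """Run the command and return raw stdout, or None if `dv` is not on PATH."""
    if shutil.which("dv") is None:
        return None
    result = subprocess.run(
        build_dv_search_args(platform, keyword, limit),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError on a non-zero exit code
    )
    return result.stdout


if __name__ == "__main__":
    # Reproduces the command from the tweet as an argv list.
    print(build_dv_search_args("x", "bitcoin", 100))
```

Passing the arguments as a list rather than a shell string avoids quoting problems when a keyword contains spaces.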
Data Universe ・ SN13 retweeted
Today, we are launching the first stage of Project Orion. Our early pre-training run of Orion-100B achieves upward of 65% of data-center training efficiency on hardware costing a fraction of the price. Orion-100B is the first proof point for a simple idea: that underutilized compute around the world can be turned into frontier training capacity. We believe that this work presents, for the first time, an economically compelling case for training large models using distributed approaches.
Data Universe ・ SN13 retweeted
HackQuest x Bittensor Co-Learning Camp | India Recap 🇮🇳
150 registrations, 80 attendees, 45 graduates, 30 winners.
Over 5 days at Galgotias University, builders came together to:
⚡ Learn the fundamentals of @opentensor
🛠 Complete HackQuest learning tracks
⛏️ Become miners on Data Universe (SN13) @Data_SN13 & Sportstensor (SN41) @sportstensor
🤝 Build alongside mentors, developers, and future founders
But don’t just take it from us: check out some firsthand reflections from builders who experienced the camp themselves 👇
Get ~70 tweets per second on any topic you’re interested in with the SN13 API.
Data Universe ・ SN13 retweeted
“We want to orchestrate the world’s compute, in the same way Bitcoin did, but to train models that can rival ChatGPT” @macrocrux on the @twistartups podcast, sharing the motivations behind @IOTA_SN9: to train frontier models efficiently and at scale, in an age where CapEx for centralized training is reaching unsustainable highs.
SN13 On-Demand is scaling fast! In April alone, we’ve processed 147k jobs - averaging ~9k/day and growing. This is API usage only - dataset jobs are not included. Most of this traffic comes from the Dataverse CLI: github.com/macrocosm-os/data…
Data Universe ・ SN13 retweeted
Today, we present ResBM (arxiv.org/pdf/2604.11947), a 128x activation compression technique for achieving SOTA training results in low-bandwidth, distributed communication settings for pipeline parallel training across the internet. This technology underpins @IOTA_SN9 - our distributed training platform.
Data Universe ・ SN13 retweeted
Training frontier models over the internet requires new techniques. Today, we present ResBM, a residual encoder-decoder bottleneck architecture that enables 128x activation compression for low-bandwidth distributed pipeline parallel training. Developed for @IOTA_SN9, we show SOTA compression without significant loss in convergence rates, increases in memory, or compute overhead. Expect the full paper release in the next 72 hours.
Languages change fast. Especially online, where words and phrases evolve rapidly on social media. Tracking this is a huge part of linguistics and #SocialScience in the digital age. It’s also a key use-case for Data Universe.
Optimise your #SocialListening by setting up bespoke data collection jobs, powered by our SN13 #Bittensor miners.
☑️ Select words and phrases you’re interested in
☑️ Choose which platforms to listen for (X, Reddit, or both)
☑️ Collect real-time results from the web, formatted in CSV files (perfect for external analysis).
Use Data Universe for your social science research today.
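As an illustration of the "external analysis" step, here is a minimal Python sketch that counts how many collected posts mention each tracked phrase. The CSV layout is an assumption: the column names `text` and `platform` and the sample rows are made up for this example and may not match what Data Universe actually exports.

```python
import csv
import io
from collections import Counter
from typing import Iterable, TextIO


def count_phrases(csv_file: TextIO, phrases: Iterable[str], text_column: str = "text") -> Counter:
    """Count the rows whose text contains each phrase (case-insensitive)."""
    phrases = list(phrases)
    counts = Counter({phrase: 0 for phrase in phrases})
    for row in csv.DictReader(csv_file):
        text = row[text_column].lower()
        for phrase in phrases:
            if phrase.lower() in text:
                counts[phrase] += 1
    return counts


# Hypothetical sample data; real exports may use different columns.
SAMPLE = """text,platform
"That fit is so rizz",x
"no cap, best game this year",reddit
"No cap this is the one",x
"""

if __name__ == "__main__":
    print(count_phrases(io.StringIO(SAMPLE), ["no cap", "rizz"]))
```

To run it against a real export, open the file with `open(path, newline="", encoding="utf-8")` and pass the handle in place of the `io.StringIO` sample.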
Use dataverse-cli with Hermes Agent to scrape, search, and create social-media datasets
We're migrating the infrastructure for Gravity jobs. Existing jobs may be cancelled during the transition. Jobs will resume by tomorrow midday.
Built for the agentic era. `dv commands` outputs a full JSON schema of every command, flag, and API mapping - so Claude, Copilot, or any LLM can use it natively.
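A tool or agent consuming that output would first need to load the JSON and inspect its shape. The sketch below shows one cautious way to do that in Python. The tweet does not describe the schema's structure, so the code makes no assumption about specific keys. The function name `top_level_shape` is hypothetical, and the input in the demo is a placeholder string, not real `dv commands` output.

```python
import json


def top_level_shape(raw_json: str) -> str:
    """Describe the top-level structure of a JSON document without assuming its keys."""
    doc = json.loads(raw_json)
    if isinstance(doc, dict):
        return "object with keys: " + ", ".join(sorted(doc))
    if isinstance(doc, list):
        return f"array of {len(doc)} items"
    return f"scalar of type {type(doc).__name__}"


if __name__ == "__main__":
    # Placeholder input; in practice this would be the captured stdout of `dv commands`.
    print(top_level_shape('{"b": 1, "a": 2}'))
```

Inspecting the shape before relying on particular fields keeps the consumer from breaking if the schema layout differs from what was expected.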