dltHub is the creator of data load tool (dlt)

Joined November 2022
101 Photos and videos
Pinned Tweet
May 14
Building pipelines with AI usually means losing context between tools. dltHub AI Workbench runs the full 12-step workflow as a continuous session, schemas, incrementals, traces, transformations, and notebooks share context across the stack. dlthub.com/blog/agentic-data…
4
245
How much data can $1 of compute move? We benchmarked dltHub on a small worker (2 vCPU / 4 GB) loading into BigQuery: Parquet: ~170 GB Postgres: ~65 GB JSON: ~4.6 GB REST: whatever the API allows Methodology results ↓ dlthub.com/blog/benchmark-dl…
5
93
Data quality usually starts too late. By the time bad data hits a dashboard, the original business assumptions are gone. We're fixing that at the ingestion layer. 👇
1
5
130
The new dltHub AI Workbench Data Quality Toolkit starts from context your pipeline already knows: schema contracts, keys, constraints, and sampled values. Plain-language business rules → checks that run on every load. dlthub.com/blog/dq-toolkit-p…
2
95
Agentic Analytics Demo Night lands in San Francisco this Wednesday 🚀 We're joining @inngest, @lightdash_devs, and @streamkap for an evening of demos from teams building at the intersection of AI, analytics, and data infrastructure. 🧵
1
1
6
144
At dltHub, @elviskahoro will demo our new Transformations public preview. Expect demos from practitioners and startups building real-world AI and data systems, and a look at what agentic analytics looks like in practice. 📅 June 3 · SF luma.com/gl94l89s
5
117
What does it take to build a product people actually keep using? On Wednesday, we're hosting @sameer_alsakran, Founder & CEO of @metabase, at our office in Berlin for a live conversation with Francesco Mucio from Data Berlin. 🧵
1
1
6
263
Sameer started Metabase as a side project, waited years before charging, ignored conventional SaaS advice, and built a product now used by 90k companies with 8-figure ARR. We'll discuss OSS, AI, customer feedback, and building products that last. luma.com/bv3xvi4f
1
7
90
May 26
Berlin’s Applied AI Week is looking pretty special. Tomorrow, dltHub and @modal are hosting an evening in Berlin for founders, engineers, and builders shipping agents in production. Featuring Kenny Ning, Jefferson Girao & Alena Astrakhantseva.
1
5
133
May 26
We’ll cover: → turning agent traces into dashboards for AI adoption, cost, and real business impact → how teams use Modal Sandboxes to run agents safely, plus a live background agent demo 📅 May 27 · 6–8 PM GMT 2 📍 Berlin luma.com/ac6rt5od
1
4
115
dltHub retweeted
The open source AI stack walked into a bar in Silicon Valley. Ingestion, retrieval, and metadata all showed up. 🚀 The community came through at our event with @dltHub & @LanceDB for a night of technical talks built for engineers actually shipping AI in production. No fluff. Just engineers talking shop. Stay tuned for more community meetups!
1
5
99
dltHub retweeted
Three production stories from @GrabID, @dltHub, and @iFood. Three very different starting points. One shared foundation. 🤝 Our May Town Hall brings together the teams answering the same question from very different angles: how do you make context actionable for AI at scale? 📈 Register: luma.com/k0fri4bt
2
3
153
dltHub retweeted

4
9
61
21,132
dltHub retweeted
Some of the best open source AI conversations happen in person. We're joining @dltHub and @LanceDB in Menlo Park on May 21 for technical talks and demos from the engineers building the ML stack. Join us! 👉 luma.com/80pocni3
1
2
4
195
dltHub retweeted
Next Thur May 21: @lancedb x @dltHub x @DataHubCloud demystifying the missing data layer for ML: - Lance, the default storage layer for multimodal AI - dltHub, the connective tissue of your AI data stack - Supercharging Cortex with DataHub luma.com/80pocni3
1
2
4
131
May 18
The AI stack is evolving fast, but reliable data movement is still the foundation. Join dltHub, @LanceDB and @DataHubCloud on May 21 in Menlo Park for talks on multimodal AI storage, AI data pipelines, and trusted lineage systems. 🔗 luma.com/80pocni3
1
3
92
May 15
How fast can you go from zero to a production-ready data pipeline when AI is your copilot? Next Wednesday, @elviskahoro joins @temporalio alongside @nyghtowl and @cecilphillip for a live Vibe Check building a GitHub-powered pipeline with AI dlt.
1
4
90
May 15
In under 10 minutes, we’ll cover: - AI-assisted pipeline setup - GitHub → dlt workflows - Where AI helps vs where engineers still need to steer - How to inspect & validate pipelines 📅 May 20 · 10 AM PST 📍 Live on YouTube Watch live: luma.com/ir410q6l?tk=hO5ZL3
1
4
713
May 12
Explainer on ontology engineering and what we're building around it: why just clean schemas and prompts aren’t enough, and how adding a canonical model taxonomy ontology changes what agents can correctly compute (ARPU being the clearest example). dlthub.com/blog/ontology-eng…
2
101
Apr 30
We’re excited to share that Violetta Mishechkina will be speaking at GOSIM Paris 🇫🇷 Invited by @probabl_ai to join the “Own Your Data Science and AI” workshop. 🎤 From Agent Traces to Analytics Agents generate code, text, telemetry, yet most teams still rely on stale datasets.
1
2
108
Apr 30
We’ll show how agents can power data pipelines as code, turning traces into fresh, reliable datasets. Stack: dlt, LanceDB, Pydantic, Ibis, HuggingFace more → behind our agent evaluation platform. 📅 May 5 | 🕒 12:20–12:40 📍 Station F, Paris paris2026.gosim.org/schedule…
2
101