Matt Topol

Matt Topol

67 Photos and videos

Tweets

Pinned Tweet

Matt Topol @zeroshade

8 Nov 2023

Time for @oredev !! If you're here don't miss my talk today at 10:10 on using @ApacheArrow with #ml workflows! Looking forward to a day of interesting talks and discussions

567

Alex Monahan

Matt Topol retweeted

Alex Monahan @__AlexMonahan__

May 6

In SF during Snowflake Summit June 1-3? Duck out (ha!) to The Dive! Hear from rockstars at Anthropic, Braintrust, Lovable, Hex, & more. See the future of lakehouses with @J_ , creator of Apache Parquet, and @zeroshade, founder at Columnar (& me!) Register! thedive.motherduck.com/

MotherDuck | The Cloud Data Warehouse Built on DuckDB

The modern cloud data warehouse powered by DuckDB. Serverless SQL analytics with no infrastructure to manage—query your data in seconds. Start free.

motherduck.com

533

Columnar

Matt Topol retweeted

Columnar @columnar_tech

Mar 15

columnar.tech/blog/zero-copy…

Zero-copy, zero contest

Comparing ADBC performance against the alternatives on two Arrow-native systems: BigQuery and DuckDB

columnar.tech

Columnar

Matt Topol retweeted

Columnar @columnar_tech

Mar 15

The fastest operation is the one you don’t have to do. When a database natively supports @ApacheArrow, ADBC can speed up fetching and ingestion by eliminating costly row/column conversions. How much faster is it in practice? We ran some benchmarks to find out. Link below 👇

ALT An abstract hyperspace warp image inspired by the comedic "going plaid" effect from the 1980s cult film "Spaceballs".

362

Dipankar Mazumdar

Matt Topol retweeted

Dipankar Mazumdar @Dipankartnt

Mar 5

If you aren't paying attention to some of the Apache Spark Acceleration projects like Gluten, you should! Gluten just graduated as a Top-Level Project @TheASF

449

Columnar

Matt Topol retweeted

Columnar @columnar_tech

Jan 29

Fetch query results without ODBC / JDBC bottlenecks. The new ADBC driver for @databricks is now in early release. Install it with dbc. Details in comments.

ALT dbc install databricks

165

す

Matt Topol retweeted

す @ktou

27 Nov 2025

私の稼働が空いちゃったので、私と一緒になにかお仕事したい人からの連絡を待っています！須藤、空いています - 2025-11-27 - ククログ clear-code.com/blog/2025/11/…

須藤、空いています - 2025-11-27 - ククログ

須藤です。Ruby関連の開発をしたり、Groonga関連の開発をしたり、Apache Arrow関連の開発をしたりしています。この3年くらいApache Arrowの開発がメインの業務でしたが、いろいろあって7割くらいなくなりました。お仕事のネタがないわけではないのでなにかしらでなくなった分を埋めることはできるのですが、せっかくの機会なのでどんなことをできる可能性があるのかを検討したいと思...

clear-code.com

8,354

Supermetal

Matt Topol retweeted

Supermetal @SupermetalInc

5 Nov 2025

🚀 Launching Supermetal — data replication that just works. Sync databases to warehouses in real-time or batch — no Kafka, no JVM, no Debezium. Built in Rust & Apache Arrow. Try it → trial.supermetal.io Launch post → supermetal.io/blog/launch #dataengineering #rustlang

2,231

Bessemer

Matt Topol retweeted

Bessemer

@BessemerVP

29 Oct 2025

Data and AI are evolving fast, but much of today’s infrastructure still runs on standards from the 90s. @columnar_tech, from the team behind Apache Arrow, is bringing an Arrow-native protocol (ADBC) that moves data 10–100× faster across systems like Snowflake and DuckDB. We're excited to lead Columnar's $4M seed round. Read the full Q&A to learn more: bessemervp.team/47d8gWk

2,772

The New Stack

Matt Topol retweeted

The New Stack

@thenewstack

3 Nov 2025

A new startup, @columnar_tech, looks to streamline the copying of tabular data across systems, using @ApacheArrow and the ADBC API. By @Joab_Jackson thenewstack.io/apache-arrows…

Apache Arrow's Final Frontier: Replacing Outdated Database Drivers

A new startup, Columnar, looks to streamline the copying of tabular data across systems, using Apache Arrow and the ADBC API.

thenewstack.io

2,584

Matt Topol

Matt Topol @zeroshade

29 Oct 2025

Come join the community with our launch! Learn more and come talk to us about data connectivity with @ApacheArrow !!

Ian Cook @ianmcook

29 Oct 2025

The future of data connectivity is columnar. Today we launched @columnar_tech to accelerate the shift from slow, row-oriented APIs like ODBC and JDBC to >10x faster alternatives powered by @ApacheArrow. Learn more 👉 columnar.tech/blog/announcin…⚡️

153

Ian Cook

Matt Topol retweeted

Ian Cook @ianmcook

23 Oct 2025

ODBC is getting tired. It can't keep up with the fast new kids in the data world these days. The next generation is ready to take the torch. Meet ADBC, a fast, modern data connectivity standard built on @ApacheArrow. Watch my talk from the @CMUDB seminar: youtu.be/TjlmNGNx77E

Where We’re Going, We Don’t Need Rows: Columnar Data Connectivity...

CMU Database Group - Future Data Systems Seminar Series (Fall 2025)...

youtube.com

13,316

Spiral

Matt Topol retweeted

Spiral

@SpiralDB

11 Sep 2025

We're building the data infrastructure that AI actually needs. Current systems were built for humans reading dashboards. But an H100 can consume 4 million images per second. The future isn't human-scale. It's machine-scale. Introducing Spiral: Data 3.0 🌀 1/8

383

64,866

Andrew Lamb

Matt Topol retweeted

Andrew Lamb @andrewlamb1111

21 Aug 2025

Its happening -- DataFusion will (finally) get spilling hash joins. The march to completeness begins

jonathanc-n

@jonathanc_n

20 Aug 2025

I'd like to start using this platform as a place to post about open source work I do on my off time. To lead it off, I have posted a hash join spilling proposal in Apache Datafusion. Check it out if you're interested 😀: github.com/apache/datafusion…

6,656

Columnar

Matt Topol retweeted

Columnar @columnar_tech

18 Aug 2025

In September the @columnar_tech crew are headed to @PyDataParis 2025 and the first ever @ApacheArrow Summit. The organizer @QuantStack is a dedicated supporter of Apache Arrow. We’re delighted to be sponsoring the event.

QuantStack @QuantStack

17 Jan 2025

PyData Paris will be back in 2025 ! 🎉 📆 Sept 30th & Oct 1st 2025 📍Cité des Sciences et de l’Industrie Thanks to our early supporters @hopsworks and @UnivParisSaclay's Graduate School of Computer Science. @NumFOCUS @PyData @PyDataParis pydata.org/paris2025

272

Wes McKinney

Matt Topol retweeted

Wes McKinney

@wesmckinn

18 Jul 2025

Excited to see continued improvements in embedded query processing in the @ApacheArrow C project: arrow.apache.org/blog/2025/0…

Recent Improvements to Hash Join in Arrow C

A deep dive into recent improvements to Apache Arrow’s hash join implementation — enhancing stability, memory efficiency, and parallel performance for modern analytic workloads.

arrow.apache.org

4,800

Philippe Noël

Matt Topol retweeted

Philippe Noël

@philippemnoel

15 Jul 2025

1/5. @paradedb has raised a $12M Series A to bring Elasticsearch workloads to Postgres. 🧵 techcrunch.com/2025/07/15/pa…

ParadeDB takes on Elasticsearch as interest in Postgres explodes amid AI boom | TechCrunch

ParadeDB built a Postgres extension that facilitates full-text search and analytics on Postgres without the need to transfer data.

techcrunch.com

382

47,359

Bauplan

Matt Topol retweeted

Bauplan

@Bauplan_labs

16 Apr 2025

🚀 Introducing Bauplan A serverless, code-native platform for building data and AI pipelines — directly on your object store. No clusters. No notebooks. No GUI based workflows. Just Python SQL S3. 👉 bauplanlabs.com/blog/hello-b…

13,533

Atlanta Cloud Conference

Matt Topol retweeted

Atlanta Cloud Conference @AtlCloudCon

14 Apr 2025

Please join @zeroshade at the Atlanta Cloud Conference on April 26th for Apache Arrow: The Great Library Unifier. Register at ticketleap.events/tickets/de…

Wes McKinney

Matt Topol retweeted

Wes McKinney

@wesmckinn

27 Mar 2025

I’m excited about xorq! Ibis and DataFusion brought together to orchestrate multi-engine data pipelines, all powered by @ApacheArrow github.com/xorq-labs/xorq

GitHub - xorq-labs/xorq: Executable memory system for tabular data work

Executable memory system for tabular data work. Contribute to xorq-labs/xorq development by creating an account on GitHub.

github.com

5,943

Jaana Dogan ヤナドガン

Matt Topol retweeted

Jaana Dogan ヤナドガン

@rakyll

7 Feb 2025

A lot of people are ignoring that Go is becoming a commonly used language for prompting pipelines. Python in prototypes and Go in production is another common combo.

Viktor Eriksson @cviktore

6 Feb 2025

Me and the team at @lovable just spent two months rewriting 42,000 lines of code from Python to Go. Technical deep dive of why we did it what this means: // 1

426

40,089