Delta Lake is an open-source storage framework that enables building a Lakehouse architecture for Spark, Flink, Trino, Hive, Scala, Java, Rust, Python, & more!

Joined April 2019
From 2017 to now, Delta Lake has grown to 40M downloads/month and powers daily processing of hundreds of exabytes. At #DataAISummit, “The Road to Delta 5.0” will cover what’s next: 🔹 catalog-first Delta 🔹 Data Source V2 modernization 🔹 Delta + Iceberg convergence 🔹 Delta Kernel alignment (Java + Rust) Details: databricks.com/dataaisummit/… #DeltaLake #OpenSource
Back by popular demand! 📘 Delta Lake: The Definitive Guide book signing returns to Data + AI Summit 2026. If you are at Data + AI Summit, come meet the authors, say hi to the Delta Lake community, and pick up a signed copy while supplies last. 🗓️ Tuesday, June 16, 2:00–2:30 PM 📍 Dev Lounge, Data + AI Summit Expo, Moscone Center Books tend to go quickly, so plan to stop by early. Hope to see you there! 👋 #DeltaLake #DataAISummit #OpenSource #DataEngineering @dennylee @newfront @Data_AI_Summit
Delta Lake + Apache Iceberg co-evolved as parallel standards. That era is ending. 👇 🔹 Iceberg v4 adaptive metadata tree + single-file commits 🔹 Delta Lake 5.0 adopts it as native content metadata 🔹 One on-disk format, no translation layers 🔗 Add it to your schedule: databricks.com/dataaisummit/… #DeltaLake #DataAISummit #ApacheIceberg
At Data + AI Summit (June 15–18), Scott Sandre will show how Catalog-Managed Tables support was designed in Delta Kernel. 👇 🔹 A unified, catalog-agnostic API for connectors to build against 🔹 Deep engine integrations for DuckDB, Delta-Spark, and Delta-Flink 🔹 Catalog complexity kept out of Kernel: the one right Delta abstraction 🔗 Session details: databricks.com/dataaisummit/… #DeltaLake #DeltaKernel #DataAISummit
Data + AI Summit session: Tyler Croy (delta-rs maintainer) will introduce Virtual Delta Tables and the associated open source code designed for multimodal inference. 👇 🗓️ June 15-18 📍 San Francisco 🔗 Session details: databricks.com/dataaisummit/… #DeltaLake #MultimodalAI #DataAndAISummit #Lakehouse
Headed to Data + AI Summit (June 15–18)? 🗓️ Don't miss Your Guide to Open Table Formats: Delta, Iceberg, Best Practices, and What's Next! Benjamin Mathew & Scott Sandre will cover Delta 5.0, Iceberg v4, Unified Delta Kernel (GA), and best practices across formats. Add it to your agenda. 👇 databricks.com/dataaisummit/… #DeltaLake #ApacheIceberg #DataAISummit
Delta Lake and Apache Iceberg™ have converged on similar ideas (columnar metadata, manifest trees, deletion vectors), but two separate metadata structures still duplicate work. At DAIS (June 15–18, San Francisco): the next evolution of Delta Lake metadata. 👇 🔹 Delta commits → Iceberg v4’s adaptive metadata tree 🔹 Tree-structured manifests + Iceberg interoperability 🔹 Transactional guarantees Delta users depend on, preserved 🔗 databricks.com/dataaisummit/… #DeltaLake #ApacheIceberg #DataandAISummit
DuckDB's Delta + @unitycatalog_io extensions are no longer experimental 👇 🔷 INSERT via ATTACH (TYPE delta) 🔷 Time travel: AT (VERSION => n), VERSION, PIN_SNAPSHOT 🔷 Catalog Managed Tables + Catalog Commits for concurrent writes 🔷 Incremental snapshot loading (nightly; v1.5.3 next) Learn more: delta.io/blog/2026-05-06-del… #DeltaLake @duckdb
Headed to Data + AI Summit? Don't miss New Foundations of Delta Lake with Kernel + Spark's DSv2. Delta on Spark DSv1 set the standard for the past 8 years. Rahul Potharaju & Tathagata Das will highlight the move to DSv2 and new foundations for the next decade. 🗓️ June 15-18 📍 San Francisco Add it to your agenda 👇 databricks.com/dataaisummit/… #DeltaLake #DataAISummit
.@ClickHouseDB is data lake ready. 👇 On the Delta Lake blog, the ClickHouse team shares how they integrated delta-kernel-rs: transaction logs, metadata, snapshots, Engine APIs, writes, schema evolution, time travel, and CDF (25.12). 🔗 Read the post: delta.io/blog/2026-05-18-int… #DeltaLake #ClickHouse
Delta Lake Community Meetup [May 2026] x.com/i/broadcasts/1qJDzPkqw…

📣 Join us for the next Delta Lake community meetup on May 19! Agenda: 🔹 Issues & PR backlog 🔹 Unified Kernel 🔹 Community highlights Register: luma.com/deltalake-0519 #DeltaLake #OpenSource #Rust #OpenLakehouse
Delta Lake retweeted
[Ep 1] Open Lakehouse + AI: The Catalog Layer, Interoperability & AI-Agent Governance Learn how composable lakehouses center the catalog for metadata, versioning, and commit coordination. 👇 🔸 Interop: Iceberg REST, Unity Catalog (OSS), Spark, Delta Lake, Delta-RS, Iceberg. 🔸 Governance: credential vending, row/col filters, column masks, audits, and why “no governance was the easiest governance.” 🔸 Agents as data customers (Temporal). Fine-grained access beats blanket credentials on 24/7 workloads. 🎥 Full episode: youtube.com/watch?v=dEFAkS7v… #OpenLakehouse #UnityCatalog @ApacheSpark @ApacheIceberg @DeltaLakeOSS
Delta Lake 4.2 pushes Delta Kernel further (new Flink connector, richer types) and hardens catalog-managed tables (atomic ops, SQL evolution, UniForm). 👇 🔹 Kernel / Flink: New Kernel Flink connector (experimental): catalog-managed, catalog-coordinated writes, exactly-once. Replaces legacy connector deprecated in Delta Lake 4.0. 🔹 Schema (SQL): INSERT INTO … BY NAME + autoMerge adds columns in-commit. delta.stats.skipping.forceOptimizeStatsCollection → data skipping on new columns without OPTIMIZE. 🔹 Types: Geospatial, Collation, Variant (shredding out of preview); Spark: full Variant schema conversion. 🔹 Catalog: RTAS / dynamic partition overwrite: one atomic commit. UniForm: Iceberg metadata at commit (reads immediate). HMS UniForm deprecated. 🔗 Read more → delta.io/blog/2026-04-17-del… #DeltaLake #OpenSource
Delta Lake Community Meetup [April 2026] x.com/i/broadcasts/1MJgNgyOq…

The next evolution of Delta - Catalog-Managed Tables: delta.io/blog/2026-02-02-del…

Hey all! Thanks for joining us. Where are you tuning in from?
Delta Lake 4.2.0 is now available! 🚀 Dive into the full release notes here: 👉 github.com/delta-io/delta/re… Huge thanks to everyone in the Delta community who made this release possible! 🎉 #DeltaLake #OpenSource #ApacheSpark #ApacheFlink
Here is a breakdown of what’s new: 🌟 [Spark] Unity Catalog Managed Table enhancements: REPLACE TABLE / RTAS and Dynamic Partition Overwrite support, automatic table schema/properties sync to catalog on table creation. 🌟 [Spark] Delta Spark V2 connector - streaming read (experimental): enhanced streaming read capabilities for catalog-managed tables, supporting critical options like startingTimestamp and skipChangeCommits. 🌟 [Flink] New Kernel-based Flink connector (experimental): a brand-new Kernel-based delta-flink connector that enables Apache Flink to read, write, and interact with catalog-managed Delta tables. 🌟 [Kernel] Geospatial, Variant GA, and Collations table feature: Delta Kernel can now read and write tables using geometry/geography types with bounding-box data skipping, generally available Variant columns, and collated string types. 🌟 [Security] Version hardening fix: The Delta project has undergone a substantial hardening effort across multiple surface areas, including stronger validation and dependency security scanning to proactively reduce supply-chain risk.