🎉The final brick in our open data lakehouse platform: OpenEngines. 🏄🌊
Over the past few years, we have been laying the foundation for something bold at Onehouse — a truly open-first architecture for the data lakehouse that can serve diverse data needs.
Effectively, a cloud service that can deliver what my past teams at Uber, LinkedIn, and other companies built internally to democratize and infuse data across the company and its products, at a fraction of the cost, and close the gaps to operationalize such a platform.
We started with the basics:
• Open file formats (Parquet, Orc, JSON, CSV, XML, Avro, …)
• Open table formats (Hudi, Iceberg, Delta)
• Interop across formats, not fragmentation (Apache XTable)
Then, we moved up the stack:
• Catalog interoperability with multi-catalog sync (proposed to XTable)
And today, we’re flipping the final switch — 🧠 Introducing Open Engines™: Now you can deploy best-in-class open source compute engines — Flink, Trino, Ray — directly on your open data, with zero friction, 10x lower ops costs, and way better performance than your self-installed OSS versions.
Because:
1️⃣ Open data is only half the story: To unlock true choice, portability, and innovation, compute needs to be open too. Otherwise, your data is just in a different fancy jail in open formats.
2️⃣ Spark or any other engine is not best at everything: We’re talking non-stop about AI, but that needs a strong foundation for your data, that is multi-engine ready. Check out our deep-dive comparison blogs for yourself.
3️⃣ Starting engine-first is fundamentally flawed: Your data is everything, and you need to move from open data -> open engine -> closed compute platforms. Not the other way around!
4️⃣Picking the wrong engine or lack of flexibility will cost you: We’ve debated endlessly on tiny bits of table metadata, what about the compute engine you spend millions of dollars on? At this scale, data and its needs are growing; even a 10-20% difference in cost-performance is a meaningful spend or saving.
With Open Engines, Onehouse is now the only “multi-engine, multi-cloud”, open data platform in the market.
The universal data lakehouse becomes a living, breathing product, not just a coveted architecture, accessible with a few clicks in your browser.
And it’s open — all the way through.
👉 Read the full story:
onehouse.ai/blog/announcing-…
🤝If you are more than curious, there’s a webinar where I’ll hang in chat:
onehouse.ai/webinar/your-dat…
🪖 Let’s build our data platforms the right way: open first, data first.
#OpenFirst #OpenEngines #DataLakehouse #OpenSource #ApacheHudi #ApacheIceberg #DeltaLake #ApacheXTable #OpenCompute #Onehouse #DataEngineering #Data #DataLake #BigData #Analytics #MachineLearning #DataScience #StreamProcessing