Fastest open-source tool for replicating Databases to Apache Iceberg or Data Lakehouse

Joined March 2022
69 Photos and videos
Thanks for getting the data to the bucket now let's run analytics on it

1
91
OLake by Datazip retweeted
Happens NEXT WEDNESDAY in SF: Lakehouse, AI and Iceberg Meetup! Join @_olake, @RisingWaveLabs, @datastrato, @ryft, and @dremio as we explore how real-time data, lakehouses, AI, and Iceberg power the next generation of data platforms. Save your spot: luma.com/g76usnrz
1
2
4
232
👀
2
84
Join in people @cappybaradeploy and @praveenscience will be guiding you to the right path of open source!
I was once genuinely confused about open source. I contributed to two #GSoC organisations, spent weeks writing long proposals, and still faced rejection. Where to start? How to contribute? Is this even for me? If you’ve felt this, you’re not alone.
1
2
140
Olake Community Call - Jan 28 (4:30 PM IST) We’re announcing new source integrations, sharing updates on MOR → COW improvements and Kubernetes execution upgrades. If you’re looking to contribute, pick up new issues, or understand where Olake is headed next, this call is for you
1
1
83
AI agents don’t run on models alone they run on data. We recently hosted @andrewmadson11 ( @fivetran ) to dig into why data quality governance matter more than model size. @ApacheIceberg makes data predictable versioned, structured, and trustworthy . Better data>bigger models
1
72
Real-time data, AI, and @ApacheIceberg - this is where the next-gen stack comes together. Save your spot and come say hi 👋 Looking forward to some great speaker talks great conversations!
📍Feb 11 in SF - Lakehouse, AI & Iceberg Meetup Join @_olake, @RisingWaveLabs, @datastrato, @ryft & @dremio to explore how real-time data, AI, lakehouses & Iceberg are powering the next-gen data stack. Tech talks→ drinks→ great convos 🎟️Save your spot: luma.com/g76usnrz
2
57
Join us at our second open-source event. 2026 is the year to grow, learn, and contribute through meaningful open-source work. As part of SWOC 2026, If you’re an intermediate level data engineer and looking to strengthen your open-source profile this is a great place to start.
This one’s going straight into my 2026 recap. I’m serving as project admin for @_olake for the second time, and this year we’ve been selected as an organization-level repository at Social Winter of Code (SWOC) 2026. If you’re registered (or planning to be maybe in #GSOC ), this is a solid chance to start contributing early. I’ve seen how consistent, quality open-source work genuinely helps when it comes to internships and early-career roles.I’ve shared a quick walkthrough video covering: How to get started with OLake How to find and choose the right issues How org-level contribution workflows actually work How to approach open source with a long-term mindset We’re also building an active Slack community (link in comments) if you want to engage beyond PRs. If you’re aiming for programs like GSoC or just want to strengthen your open-source profile, the same fundamentals apply. At OLake, we’re building a high-performance data ingestion platform and welcoming contributors who want to learn and grow. DMs are open. 2026 is shaping up to be a strong year for open source.
2
80
Closing out 2025 with a big milestone What started as conversations around @ApacheIceberg and Open Lakehouse turned into meetups, webinars, deep technical discussions and a strong community. Huge thanks to every engineer, architect, and open-source contributor who supported.
2
59
We just wrapped up one of the final meetups of the year this one in collaboration with @cloudera . Great conversations around building scalable data platforms, with deep dives into @ApacheArrow writers, next-gen execution engines. More talks joining us soon in 2026.Stay tuned.
3
131
Start building!
Preparing for Google Summer of Code (GSOC)? This is a great place to start. One thing I strongly believe in is that GSoC preparation isn’t about waiting for the program to open it’s about starting early and building real open-source contributions that actually show up on your profile as that's where the program managers shortlist your from :) At @_olake , we’ve opened up a set of open-source bounties along with good first issues, specifically designed to help you start contributing in a structured and beginner-friendly way. Here’s how you can get started: 1️⃣ Open-source bounties & good first issues We’ve curated issues that are easy to start with while still being meaningful. The bounties range from $20 to $40 and go up to $100, depending on the contribution and impact. 2️⃣ Tech stack If you’re working with Java or Golang, this can be a great starter repository to gain hands-on experience with production-grade open-source code. 3️⃣ Join our Slack All conversations, discussions, and reviews happen on our Slack. This is where you’ll interact with maintainers and other contributors and get unblocked quickly. 4️⃣ Pick an issue & start contributing Go through the open issues, find the ones that match your skill set and interests, and start working on them. There’s no pressure just steady learning and shipping. 5️⃣ Share your PR on Slack Once you’ve raised a pull request, drop it on Slack. Our developers will review your work, guide you where needed, and help you improve. At the end of the day, Olake is all about open source, collaboration, and building in the open. If GSoC is on your radar, this is a solid way to start building a real and visible contribution history. 👉 Slack link is in the comments. Start contributing #GSOC
2
66
Helping every contributor along the way! If you are interested in open source contributions we will provide you a platform to build with us the world's fastest data migration tool . Where not only goodies wait, bounties over $50-100 all ready to be picked up and resolved !
Thanks @_olake for sending this
1
2
89
OLake by Datazip retweeted
At our recent @ApacheIceberg Community Meetup in India, we also sat down with Narsingh from @fivetran someone who’s been closely observing how data teams across India are evolving their architectures. Our conversation touched on a few themes we’re seeing more and more across the ecosystem: • How Apache Iceberg adoption is accelerating in India as teams move toward open, interoperable table formats • How meetups like these are helping practitioners exchange ideas, share real production learnings, and shape the future of Iceberg adoption locally Great to see the energy building around Iceberg, and even better to hear these insights directly from the people driving real-world implementations. More conversations coming soon as we continue strengthening this community and documenting the shifts happening in the data space. Huge thanks to our partners for powering this edition alongwith @_olake - @awscloud @puppyquery @Minio @e6data @FireboltHQ @devrelsquad_ @fivetran
2
4
103
The support from the team at @FireboltHQ and others made it all possible . The ecosystem for Iceberg is flourishing in india and across the world!
At our @ApacheIceberg Community Meetup in India, we had the chance to connect with some of the most active voices in the ecosystem including Pascal Schulze from @FireboltHQ . We spoke with him about two key things shaping the modern data world: • How companies are leaning into open-source technologies and why Apache Iceberg is becoming a core part of new data stacks • His takeaways from the meetup, the conversations that stood out, and what this growing community means for the future of Iceberg adoption in India More conversations and insights coming soon as we continue to build and support this community together.
1
58
India’s First Official @ApacheIceberg Meetup wrapped! An incredible day for the community with great talks and deep discussions. Huge thanks to our partners for powering this edition: @awscloud , @puppyquery , @Minio , @e6data , @FireboltHQ , @devrelsquad_ , and @fivetran .
2
9
202
A great start but still not an end ! More meetups coming soon Community>>
Data engineering connects all parts of AI, development and software engineering — probably a data engineer One such great example was our @ApacheIceberg community meetup in India the first official one we’ve conducted. Bangalore being the heart of tech, the meetup brought together data engineers, AI engineers and builders across the ecosystem. And while the pics show it housefull, it was equally full of ideas, conversations and learning. Coffee in hand or lunch on the table everywhere you looked, something meaningful was being discussed. We also had a solid speaker lineup covering everything across the Iceberg ecosystem and how it’s evolving across real use cases. And at the end thanks a lot for joining @_olake in a successful execution. Huge shout-out to all the partners who made this possible: @awscloud @puppyquery @Minio @e6data @FireboltHQ @devrelsquad_ Missed it? 20th December keep your calendars empty.
2
46
OLake by Datazip retweeted
India’s first @ApacheIceberg meetup has commenced Huge thanks to all the partners that joined @_olake in the execution @awscloud @e6data @Minio @puppyquery @devrelsquad_ @fivetran
1
3
5
194
OLake by Datazip retweeted
I recently had the chance to host a fireside chat with two incredible CTOs — @jacopotagliabue CTO at @Bauplan_labs and @cto_datazip Shubham Baldava from OLake — and it turned into one of the most honest and insightful conversations I’ve had around the modern data stack. One of the big questions I asked was: Why @ApacheIceberg ? Why not @apachehudi or @DeltaLakeOSS ? Jacopo shared Bauplan’s perspective how Iceberg’s community-first approach, its open governance, and its independence from any vendor ecosystem make it the format that aligns with how today’s data systems should evolve. Shubham added his side coming from his experience working with Hudi at scale at PayPay, he talked about the practical pain points, what worked, what didn’t, and why OLake made the decision to be exclusively built on Apache Iceberg. And honestly, hearing him break down that pivot was eye-opening. These were just a few of the many interesting questions we went into. If you’re curious about where data engineering is headed, and want to hear perspectives from people who’ve built real systems at massive scale, you can check out the full recording. And if you want to be part of these conversations in person, we’re also hosting an Apache Iceberg Meetup this Saturday would love to see you there. #DataEngineering #ApacheIceberg #OLake #Bauplan #FiresideChat #Op
1
1
4
99
We are sorry
3
78