Aakash Gupta

Aakash Gupta

20 Photos and videos

Tweets

Mark Lyons retweeted

Aakash Gupta

@aakashgupta

23 Jul 2024

True:

464

99,594

Marc Brooker

Mark Lyons retweeted

Marc Brooker @MarcJBrooker

26 Mar 2024

Microsecond-accurate time is now available in EC2 US East. So many cool things this makes possible: aws.amazon.com/about-aws/wha…

Amazon Time Sync Service now supports microsecond-accurate time in US East (N. Virginia) Region -...

Discover more about what's new at AWS with Amazon Time Sync Service now supports microsecond-accurate time in US East (N. Virginia) Region

aws.amazon.com

148

19,401

Mark Lyons

Mark Lyons @mcl5tech

29 Jun 2023

Anyone looking for a new SA opportunity DM me and I can intro you to Roger Frey! (Great team & Roger is fantastic!!) lnkd.in/edKZsu-b

147

Mark Lyons

Mark Lyons @mcl5tech

27 Apr 2023

Verifying myself: I am markclyons on Keybase.io. 2RdVlnBARFNGHkBQEWYYppwhlr0zvyetUhBV / keybase.io/markclyons/sigs/2…

145

Mim

Mark Lyons retweeted

Mim @mim_djo

19 Feb 2023

TPCH-SF30 ; 180 million rows #AZURE D16DS_V5; 16 Cores, 64 GB RAM #Databricks Photon 41 S #DuckDB : 43 second Query Parquet files from the VM SSD, no Azure storage involved Databricks Software cost (not hardware) 4.4 $/Hour github.com/djouallah/Testing…

8,317

Dipankar Mazumdar

Mark Lyons retweeted

Dipankar Mazumdar @Dipankartnt

1 Dec 2022

Join @dremio’s Tech advocacy & Eng team for the very first installment of the @ApacheIceberg Office Hours 📆 🚀 We will kick-off with a brief presentation on Copy-on-Write Vs Merge-on-Read strategies, followed up by Q&A on anything Iceberg related. When: December 7th, 12 PM

Alex Merced | Open Data Lakehouse Advocate

Mark Lyons retweeted

Alex Merced | Open Data Lakehouse Advocate

@AMdatalakehouse

19 Nov 2022

Reminder, if you want to learn more about Apache Iceberg I have loads of resources plus a video series all curated in this article. -> dremio.com/subsurface/apache… #BigData #DataLake #DataLakehouse

Apache Iceberg 101 - Your Guide to Learning Apache Iceberg Concepts and Practices | Dremio

Discover Apache Iceberg with a comprehensive 101 course and resources covering concepts, features, hands-on exercises, and real-world applications.

dremio.com

Dipankar Mazumdar

Mark Lyons retweeted

Dipankar Mazumdar @Dipankartnt

17 Nov 2022

Query planning in @ApacheIceberg Being able to efficiently plan queries is super critical for faster execution of the queries run by analysts 🧑🏻‍💻 This is specifically critical when dealing with large-scale data such as data in data lakes. Read @IcebergDevs 👇 #dataengineering

Dipankar Mazumdar

Mark Lyons retweeted

Dipankar Mazumdar @Dipankartnt

15 Nov 2022

The @ApacheArrow project has grown in all axes 🚀 In fact, more & more tools/libraries in the #dataanalytics space have started using Arrow. In this blog post, we go through the evolution of Apache Arrow from usage, capability & community angles. dremio.com/blog/apache-arrow…

Alex Merced | Open Data Lakehouse Advocate

Mark Lyons retweeted

Alex Merced | Open Data Lakehouse Advocate

@AMdatalakehouse

14 Nov 2022

My latest article on Apache Iceberg compaction strategies -> dremio.com/subsurface/compac… #BigData #dataengineering #datalake #DataLakehouse

Compaction in Apache Iceberg: Fine-Tuning Your Iceberg Table’s Data Files | Dremio

Explore compaction in Apache Iceberg for optimizing data files in your tables. Learn how to fine-tune and boost data performance.

dremio.com

Dipankar Mazumdar

Mark Lyons retweeted

Dipankar Mazumdar @Dipankartnt

10 Nov 2022

Manage data as code? Just like Git but for Data? That's right! @projectnessie is an open source work that brings the capabilities of Git-like branching to the world of data & specifically to data lake table formats like #ApacheIceberg #dataengineering

Dremio

Mark Lyons retweeted

Dremio @dremio

7 Nov 2022

We're thrilled to announce that we've been named to @CNBC’s ‘Top Startups for the Enterprise’ Inaugural List 🎉 Read more about our open data lakehouse and this inaugural list here: bwnews.pr/3UehuIN #CNBC #TopStartup #Tech

Dremio Named to CNBC’s ‘Top Startups for the Enterprise’ Inaugural List

Dremio, the easy and open data lakehouse, today announced it has been named to CNBC’s Top Startups for the Enterprise list. CNBC has debuted this list of 25 ...

businesswire.com

Dremio

Mark Lyons retweeted

Dremio @dremio

2 Nov 2022

Are you heading to AWS re:Invent later this month? Check out this link for all the details on how you can: ➡️ Schedule a meeting with us ➡️ Enter our Dremio Cloud data challenge (for a chance to win a PS5!) ➡️ RSVP to our cocktail reception awsreinventdremio2022.splash… #AWSreInvent

Alex Merced | Open Data Lakehouse Advocate

Mark Lyons retweeted

Alex Merced | Open Data Lakehouse Advocate

@AMdatalakehouse

28 Oct 2022

If you find what you see interesting here is a tutorial I wrote giving you a step by step guide getting setup and doing an example exercise -> dremio.com/blog/managing-dat…

Git for Data with Dremio's Lakehouse Catalog: Easily Ensure Data Quality in Your Data Lakehouse |...

Learn how to manage Git for Data with Dremio and Arctic. This blog post guides you through ensuring data quality in your data lakehouse effortlessly.

dremio.com

Dipankar Mazumdar

Mark Lyons retweeted

Dipankar Mazumdar @Dipankartnt

27 Oct 2022

How do we migrate from one catalog to another for @ApacheIceberg tables? if you are already using a catalog (say HDFS) & want to change it to something else (say AWS Glue), how is that possible? A 🧵 for @IcebergDevs #dataengineering

Dremio

Mark Lyons retweeted

Dremio @dremio

20 Oct 2022

With all the recent news about #ApacheIceberg we thought we'd share this video from last year's Subsurface Conference. We're looking for speakers for our event happening in spring 2023 🎤 submit your talk today! sessionize.com/subsurface-li… #CallForSpeakers

0:49

Dipankar Mazumdar

Mark Lyons retweeted

Dipankar Mazumdar @Dipankartnt

11 Oct 2022

Merge-On-Read (MOR) Vs Copy-On-Write (COW) in @ApacheIceberg. Both these approaches are used to deal with deletes & updates of data files in the Data lake. Let’s break down @IcebergDevs👇 #DataEngineering #data

Dremio

Mark Lyons retweeted

Dremio @dremio

10 Oct 2022

Don't miss your chance to take the stage at Subsurface LIVE, coming in the Spring of 2023 🎉 We’re accepting proposals now for key topics. See details submit your proposal now 🎤 lnkd.in/gMXtSTSJ #CallForSpeakers #Data #ApacheIceberg #DataLakehouse

Mark Lyons

Mark Lyons @mcl5tech

1 Oct 2022

Always great to catch up with people who have depth in the data space to share the stories from academic papers to how companies have been created. Thanks @juansequeda @TimGasper

Juan Sequeda

@juansequeda

1 Oct 2022

A Data Catalog is like the parent to the data who makes sure that you go grow, be successful, have fun while being safe. This is the result of our beer discussion with @TimGasper and @mcl5tech in Austin (after Big Data London)

Dremio

Mark Lyons retweeted

Dremio @dremio

29 Sep 2022

Subsurface LIVE is back! Coming in the Spring of 2023 🎉 We’re accepting proposals now for key topics. See details submit your proposal now 🎤 lnkd.in/gMXtSTSJ #CallForSpeakers #Data #ApacheIceberg #DataLakehouse