One thing I have learned while studying Databricks is that more learning does not always mean more progress.
When I started, I tried to learn everything at once. Spark, PySpark, Delta Lake, SQL, Workflows, Clusters, DataFrames, and Medallion Architecture. Every day felt like a race to cover more topics.
The problem was that I was consuming a lot of information but understanding very little. I could recognize concepts, but I struggled to connect them together.
A few weeks into the journey, I decided to slow down. Instead of asking, "How many topics can I finish today?" I started asking, "What concept do I understand better today than I did yesterday?"
That small change made a huge difference.
Reading a simple CSV file helped me understand DataFrames. DataFrames helped me understand transformations. Transformations helped me understand how data moves through a pipeline.
For the first time, learning started to feel connected instead of overwhelming.
BricksNotes has been helpful during this process because it keeps me focused on understanding one step at a time instead of jumping between random resources.
The biggest lesson so far?
Understanding beats memorization every single time.
What is one Databricks concept that took you longer than expected to truly understand?