Filter
Exclude
Time range
-
Near
The more I learn Databricks, the more I realize that progress is not about covering more topics. It is about understanding the fundamentals. A few weeks ago, I was focused on advanced concepts like Delta Lake, data pipelines, and optimization. But recently, some of my biggest learning moments have come from working with simple CSV files, DataFrames, and schemas. What surprised me most is how much depth exists in the basics. A simple question like "How does Spark identify data types?" can lead to a much deeper understanding of how everything works behind the scenes. While learning, I have been spending time with Databricks notes from BricksNotes. They have helped me stay focused on understanding one concept at a time instead of jumping between dozens of resources. One lesson has become very clear: Understanding a concept is far more valuable than simply recognizing its name. What basic Databricks concept turned out to be more important than you expected? #Databricks #DataEngineering #PySpark #BigData #LearningInPublic #DataCommunity #BricksNotes
24
Day 15/90 ✅ Started learning Pandas today! 📊 Explored Series, DataFrames, shape, columns, info() & describe(). Every day = 1 step closer to becoming a Data Analyst 🚀 #Python #Pandas #DataScience
1
8
Guys who don't understand how genes actually work and think it's just a soul but chemical and that you can mathematically deduce how much your Chemical Soul is or isn't being deformed by society using the Pandas Dataframes library in Python
45
Replying to @Nickjagger007
I love this shift to DuckDB for handling large DataFrames! The automatic parallelization is a game changer. Have you noticed specific scenarios where the performance leap is most dramatic?
1
43
Stop using Pandas groupby on large DataFrames. Run SQL directly on them with DuckDB instead. Same variable, no database setup, 10x faster Pandas groupby often crashes. DuckDB finishes in seconds. Vectorized columnar execution, parallelized across all your CPU cores automatically.
1
1
197
One thing I have learned while studying Databricks is that more learning does not always mean more progress. When I started, I tried to learn everything at once. Spark, PySpark, Delta Lake, SQL, Workflows, Clusters, DataFrames, and Medallion Architecture. Every day felt like a race to cover more topics. The problem was that I was consuming a lot of information but understanding very little. I could recognize concepts, but I struggled to connect them together. A few weeks into the journey, I decided to slow down. Instead of asking, "How many topics can I finish today?" I started asking, "What concept do I understand better today than I did yesterday?" That small change made a huge difference. Reading a simple CSV file helped me understand DataFrames. DataFrames helped me understand transformations. Transformations helped me understand how data moves through a pipeline. For the first time, learning started to feel connected instead of overwhelming. BricksNotes has been helpful during this process because it keeps me focused on understanding one step at a time instead of jumping between random resources. The biggest lesson so far? Understanding beats memorization every single time. What is one Databricks concept that took you longer than expected to truly understand?
27
✅WEEK 13 Day 3 - Pandas - Extracting and working with dates - Merge and Join DataFrames #DataEngineering #DataScience #DataAnalytics #LockedIn #LearningInPublic #Python
10
Day 13/60 #60DaysOfLearning2026 Worked with DataFrames, handled missing values, transformed data, created new columns, used GroupBy, and built simple reports from a dataset. Getting more comfortable with data. @lftechnology #LSPPDay13 #LearningWithLeapfrog #DataEngineering
6
117
just discovered Danfo.js if you've ever wished pandas existed in JavaScript, it pretty much does it makes data cleaning, analysis, and manipulation way more pleasant without leaving the JS ecosystem. the no. of times I've switched to python just for dataframes is embarrassing.
1
31
Allan Avendaño retweeted
📊Visualiza y explora tus datos de manera sencilla con esta librería de Python. PyGWalker convierte tus Dataframes en una UI interactiva y crea gráficos fácilmente sin escribir código. → Simplifica el EDA → Análisis Drag&Drop → Código abierto 🔗 github.com/Kanaries/pygwalke…
2
17
76
4,177
saturn_mystic retweeted
Transforms Python DataFrames into interactive GPU-powered graphs github.com/graphistry/pygrap…
2
15
95
4,881
🚀 Day 47/100 of #100DaysOfCode Today, I completed the Pandas lecture series from the Prime AI/ML Batch taught by Shradha Ma’am 📊🐼 What I learned today: ✅ Data concatenation on DataFrames ✅ Merging DataFrames using: • Left Join • Right Join • Inner Join • Outer Join ✅ Plotting data using Pandas for visualization Enjoying the journey of exploring data manipulation and analysis step by step! 💡 #Python #Pandas #DataScience #MachineLearning #AI #CodingJourney
2
10