transitioning from data analyst ➀ data engineer | contributor at @TDataScience | #learninginpublic

Joined February 2022
21 Photos and videos
Pinned Tweet
I’m learning Data Engineering from scratch as a Data Analyst. So I mapped out a 12-month self-study roadmap to guide the transition. I just published it here @TDataScience You can read it here towardsdatascience.com/from-…
2
3
21
3,197
I built my first ETL pipeline as a complete beginner. No Airflow. No Spark. No cloud infrastructure. Just Python, pandas, and a GitHub API. Recently shared the full story in @TDataScience. Here's exactly what I learned πŸ§΅πŸ‘‡
1
1
35
12/ If you're trying to break into data engineering: Stop waiting for the perfect roadmap. Pick a small project. Build something ugly. Finish it. Your first pipeline won't be impressive. But it will be the project that teaches you the most. Read the full article below πŸ‘‡
1
15
Ibrahim Salami (Ibby) retweeted
2026 Data Engineering Roadmap πŸ“Š Stage 1: SQL Stage 2: Python Stage 3: Data Modeling Stage 4: ETL Pipelines Stage 5: PostgreSQL Stage 6: Spark Stage 7: Airflow Stage 8: Data Warehousing Stage 9: Cloud (AWS/GCP/Azure) Stage 10: Streaming Systems Stage 11: Data Quality Monitoring Stage 12: Production Pipelines Data scientists get the spotlight. Data engineers move the data.
10
22
211
6,622
Data Engineering work is mostly ETL (Extract, Transform, Load). So to learn about ETL, I decided to build a basic ETL pipeline that extracts data from GitHub repositories and saves it as CSV Read about the entire process @TDataScience
Using the GitHub API, @ibbysalam shows us the steps he took to build an extract, transform, load data pipeline from scratch β€”Β and as a complete beginner. towardsdatascience.com/i-bui…
3
25
4,727
Ibrahim Salami (Ibby) retweeted
πŸš€ Complete Data Engineering Roadmap 🧠 STEP 1: Learn Programming Basics βœ” Start with Python βœ” Variables, Functions & Loops βœ” OOP Concepts βœ” APIs & JSON Handling πŸ“Š STEP 2: Master SQL βœ” SELECT & WHERE βœ” JOINS & GROUP BY βœ” Window Functions βœ” CTEs & Subqueries βœ” Query Optimization πŸ›  Databases to Learn: βœ” MySQL βœ” PostgreSQL βœ” MongoDB ⚑ STEP 3: Learn Data Warehousing βœ” ETL vs ELT βœ” Data Pipelines βœ” Star & Snowflake Schema βœ” Batch Processing βœ” Data Modeling πŸ›  Tools to Learn: βœ” Snowflake βœ” Amazon Redshift βœ” BigQuery ☁️ STEP 4: Learn Big Data Technologies βœ” Distributed Systems βœ” Parallel Processing βœ” Streaming Data βœ” Real-Time Analytics πŸ›  Technologies to Learn: βœ” Apache Spark βœ” Hadoop βœ” Apache Kafka πŸ”„ STEP 5: Learn Data Pipelines & Orchestration* βœ” Workflow Scheduling βœ” Data Automation βœ” Monitoring Pipelines βœ” Error Handling πŸ›  Tools to Learn: βœ” Apache Airflow βœ” dbt βœ” Prefect ☁️ STEP 6: Learn Cloud Platforms βœ” Cloud Storage βœ” Data Lakes βœ” Serverless Processing βœ” Cloud Security Basics πŸ›  Platforms to Learn: βœ” AWS βœ” Microsoft Azure βœ” Google Cloud Platform πŸ›  STEP 7: Learn DevOps for Data Engineering βœ” Version Control βœ” CI/CD Basics βœ” Containerization βœ” Deployment Automation πŸ›  Tools to Learn: βœ” Git βœ” Docker βœ” Kubernetes πŸ”₯ STEP 8: Build Real Projects* βœ” ETL Pipeline Project βœ” Real-Time Data Streaming βœ” Sales Data Warehouse βœ” Data Lake Project βœ” Analytics Dashboard Backend πŸ’‘ The best way to become a Data Engineer: πŸ‘‰ Learn SQL β†’ Build Pipelines β†’ Work with Big Data β†’ Deploy on Cloud Data Engineering Resources: whatsapp.com/channel/0029Vao… πŸ’¬ Tap ❀️ if this helped you!
12
34
172
10,769
Happy to see a lot of people are resonating with this article. Can't wait to see where this journey takes me. If you're looking to break into or transition to data engineering. This is a good read
If you're considering a role change, or curious about the path to become a data engineer, don't miss @ibbysalam's new series on his own journey, covering the tools and resources he'll rely on and the many twists and turns he's bound to face. towardsdatascience.com/from-…
1
2
73
Ibrahim Salami (Ibby) retweeted
"It is tempting to treat hybrid search as something you can tune once: pick a merge algorithm, choose a lexical/semantic weight, and ship it - but there is no globally correct merge strategy" hornet.dev/blog/100m-doc-sea…

2
8
61
3,311