Filter
Exclude
Time range
-
Near
Apache Airflow 3.2 is here, bringing partitioned Dag runs and asset events, async Python support for @ task and PythonOperator, and UI theming. This quick notes guide comes with code examples for every new feature to use as patterns in your own Dags. Download the guide to learn how to: ↔️ Pass timestamps between Dags scheduled based on assets without custom workarounds ⚡ Cut task runtime by running concurrent async API calls in a single @ task 🎨 Flag critical production deployments by adjusting the colors in the Airflow UI using an Airflow configuration variable Link below.
4
1
6
310
21 Days Roadmap to Apache Airflow Basics - Days 1–7: Airflow Fundamentals (1 hour daily) > Install Airflow locally (using Docker or pip). > Learn Airflow concepts: DAGs, Tasks, Operators. >Explore the Airflow UI, scheduler, and logs. - Days 8–14: Building DAGs (1 hour daily) > Write simple Python DAGs. > Use built-in operators (BashOperator, PythonOperator). > Learn task dependencies and scheduling with CRON/intervals. >Experiment with XComs (data sharing between tasks). - Days 15–21: Real-World Integrations (1 hour daily) > Connect Airflow to a database (Postgres/MySQL). > Build ETL DAG: extract (API/CSV) → transform (Python) → load (DB). > Intro to sensors, hooks, and custom operators. > Learn the basics of monitoring & retries. - Throughout: Note every project idea (e.g., automate daily data load, schedule ETL, run API → DB pipelines) - Day 22: Pick your first project and build > (e.g., daily pipeline that fetches weather API data → cleans with Pandas → loads into Postgres → sends Slack/Email notification).
2
24
149
7,912
💡Airflow Operators: The Building Blocks In Airflow, tasks are created using Operators. They’re the “actions” your workflow performs. Here are the common ones for data engineers: ◉BashOperator → run bash commands. ◉ PythonOperator → run Python functions. Example:
2
3
9
210
Day 91 – August 3, 2025 Explored Airflow DAGs today. 🛠️ Built a simple pipeline with: DummyOperator PythonOperator Dependencies & retries Getting comfortable with scheduling workflows like a pro. #DataEngineering #Airflow #91DaysOfCode
2
22
17 Jul 2025
Are you looking for a drama-free way to run your Airflow pipelines? In a new example below, we show how 📦Kubetorch🔥 cleanly separates the concerns of development from orchestration, so you never need to get stuck push-and-pray debugging in Airflow again. You can iterate and debug your application in local Python with Kubetorch's live dispatch to cloud hardware, and then simply run that Python code as-is in Airflow with PythonOperator in CI or production. The Airflow node will run your code identically, on identical cloud hardware, as you ran and debugged it locally, so you'll never need to rely on Airflow as your debugger again. If you need to debug something that's breaking in production, or simply experiment with a change to a production pipeline, you can just edit and run that same Python code locally. It's a totally closed research-to-production-to-research loop. This example demonstrates how to use Kubetorch to train a PyTorch model on a remote GPU, and then schedule that training for recurring automation with Airflow. If you already have a bunch of Airflow pipelines, you'll see that it takes minimal code changes to restore the development and debugging experience in any existing pipeline. It's not just Airflow; the same structure can be used to improve development velocity, research-to-production, and fault tolerance with any pipeline orchestrator (e.g. Argo, Dagster, Prefect, Flyte) without requiring any direct integration. Take a look and get in touch if you'd like to try it out yourself! run.house/examples/pytorch-m…
2
2
347
Understand Airflow For Data Engineering (Quick Guide📝) 📌 What is Apache Airflow? As a Data Engineer, one of the tasks you perform is to build a data pipeline now for that you can write simple Python scripts or use enterprise tools. Simple Python script with cron job is enough for a few pipelines but what if you have 100s of them? Hard to manage!!! This is where Airflow Comes into the picture 💡 It's an open-source tool for managing data pipelines, you can build, schedule, and monitor workflows. In one place you can manage everything! There are a few components you need to understand. 📌 DAG (Directed Acyclic Graph): At the heart of Airflow is the DAG, which defines a collection of tasks and their dependencies in a specific order. This is a core computer science concept. Think of it as a blueprint of your workflow, ensuring that tasks run in the sequence. 👉🏻 Directed: Tasks move in a certain direction. 👉🏻 Acyclic: No loops! Tasks don't run in circles. 👉🏻 Graph: A visual representation of the tasks. 📌 What is a Task? It is basically where you write your logic, such as reading data, transforming it, or writing it. Each task runs independently, in its own process. To create a task we need to use Operators. 📌 What are Operators? There are many different operators you can use for a specific task. They determine WHAT gets done. 👉🏻 BashOperator: Executes a bash command. 👉🏻 PythonOperator: Executes a Python function. 👉🏻 PostgresOperator: Executes SQL on a Postgres database. and many more! 📌 Executor: Determines HOW tasks are run. There are several types: 👉🏻 SequentialExecutor: Runs tasks sequentially. 👉🏻 LocalExecutor: Runs tasks in parallel on a single machine. 👉🏻 CeleryExecutor: Distributes tasks across multiple machines. 📌 Scheduler: The brain behind when your tasks run. It checks the DAGs and sends them to the executor to see if they have tasks to run. 📌 Web Server: A friendly UI to monitor and manage your DAGs. You can check task logs, rerun tasks, and visualize task dependencies. Let me know if you found this helpful 👇🏻 You can watch an overview of Airflow here - youtu.be/5peQThvQmQk
6
75
393
27,048
23 Mar 2024
Replying to @oscarnazhan
Not yet broo. Semalam aku duk study beza taskflow dengan pythonoperator je. For now aku nampak lagi senang kalau dbt job tu run guna bashoperator jelah dah benda tu command je. Taskflow tak sure.
2
83
22 Mar 2024
PythonOperator executes Python functions directly (lower-level), while TaskFlow offers higher-level constructs for managing workflows. Ultimately, you should choose whichever based on your project's needs, or consider combining them for more complex and flexible workflows.
159
22 Mar 2024
2 method of defining job/task in Apache Airflow. Both resulting the same output - calling API and printing the result. Pythonoperator is a built in module to execute Python function, while Taskflow API provides a pythonic structure approach.
4
2
30
2,121
Understand Airflow For Data Engineering (Quick Guide📝) 📌 What is Apache Airflow? As a Data Engineer, one of the tasks you perform is to build a data pipeline now for that you can write simple Python scripts or use enterprise tools. Simple Python script with cron job is enough for a few pipelines but what if you have 100s of them? Hard to manage!!! This is where Airflow Comes into the picture 🚀 💡 It's an open-source tool for managing data pipelines, you can build, schedule, and monitor workflows. In one place you can manage everything! You can watch the detailed video here: There are a few components you need to understand. 📌 DAG (Directed Acyclic Graph): At the heart of Airflow is the DAG, which defines a collection of tasks and their dependencies in a specific order. This is a core computer science concept. Think of it as a blueprint of your workflow, ensuring that tasks run in the sequence. 👉🏻 Directed: Tasks move in a certain direction. 👉🏻 Acyclic: No loops! Tasks don't run in circles. 👉🏻 Graph: A visual representation of the tasks. 📌 What is a Task? It is basically where you write your logic, such as reading data, transforming it, or writing it. Each task runs independently, in its own process. To create a task we need to use Operators. 📌 What are Operators? There are many different operators you can use for a specific task. They determine WHAT gets done. 👉🏻 BashOperator: Executes a bash command. 👉🏻 PythonOperator: Executes a Python function. 👉🏻 PostgresOperator: Executes SQL on a Postgres database. and many more! 📌 Executor: Determines HOW tasks are run. There are several types: 👉🏻 SequentialExecutor: Runs tasks sequentially. 👉🏻 LocalExecutor: Runs tasks in parallel on a single machine. 👉🏻 CeleryExecutor: Distributes tasks across multiple machines. 📌 Scheduler: The brain behind when your tasks run. It checks the DAGs to see if they have tasks to run and sends them to the executor. 📌 Web Server: A friendly UI to monitor and manage your DAGs. You can check task logs, rerun tasks, and visualize task dependencies. Let me know if you found this helpful 👇🏻
8
98
580
113,421
6. Operators There are many different operators you can use for a specific task They determine WHAT gets done. 1. BashOperator: Executes a bash command. 2. PythonOperator: Executes a Python function. 3. PostgresOperator: Executes SQL on a Postgres database. and many more!
1
1
13
3,003
The TaskFlow API was introduced in Apache Airflow 2.0 and is a wonderful alternative to PythonOperator, writes Dennis Ferruzzi of @awscloud. He looks at a few use cases where TaskFlow excels and how it compares to writing a DAG using the traditional PythonOperator.
1
4
2,066
In python, operator overloading is used to perform specific operations on the given operands. Also, operations that any p..... tutorialwithexample.com/pyth… #tutorialwithexample #learning #python #PythonOOP #100DaysOfCode #pythonoperator #100DaysOfCodeChallenge #operatoroverloading
2
2
There are 7 types of operators in Python. Arithmetic Operators Comparison Operators Assignment Operators Logical Operators Bitwise Operators... tutorialwithexample.com/pyth… #learnpython #python #tutorialwithexample #pythonoperator #100DaysOfCode #100DaysOfCodeChallenge
1
4
4
Arrêtez tout, Airflow 2.0 est disponible :D - Scheduler HA, - nouvelle API pour les PythonOperator (TaskFlow), - API Rest stable, - TaskGroups (fini les subdags \o/), - améliorations de l'UI, - et un tas d'autres nouveautés ! #Airflow @ApacheAirflow airflow.apache.org/blog/airf…

1
3
リンク先の方法がどうしても動かなくて、結局PythonOperator使ってgoogle-cloud-automlから叩くようにしたらすんなり動いた。 qiita.com/shuhoyo/items/af79…

3
AirflowでシンプルなHTTPリクエスト送るDAGを作るのに結構ハマったりする。シンプルなものであれば、おすすめはHttpOperatorではなく、PythonOperatorを使うところ。この辺りを詳しく書いておいた。 qiita.com/munaita_/items/e33…

2
Airflowでhttpリクエスト Airflow で httpリクエスト作りたい場合はSimpleHttpOperator が思い浮かぶ。 ただし、これを使う場合、環境変数 or UIからConnection経由でhostを登録する必要がある。 シンプルにDagコードだけで完結したい場合はPythonOperatorで実装する。
2
1 Jan 2019
one of the neat things is dynamic dags. For example, you can write stuff to a file. then a pythonOperator can read the contents of a file and generate new tasks based on the contents of the file. It's quite powerful.
1
1
2