
How to Install Apache Airflow?

Last Updated : 23 Nov, 2022

Apache Airflow is an open-source platform for developing, scheduling, and monitoring batch-oriented workflows. Because workflows are defined in Python, Airflow can be extended to integrate with virtually any technology. A web interface lets you manage and monitor your workflows. Airflow can be deployed in many ways, from a single process running on a laptop to a distributed setup that handles very large data flows.

Why Choose Airflow?

Airflow is a batch workflow orchestration platform. If your workflows have a clear start and end and run at regular intervals, you can express them as Airflow DAGs (directed acyclic graphs of tasks). The framework can also be extended easily to connect to new technologies.
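
To make the idea of a DAG concrete, here is a minimal sketch in plain Python. It does not use the Airflow API; the task names are hypothetical, and the standard-library graphlib module (Python 3.9+) stands in for the scheduler's job of working out a valid run order from task dependencies.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+

# Hypothetical pipeline: each task maps to the set of tasks it depends on.
dependencies = {
    "extract": set(),
    "transform": {"extract"},
    "load": {"transform"},
    "report": {"load"},
}


def run_order(deps):
    """Return one valid execution order for an acyclic dependency graph."""
    return list(TopologicalSorter(deps).static_order())


order = run_order(dependencies)
print(order)  # ['extract', 'transform', 'load', 'report']
```

If the dependencies contained a cycle, TopologicalSorter would raise graphlib.CycleError, which is why a workflow has to be acyclic to be schedulable.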

Features:

  1. Easy to Use: If you know the basics of Python, Airflow is easy to pick up.
  2. Open Source: Airflow is free, open-source software with a large user community.
  3. Version Rollback: Because workflows are defined as code, previous versions can be restored using version control.
  4. Integrations: It provides ready-to-use operators for working with Google Cloud Platform, Amazon AWS, Microsoft Azure, etc.
  5. User Interface: The web UI lets you track the status of your workflows and manage them with ease.

Advantages:

  1. Airflow's execution model is built around time-based schedules.
  2. You can choose from a variety of operators to build a pipeline.
  3. The Apache Airflow UI lets you check DAG status, runtimes, and logs.
  4. Pipelines can keep stored raw data separate from processed data, which keeps the raw input immutable.
  5. Tasks are meant to be idempotent: running a task again with the same input produces the same output.
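
The idempotence point can be illustrated with a small sketch in plain Python (not Airflow code). The function name, the file layout, and the run date are assumptions made for the example: the task writes its output to a file keyed by the run date and overwrites it on every run, so a rerun leaves exactly the same result instead of appending duplicates.

```python
import json
import tempfile
from pathlib import Path


def write_partition(out_dir, run_date, rows):
    """Write rows for one run date, overwriting any earlier output for that date."""
    target = Path(out_dir) / f"{run_date}.json"
    target.write_text(json.dumps(sorted(rows)))
    return target


out_dir = tempfile.mkdtemp()
# Running the same task twice for the same date yields identical output.
first = write_partition(out_dir, "2022-11-23", [3, 1, 2]).read_text()
second = write_partition(out_dir, "2022-11-23", [3, 1, 2]).read_text()
print(first == second)  # True
```

Overwriting a date-keyed output, rather than appending to a shared file, is one common way to make retries and backfills safe.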

Disadvantages:

  1. Writing test cases for pipelines that work on raw data is extremely difficult.
  2. Changing the schedule requires renaming your DAG.
  3. Running Airflow natively on Windows is not straightforward.

Installation for Apache Airflow:

To install Apache Airflow, you need pip installed first.

Step 1: Install pip. If pip is already installed, move to Step 2.

$ sudo apt-get install python3-pip

Step 2: Set the Airflow home directory, where Airflow keeps its configuration and logs

$ export AIRFLOW_HOME=~/airflow
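
As a rough illustration of how this variable is used, the following Python sketch resolves the home directory the same way the export above implies: it reads AIRFLOW_HOME and falls back to ~/airflow, which is Airflow's default when the variable is unset. The helper name airflow_home is hypothetical and not part of Airflow.

```python
import os
from pathlib import Path


def airflow_home(env=None):
    """Resolve the Airflow home directory from an environment mapping."""
    env = os.environ if env is None else env
    # Fall back to ~/airflow when AIRFLOW_HOME is not set.
    raw = env.get("AIRFLOW_HOME", "~/airflow")
    return Path(raw).expanduser()


print(airflow_home())
```

Running it before and after the export is a quick way to confirm which directory a new shell session will use.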

Step 3: Install Apache Airflow using pip

$ pip3 install apache-airflow
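
An unpinned install takes whatever dependency versions are newest at the time. The Airflow project's installation documentation recommends pinning the Airflow version and passing a matching constraints file instead. The sketch below only builds that pip command as a string; the helper name is hypothetical, and the version 2.4.3 is an example rather than a recommendation.

```python
import sys

# URL pattern of the constraints files published in the apache/airflow repository.
CONSTRAINTS_URL = (
    "https://raw.githubusercontent.com/apache/airflow/"
    "constraints-{airflow}/constraints-{python}.txt"
)


def pip_install_command(airflow_version, python_version=None):
    """Build a pinned pip command that uses the matching constraints file."""
    if python_version is None:
        # Default to the major.minor version of the running interpreter.
        python_version = f"{sys.version_info.major}.{sys.version_info.minor}"
    url = CONSTRAINTS_URL.format(airflow=airflow_version, python=python_version)
    return f'pip3 install "apache-airflow=={airflow_version}" --constraint "{url}"'


print(pip_install_command("2.4.3", "3.8"))
```

Constraints files exist only for the Python versions that a given Airflow release supports, so check the release notes before choosing the pair.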


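Step 4: Initialize the backend database that Airflow uses to keep track of workflows. On Airflow 2.x the command is airflow db init (releases from 2.7 onward prefer airflow db migrate); the older airflow initdb form works only on Airflow 1.x.

$ airflow db init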

Step 5: Start the web server, which serves the Airflow user interface, on port 8080

$ airflow webserver -p 8080

Step 6: Start the Airflow scheduler, which triggers and monitors workflows

$ airflow scheduler
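
Once the web server and scheduler are running, you can check that both are up. The sketch below assumes the web server is reachable at http://localhost:8080 and that its /health endpoint returns JSON with "metadatabase" and "scheduler" entries that each carry a "status" field; the sample payload is made up for illustration, and the function names are hypothetical.

```python
import json
from urllib.request import urlopen

# Components that must report "healthy" for the install to count as working.
REQUIRED = ("metadatabase", "scheduler")


def is_healthy(payload):
    """True when every required component reports status 'healthy'."""
    return all(
        payload.get(name, {}).get("status") == "healthy" for name in REQUIRED
    )


def fetch_health(base_url="http://localhost:8080", timeout=5):
    """Fetch and decode the health JSON from a running web server."""
    with urlopen(f"{base_url}/health", timeout=timeout) as response:
        return json.load(response)


# Made-up sample payload, so the check can be tried without a server.
sample = {
    "metadatabase": {"status": "healthy"},
    "scheduler": {"status": "healthy"},
}
print(is_healthy(sample))  # True
```

With the services running, is_healthy(fetch_health()) performs the same check against the live instance.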

devgarg05