Loading...

Course Description

Short Course | 12 hours | 1.2 CEUs | $1,315

Apache Airflow is a configuration-as-code OSS solution for workflow automation that is positioned as a replacement of cron-like scheduling systems. Written in Python, Airflow enables developers to programmatically author, schedule for execution, and monitor highly configurable complex workflows.

Course Outline

  • Chapter 1. Apache Airflow Introduction
    • A Traditional ETL Approach
    • Apache Airflow Defined
    • Airflow Core Components
    • The Component Collaboration Diagram
    • Workflow Building Blocks and Concepts
    • Airflow CLI
    • Main Configuration File
    • Extending Airflow
    • Jinja Templates
    • Variables and Macros
    • Summary
  • Chapter 2. Apache Airflow Web UI
    • Web UI - the Landing (DAGs) Page
    • Web UI - the DAG Graph View
    • Run Status Legends
    • The Pause Button (Trigger Latch)
    • The DAG Triggering/Job Checking Sequence
    • The Control Panel for a Task
    • Sample Log File Messages (Abridged for Space)
    • Summary
  • Chapter 3. Anatomy of a DAG and Scheduling
    • What is a DAG?
    • Scheduled and Manually Triggered DAG Runs
    • The DAG Object
    • Tasks
    • Task Lifecycle
    • Operators
    • Idempotent Operators
    • Operator Types
    • Airflow Common Operators
    • Specifying Dependencies
    • Associating Operators with a DAG
    • Associating Operators Using the "With DAG" Statement Example
    • Associating Operators with DAG Using the Operator's Constructor
    • The default_args Parameter
    • Passing DAG Parameters Through Web UI
    • DAG Run Scheduling
    • Examples of the schedule_interval Parameter
    • DAG Scheduling Nuances
    • Understanding The Backfill Process
    • Killing/Stopping DAG Runs
    • An XCom Messaging Example
    • Summary

Prerequisites

Participants should be familiar with Python syntax (or have a background in programming).

Duration

12 Hours | 2 Days or 4 Nights

Applies Towards the Following Certificates

Loading...
Enroll Now - Select a section to enroll in
Section Title
Data Pipelines and Orchestration with Apache Airflow
Type
Instructor-Led
Days
M, W
Time (Central Time)
5:30PM to 8:30PM
Dates
Oct 28, 2024 to Nov 06, 2024
Schedule and Location
# of Course Hours
12.0
Delivery Option
Course Fee(s)
Rate non-credit $1,320.00
Potential Discount(s)
Required fields are indicated by .
*Academic Unit eligibility to be determined by college/university in which you are enrolled in a degree seeking program.