Course Description

In this course, students are introduced to the types of data commonly used in Artificial Intelligence and Machine Learning problems. Students apply the standard data extraction, transformation, and loading (ETL) processes used in AI applications, then construct a data ingestion pipeline using a data warehouse and prepare data for a predefined Machine Learning problem.

Course Outline

This course will cover the following topics:

  • Data Extraction
    • Understand the full scope of ETL (Extract, Transform, Load) for a data ingestion pipeline
    • Data Migration
    • Data Combination
    • Structured Data
      • Full Extraction
      • Incremental Extraction
    • Unstructured Data focusing on data preparation
    • How to address common issues in the extraction process
    • Highlight data extraction tools:
      • Batch processing tools
      • Open Source Tools
      • Cloud-Based Tools
  • Data Transformation
    • Identify Variable Types
    • Feature Transformation
    • Power Transform
    • Difference Transform
    • Standardization
    • Normalization
    • Binning
  • Data Loading
    • Challenges of Data Loading
    • Highlight data loading tools:
      • Batch processing tools
      • Open Source Tools
      • Cloud-Based Tools
  • Google BigQuery, Snowflake, and Amazon Redshift or S3
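
To give a flavor of the Data Transformation topics listed above, here is a minimal Python sketch of standardization, normalization, binning, and a difference transform. It is an illustration only, not course material: the function names and the sample `ages` list are hypothetical, and it uses only the Python standard library rather than any tool the course may teach.

```python
from statistics import mean, pstdev


def standardize(values):
    """Standardization: rescale to zero mean and unit (population) standard deviation."""
    mu = mean(values)
    sigma = pstdev(values)
    if sigma == 0:
        # All values are identical, so every z-score is zero.
        return [0.0 for _ in values]
    return [(v - mu) / sigma for v in values]


def normalize(values):
    """Normalization (min-max): rescale values into the range [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def bin_equal_width(values, n_bins):
    """Binning: assign each value to one of n_bins equal-width intervals (0-based index)."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0 for _ in values]
    width = (hi - lo) / n_bins
    # The maximum value would land in bin n_bins, so clamp it into the last bin.
    return [min(int((v - lo) / width), n_bins - 1) for v in values]


def difference(values, lag=1):
    """Difference transform: subtract the value `lag` steps earlier from each value."""
    return [values[i] - values[i - lag] for i in range(lag, len(values))]


# Hypothetical sample data, for illustration only.
ages = [18, 22, 35, 41, 60]

z_scores = standardize(ages)
scaled = normalize(ages)
bins = bin_equal_width(ages, 3)
deltas = difference(ages)
```

In practice these steps are usually done with a data library or inside the data warehouse itself; plain Python is used here only to keep the arithmetic visible.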

Learner Outcomes

This is a comprehensive foundation course for the fields of data science, artificial intelligence, and machine learning. The ETL knowledge and skills learned in this course are transferable to all other TIF (The Intelligence Factory) courses and certifications, and they prepare the student with industry-relevant skills and tools to understand and manipulate various types of data.

Successful completion of this course entitles the student to The Intelligence Factory Skillset Certification in the following areas:

  • Data Pipelines
  • SQL

This course also provides the student with the skills and concepts needed to complete the TIF Certification One for a Machine Learning Engineer.

Prerequisites

Basic knowledge of Python programming.

Duration

30 Hours | 5 days or 10 nights

Thank you for your interest in this course. Unfortunately, the course you have selected is currently not open for enrollment. Please complete a Course Inquiry or call 314-977-3226 so that we may promptly notify you when enrollment opens.