Course Description

Data engineering is the aspect of data science that focuses on practical applications of data collection and analysis. A data engineer conceives, builds and maintains the data infrastructure that holds your enterprise’s advanced analytics capabilities together. This five-day Data Engineering Bootcamp training course is supplemented by hands-on labs that help attendees reinforce their theoretical knowledge of the material.

Course Outline

  • Chapter 1. Big Data for Data Engineers
  • Chapter 2. Defining Data Engineering
  • Chapter 3. Data Processing Phases
  • Chapter 4. Apache Hive
  • Chapter 5. Hive Command-line Interface
  • Chapter 6. Hive Data Definition Language
  • Chapter 7. HiveQL
  • Chapter 8. Hive Select Statement and Built-In Functions
  • Chapter 9. Introduction to Functional Programming
  • Chapter 10. Introduction to Apache Spark
  • Chapter 11. How Spark Works Visually
  • Chapter 12. The Spark Shell
  • Chapter 13. Spark RDDs
  • Chapter 14. Parallel Data Processing with Spark
  • Chapter 15. Shared Variables in Spark
  • Chapter 16. Introduction to Spark SQL


5 Days | 10 Nights

Thank you for your interest in this course. Unfortunately, the course you have selected is currently not open for enrollment. Please complete a Course Inquiry or call 314-977-3226 so that we may promptly notify you when enrollment opens.

*Academic Unit eligibility is determined by the college/university in which you are enrolled in a degree-seeking program.