The course objective is for students to immerse themselves in the role of a data engineer and acquire the essential skills needed to work with various tools and databases to design, deploy, and manage structured and unstructured data.
By the end of this Professional Certificate, students will be able to explain and perform the critical tasks required in a data engineering role. Learners will use the Python programming language and Linux/UNIX shell scripts to extract, transform and load (ETL) data, work with Relational Databases (RDBMS), query data using SQL statements, and use NoSQL databases and unstructured data. Learners will be introduced to Big Data, work with Big Data engines like Hadoop and Spark, and gain experience creating Data Warehouses and utilizing Business Intelligence tools to analyze and extract insights.
Each module includes numerous hands-on labs & projects to apply the concepts and skills learned. The program culminates in a Capstone Project, bringing together all of these skills to develop and implement an entire data platform with various data repositories and pipelines to address a real-world inspired data analytics problem.
This program does not require any prior data engineering or programming experience.