Course

Course Summary
Credit Type:
Course
ACE ID:
STAT-0012
Organization:
Location:
Classroom-based
Length:
4 weeks (60 hours)
Dates Offered:
Credit Recommendation & Competencies
Level Credits (SH) Subject
Upper-Division Baccalaureate 3 Computer Science or Programming
Description

Objective:

The course objective is to introduce statisticians to Hadoop, and provide an exemplar workflow for using Hadoop, writing MapReduce jobs, and finally leveraging Hadoop Streaming to conclude work in an analytics programming language such as R.

Learning Outcomes:

  • Use Hadoop and the software components of the Hadoop ecosystem
  • Manage data on a distributed file system
  • Write MapReduce jobs to perform computations with Hadoop
  • Utilize Hadoop streaming to output jobs

General Topics:

  • Distributed computing environments
  • How to work with Hadoop
  • Computing with MapReduce
  • Hadoop workflows with R
Instruction & Assessment

Instructional Strategies:

  • Audio Visual Materials
  • Case Studies
  • Computer Based Training
  • Discussion
  • Lectures
  • Practical Exercises

Methods of Assessment:

  • Other
  • weekly homework assignments

Minimum Passing Score:

70%
Supplemental Materials