Python for Big Data

Course Content:

  • Introduction
  • Course Overview
  • Frequently Asked Questions
  • What is Spark? Why Python?
  • Set-up Overview
  • Note on Installation Sections
  • Local Installation VirtualBox Part 1
  • Local Installation VirtualBox Part 2
  • Setting up PySpark
  • AWS EC2 Set-up Guide
  • Creating the EC2 Instance
  • SSH with Mac or Linux
  • Installations on EC2
  • Databricks Setup
  • AWS EMR Setup
  • Introduction to Python Crash Course
  • Jupyter Notebook Overview
  • Python Crash Course Part One
  • Python Crash Course Part Two
  • Python Crash Course Part Three
  • Python Crash Course Exercises
  • Python Crash Course Exercise Solutions
  • Introduction to Spark DataFrames
  • Spark DataFrame Basics
  • Spark DataFrame Basics Part Two
  • Spark DataFrame Basic Operations
  • Groupby and Aggregate Operations
  • Missing Data
  • Dates and Timestamps
  • DataFrame Project Exercise
  • DataFrame Project Exercise Solutions
  • Introduction to Machine Learning and ISLR
  • Machine Learning with Spark and Python with MLlib
  • Linear Regression Theory and Reading
  • Linear Regression Documentation Example
  • Regression Evaluation
  • Linear Regression Example Code Along
  • Linear Regression Consulting Project
  • Linear Regression Consulting Project Solutions
  • Logistic Regression Theory and Reading
  • Logistic Regression Example Code Along
  • Logistic Regression Code Along
  • Logistic Regression Consulting Project
  • Logistic Regression Consulting Project Solutions
  • Tree Methods Theory and Reading
  • Tree Methods Documentation Examples
  • Decision Tress and Random Forest Code Along Examples
  • Random Forest – Classification Consulting Project
  • Random Forest Classification Consulting Project Solutions
  • K-means Clustering Theory and Reading
  • KMeans Clustering Documentation Example
  • Clustering Example Code Along
  • Clustering Consulting Project
  • Clustering Consulting Project Solutions
  • Introduction to Recommender Systems
  • Recommender System – Code Along Project
  • Introduction to Natural Language Processing
  • NLP Tools Part One
  • NLP Tools Part Two
  • Natural Language Processing Code Along Project
  • Introduction to Streaming with Spark!
  • Spark Streaming Documentation Example
  • Spark Streaming Twitter Project – Part
  • Spark Streaming Twitter Project – Part Two
  • Spark Streaming Twitter Project – Part Three
  • Bonus Lecture: Coupons
Enquire Now