Skip to content

Latest commit

 

History

History
56 lines (50 loc) · 1.61 KB

README.md

File metadata and controls

56 lines (50 loc) · 1.61 KB

Big Data Analysis with Apache Hive

These are the exercise files used for Big Data Analysis with Apache Hive course.

The course outline can be found in

https://www.tertiarycourses.com.sg/big-data-analysis-apache-hive.html

https://www.tertiarycourses.com.my/big-data-analysis-with-apache-hive-malaysia.html

Module 1: Get Started on Apache Hive

  • What is Hive?
  • How Hive Works with Hadoop
  • Install CDH on VirtualBox
  • Hue 4 UI Overview

Module 2: Basic Hive Operations

  • Create and Drop Database
  • Create and Drop Table
  • Create Table from CSV File
  • Alter Table
  • Fix CSV File with Serde
  • Load Data to Empty Table
  • Partition Tables

Module 3: HiveQL

  • Retrieve Data with SELECT
  • SELECT Options
  • Operators and Built In Functions
  • Filter data with WHERE

Module 4: Aggregating Data

  • Hive Aggregations
  • Having
  • Grouping Sets
  • Cube & Rollup

Module 5: Joining Tables

  • Combining Tables with JOIN
  • EXIST & SEMI JOIN
  • Joining Multiple Tables

Module 6: Data Analysis with Apache Hive

  • Math Functions
  • String Functions
  • Date Functions
  • Conditional Statements