Talend Big Data Basics

17 September 2019 | Kuala Lumpur
Book Your Seat Today!

Send us your company details and one of our consultants will contact you as soon as possible.

Course Objectives

After completing this course, you will be able to:

  • Create a Big Data batch Job using the Spark framework
  • Copy data from a local file to HDFS
  • Copy data from MySQL to HDFS
  • Create a Hive table and copy data from HDFS to it
  • Import tweets to HDFS
  • Join, sort, and aggregate data
  • Use caches for faster processing
  • Query data from a Hive table using HiveQL
  • Query data from Spark datasets using Spark SQL

Target Audience

Anyone who wants to use Talend Studio to interact with Big Data systems

Training Outline

Introduction to Spark
  • Monitoring the Hadoop cluster
  • Setting up the development environment
  • Understanding the basics of Spark
  • Analyzing customer data
Sentiment Analysis Use Case
  • Monitoring the Hadoop cluster
  • Setting up the development environment
  • Loading tweets into HDFS
  • Processing tweets with Spark
  • Scheduling job execution
Download Analysis Use Case
  • Setting up the development environment
  • Loading customers to Hive
  • Analyzing downloads
  • Using Spark SQL to query data

Prerequisite

Completion of Talend Big Data Basics