Talend Big Data Advanced – Machine Learning

Available Upon Request
Book Your Seat Today!

Please send us your company details, and our consultant will contact you as soon as possible.

Course Objectives

After completing this course, you will be able to:

  • Connect to a Hadoop cluster from a Talend Job
  • Use context variables and metadata
  • Read and write files in HDFS in a Big Data Batch Job
  • Configure a Big Data Batch Job to use the Spark framework
  • Create and test recommendation models
  • Create and test classification models
  • Use a machine learning algorithm to deduplicate data

Target Audience

Anyone who wants to use Talend Studio to industrialize machine learning algorithms

Training Outline

SMS Classification Use Case
  • Monitoring the Hadoop cluster
  • Exploring an SMS classification use case: decision trees
  • Connecting to your cluster
  • Creating an SMS classification model
  • Testing the SMS classification model
Movie Recommendation Use Case
  • Exploring a movie recommendation use case: alternating least squares
  • Building a movie recommendation model
  • Testing the movie recommendation model
Iris Classification Use Case
  • Exploring an Iris flower classification use case: Naïve Bayes classifier
  • Building an iris classification model
  • Testing the iris classification model
Child-Care Deduplication Use Case
  • Exploring a child-care use case and dataset: matching
  • Setting up the environment
  • Pairing data
  • Building a matching model
  • Using the matching model
  • Merging groups of duplicates


Prerequisites

Completion of Talend Data Quality Essentials or Talend Big Data Basics