Talend training, implementing data integration version 2022

Presentation

Talend Open Studio (TOS) is an open source ETL application that enables massive synchronization of information from one database to another. This course will teach you how to design, model and develop TOS jobs in order to deal with various issues.

Pedagogical objectives

Upon completion of the training, the participant will be able to:
  • Design and develop jobs in the Talend ETL application
  • Optimize the jobs developed by using contexts and data sets
  • Perform more complex transformations using variables, expressions and joins
  • Run and debug a job, trace execution statistics

Target public

Developers, project managers, business intelligence consultants, database administrators.

Requirements

Price

  • 2000€ HT per person.
  • Good knowledge of RDBMS and SQL. Knowledge of Java is a plus.

Training program

  • Data integration. ETL solutions.
  • TOS: installation, user preferences. Reference documentation.
  • Product philosophy. Job design.
Practical works
Getting to grips with the tool. Presentation of the specifications of the red thread project.

Model the need, design a first job

  • Business Modeler. Job Designer.
  • Main connections. CSV and XML components.
  • Simple transformation components.
  • View generated code, run a job.
Exercise
Development of a job ensuring the sorting of a CSV source, the filtering of data and the storage of the result in an XML file.
  • Configure reusable connections using metadata.
  • Update metadata and propagate it to jobs, import/export metadata.
  • Set up jobs with contexts.
  • Externalize context variables in ".properties" and ".ini" files.
  • Create and manage your own variables.
  • Generate data sets for testing.
Exercise
Refactor a job using metadata and contexts. Generate a test data set for this job.
 

Working with databases

  • Supported databases and main components.
  • Settings of the operations on the tables.
  • Metadata and connection context to a database schema.
  • Connection sharing and transaction management.
  • Create queries using SQLBuilder.
Exercise
Read and update a data repository hosted on a MySQL server.
  • Presentation of the tMap component.
  • Configuration of input streams, creation of joins.
  • Perform transformations using variables, expressions and joins.
  • Qualify the data using filters.
  • Generate multiple outputs, manage rejects.
Exercise
Consolidation of multi-source data and generation of a warehouse.
 

Supplements

  • Break down a job into sub-jobs, use of tRunJob. Launching jobs from the command line. Periodic execution.
  • Debugging a job, tracing execution statistics.
  • Reporting tJasperOutput.
Exercise
Generate a Jasper Report from a warehouse.

Pedagogical methods

Practical training: 70% Practical, 30% Theory.
Training material distributed in digital format to all participants.
Access to servers and databases as well as PCs are provided for practice.

Evaluation method

The evaluation of the objectives is done throughout the session through multiple exercises (70% of time).

Instructor

Our training is provided by Mohand LARABI, PhD in computer science and Talend expert.

Organization

Classes start at 9am until 12:30pm and then from 2pm until 5:30pm. That is 7 hours per day.

Location and dates of the sessions

26 avenue Perrichont 75016 Paris

02 au 04 Mai 2022(inclus)​

26 avenue Perrichont 75016 Paris

01 au 03 Juin 2022 (inclus)​

CUSTOMER NOTICES

Satisfaction

100
%

Attendance

95
%

Contact us

125 rue Michel Ange 75016 Paris

Home

+33(0)142307782