View
435
Download
7
Tags:
Embed Size (px)
DESCRIPTION
Talend Open Studio is the most open, innovative and powerful data integration solution on the market today. Talend Open Studio for Data Integration allows you to create ETL (extract, transform, load) jobs.
Citation preview
Talend Open Studio
Sat Apr 19, 2014Brij Bhushan SharmaSr. Software Engineer
2
Talend Open Studio
What is Talend Open Studio?
2
What is Talend Open Studio
● Talend Open Studio is the most open, innovative and powerful data integration solution on the market today.
● Talend Open Studio for Data Integration allows you to create ETL (extract, transform, load) jobs.
● A graphical integrated development environment with an intuitive Eclipse-based interface.
● Draw procedures linking components, each component performs an operation.
● Produces fully editable Java (or Perl) code
2
Talend Open Studio
Main features and benefits of that solution:
● Business modeling● Graphical development● Drag-and-drop job design● Metadata-driven design and execution● Real-time debugging● Robust execution● A unified repository for storing and reusing metadata
2
Talend Open Studio
ETL is a common process in Data Integration
Extract: reading data from different datasources (database, flat files, spreadsheet files, web services, etc).
Transfom: converting data in a form so that it can be placed in another container (database, web services, files etc). Cleaning, computations and verifications are also performed.
Load: write the data in the target format.
2
Talend Open Studio
Important concepts in Talend Data
Integration Studio
2
Talend Open Studio
What is a repository?
A repository is the storage location Talend Data Integration Studio uses to
gather data related to all of the technical items that you use either to describe business models or to design Jobs.
2
Talend Open Studio
What is a project?
Projects are structured collections of technical items and their associated
metadata.
All of the Jobs and business models you design are organized in Projects.
2
Talend Open Studio
What is a workspace? A workspace is the directory where you
store all your project folders. You need to have one workspace
directory per connection (repository connection). Talend enables you to
connect to different workspace directories, if you do not want to use the
default one.
2
Talend Open Studio
What is a component?A component is a preconfigured connector used to perform a specific data integration
operation, no matter what data sources you are integrating: databases, applications, flat
files, Web services, etc.
A component can minimize the amount of hand-coding required to work on data from
multiple, heterogeneous sources.
2
Talend Open Studio
What is an item? An item is the fundamental technical unit in a project. Items are grouped, according to their
types,as: Job Design, Business model, Context, Code, Metadata, etc. One item can
include other items. For example, the business models and the Jobs you design are items, metadata and routines you use
inside your Jobs are items as well.
2
Talend Open Studio-User Interface
2
Talend Open Studio
What is a repository?
A repository is the storage location Talend Data Integration Studio uses to
gather datarelated to all of the technical items that
you use either to describe business models or to design Jobs.
2
Talend Open Studio
Small Demo on Talend Job
2
Talend Open Studio
Source Data
2
Talend Open Studio
Lookup Table
2
Talend Open Studio
Target Table
2
Talend Open Studio
Have a look on the main job
2
T-Map: Mapping Component
2
Java Function Used