Upcoming Talk: Intro to Talend Open Studio For Data Integration

At the next Orange County Java Users Group meeting I will be giving an intro talk on Talend Open Studio. I chose to speak on Talend because I found it to be a highly productive tool for creating data integration (ETL/ELT), schema migration, and batch processing jobs. The full abstract is below.

If you can attend, please RSVP on our new site hosted by Meetup.

Abstract: Talend Open Studio for Data Integration is an Eclipsed-based Java code generator that allows you to create ETL (extract, transform, load) jobs. It does this primarily through a visual programming editor containing over 600 components, including 450+ connectors for databases, files, and applications. However, it can be extended by injecting Java or Groovy code where needed, or by developing custom components in Java. In essence, it’s a Swiss Army knife for your data. There are specialized versions as well, targeting big data (Hadoop), BPM, data quality, and more.

In this introductory talk I will cover what TOS DI is capable of, best practices, and give a live demo showing you how to:

  • connect to databases and work with files
  • map input and output schemas
  • filter and join data
  • insert Java code for more advanced data manipulation

As time permits, I will delve further into the feature set.

Update: Here are the slides:

Download PowerPoint slides

Leave a Reply

Your email address will not be published. Required fields are marked *

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>