At the next Orange County Java Users Group meeting I will be giving an intro talk on Talend Open Studio. I chose to speak on Talend because I found it to be a highly productive tool for creating data integration (ETL/ELT), schema migration, and batch processing jobs. The full abstract is below.

If you can attend, please RSVP on our new site hosted by Meetup.

Abstract: Talend Open Studio for Data Integration is an Eclipsed-based Java code generator that allows you to create ETL (extract, transform, load) jobs. It does this primarily through a visual programming editor containing over 600 components, including 450+ connectors for databases, files, and applications. However, it can be extended by injecting Java or Groovy code where needed, or by developing custom components in Java. In essence, it’s a Swiss Army knife for your data. There are specialized versions as well, targeting big data (Hadoop), BPM, data quality, and more.

In this introductory talk I will cover what TOS DI is capable of, best practices, and give a live demo showing you how to:

  • connect to databases and work with files
  • map input and output schemas
  • filter and join data
  • insert Java code for more advanced data manipulation

As time permits, I will delve further into the feature set.

Update: Here are the slides:

Download PowerPoint slides