At the next Orange County Java Users Group meeting I will be giving an intro talk on Talend Open Studio. I chose to speak on Talend because I found it to be a highly productive tool for creating data integration (ETL/ELT), schema migration, and batch processing jobs. The full abstract is below.
If you can attend, please RSVP on our new site hosted by Meetup.
Abstract: Talend Open Studio for Data Integration is an Eclipsed-based Java code generator that allows you to create ETL (extract, transform, load) jobs. It does this primarily through a visual programming editor containing over 600 components, including 450+ connectors for databases, files, and applications. However, it can be extended by injecting Java or Groovy code where needed, or by developing custom components in Java. In essence, it’s a Swiss Army knife for your data. There are specialized versions as well, targeting big data (Hadoop), BPM, data quality, and more.
In this introductory talk I will cover what TOS DI is capable of, best practices, and give a live demo showing you how to:
- connect to databases and work with files
- map input and output schemas
- filter and join data
- insert Java code for more advanced data manipulation
As time permits, I will delve further into the feature set.
Update: Here are the slides: