Pentaho Data Integration (Kettle)

Pentaho Data Integration, formerly known as Kettle, is a powerful, metadata-driven ETL tool designed to bridge the gap between business and IT.

See the Java and Troubleshooting GUI tools pages for more information.

Resources

Pentaho have documentation for using PDI on their wiki.

Here are some other resources:

Step Error Handling

Step error handling lets you configure a step so that, instead of halting the transformation when an error occurs, it passes the rows that caused the error to a different step. Error handling is enabled in the Step error handling settings page. To open it, right-click the step and select Define error handling.

Note: This menu item appears only for steps that support error handling.

At a minimum, you need to set a target step for the rows in error and check "Enable the error handling". When you preview this target step, you can see the extra fields that are added to the error rows. Because the error rows form an ordinary data stream, you can build further data flows on them within the transformation. A typical use case is an alternative way of doing an upsert (Insert/Update): for example, rows that fail to insert can be routed to a step that updates the existing record instead.
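
The upsert pattern can be illustrated outside PDI with a small conceptual sketch. This is not PDI code and uses no PDI API: it is plain Python with an in-memory SQLite database, and the `customer` table, the `insert_step` and `update_step` functions, and the `error_count` and `error_description` field names are all hypothetical, chosen only to mirror the idea of an insert step whose error rows feed an update step.

```python
import sqlite3


def insert_step(conn, rows):
    """Try to insert each row; divert failing rows instead of aborting.

    Returns the error rows, each extended with extra (hypothetical) error fields,
    much like the extra fields visible when previewing an error target step.
    """
    error_rows = []
    for row in rows:
        try:
            conn.execute(
                "INSERT INTO customer (id, name) VALUES (?, ?)",
                (row["id"], row["name"]),
            )
        except sqlite3.IntegrityError as exc:
            # The row violated the primary key: pass it on rather than halting.
            error_rows.append(
                {**row, "error_count": 1, "error_description": str(exc)}
            )
    return error_rows


def update_step(conn, error_rows):
    """Target step for the error stream: update the rows that already exist."""
    for row in error_rows:
        conn.execute(
            "UPDATE customer SET name = ? WHERE id = ?",
            (row["name"], row["id"]),
        )


def run_demo():
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE customer (id INTEGER PRIMARY KEY, name TEXT)")
    conn.execute("INSERT INTO customer (id, name) VALUES (1, 'old name')")

    incoming = [{"id": 1, "name": "new name"}, {"id": 2, "name": "second"}]
    errors = insert_step(conn, incoming)   # row 1 fails, row 2 is inserted
    update_step(conn, errors)              # row 1 is updated instead
    table = conn.execute("SELECT id, name FROM customer ORDER BY id").fetchall()
    return errors, table


if __name__ == "__main__":
    errors, table = run_demo()
    print(errors)
    print(table)
```

In this sketch only the row with the existing key reaches the update step, so the common case (a new row) costs a single insert. Whether that trade-off pays off in a real transformation depends on how often rows already exist.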

More at Transformation Steps


Creator: sd on 2008/11/04 15:49