Friday, January 21, 2011

Tips and Tricks to Debug InfoSphere DataStage Jobs

Here is a short list of tricks for debugging DataStage parallel jobs. I will keep updating this post as needed, so treat it as a continuously evolving set of steps.

APT_DISABLE_COMBINATION: Setting this disables the combining of multiple stages into one process. When a job runs into issues, turning combination off makes it easier to identify the error and its source.

APT_CONFIG_FILE: Points to the configuration file, which is required to run any DataStage parallel job. You can keep multiple configuration files with different node combinations and choose one based on your requirements.
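
To make the idea of keeping several configuration files with different node counts concrete, here is a minimal sketch that generates the text of such a file. The host name and directory paths are hypothetical placeholders, the function name build_config is my own, and the layout follows the usual node / fastname / pools / resource form of a parallel configuration file; check it against a working file from your own installation before using it.

```python
def build_config(hostname, node_count, disk_dir, scratch_dir):
    """Return the text of a configuration file with node_count logical nodes.

    All nodes are placed on the same host, which is the simplest
    single-server layout. Paths and host name are supplied by the caller.
    """
    lines = ["{"]
    for i in range(1, node_count + 1):
        lines += [
            f'  node "node{i}"',
            "  {",
            f'    fastname "{hostname}"',
            '    pools ""',
            f'    resource disk "{disk_dir}" {{pools ""}}',
            f'    resource scratchdisk "{scratch_dir}" {{pools ""}}',
            "  }",
        ]
    lines.append("}")
    return "\n".join(lines) + "\n"
```

Calling build_config("etl-host", 1, "/data/ds/disk", "/data/ds/scratch") and the same call with 4 gives a one-node and a four-node file. Running a failing job against the one-node file first is a common way to rule out partitioning as the cause.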

APT_PM_SHOW_PIDS: Provides the process ID for each stage.

APT_RECORD_COUNTS: Reports the number of records in the DataStage Director log.

APT_PM_PLAYER_TIMING: Reports the CPU time used by each stage.

OSH_DUMP: Generates and displays the OSH code for a DataStage parallel job, including any unexpected settings set by the GUI.

APT_DUMP_SCORE: Lists all processes and inserted operators in your job.
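
The reporting and debugging options described so far are switched on through environment variables. The sketch below collects them in one place so they can be turned on together for a debugging run. The variable names are my assumption about which parallel-engine settings the descriptions refer to, the value "1" is a placeholder (in the Administrator client these are normally shown as True/False), and the helper name debug_environment is hypothetical. In practice these variables are usually set per project or as job parameters rather than from a script.

```python
import os

# Assumed names for the settings described in this post; verify each one
# against your DataStage version before relying on it.
DEBUG_FLAGS = {
    "APT_DISABLE_COMBINATION": "1",  # one process per stage
    "APT_PM_SHOW_PIDS": "1",         # process ID for each stage
    "APT_RECORD_COUNTS": "1",        # record counts in the job log
    "APT_PM_PLAYER_TIMING": "1",     # CPU time used by each stage
    "OSH_DUMP": "1",                 # generated OSH code
    "APT_DUMP_SCORE": "1",           # processes and inserted operators
}


def debug_environment(base=None, flags=DEBUG_FLAGS):
    """Return a copy of the environment with the debug flags added.

    The caller's environment (or the supplied base mapping) is not modified.
    """
    env = dict(os.environ if base is None else base)
    env.update(flags)
    return env
```

The returned dictionary can be passed as the env argument of subprocess.run when a job is launched from a wrapper script, so the extra logging applies to that run only.
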
Row Generator can be used to generate the sample data that a DataStage parallel job needs as input.

Phantom files provide details about additional error messages and reside in $DSHOME/../DSProjects/<Project_Name>/&PH&
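
Since the &PH& directory tends to accumulate many files, it helps to look at the most recent ones first. Here is a small sketch that lists the newest phantom files for a project. The function name latest_phantom_files and the limit parameter are my own, and the project directory is whatever $DSHOME/../DSProjects/<Project_Name> resolves to on your server.

```python
from pathlib import Path


def latest_phantom_files(project_dir, limit=5):
    """Return the most recently modified files in the project's &PH& directory.

    project_dir is the project directory itself; the &PH& subdirectory
    name is appended here. Files are returned newest first.
    """
    ph_dir = Path(project_dir) / "&PH&"
    files = [p for p in ph_dir.iterdir() if p.is_file()]
    files.sort(key=lambda p: p.stat().st_mtime, reverse=True)
    return files[:limit]
```

When working in a shell instead, remember to quote the directory name, because & is a shell metacharacter.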

To inspect intermediate data, use a Copy stage to send a copy of the data to a Peek stage or a Sequential File stage.

Disclaimer: "The postings on this site are my own and don't necessarily represent IBM's positions, strategies or opinions."
