Tips and Tricks to Debug InfoSphere DataStage Job

Here is a Small List of tricks for debugging DataStage parallel jobs. I will keep updating this post on need basis. So it is a continuous evolution of Steps

APT_DISABLE_COMBINATION
It is to disable or enable combining multiple stages into one process. If facing any issues, disabling this makes life easier to identify the error and source of error.

APT_CONFIG_FILE
Must to run any DataStage Parallel Job. You can have multiple configuration files and can use them based on your requirements. Can use various node combination and use it.

APT_PM_SHOW_PIDS
Provide information about Process ID for each stage

APT_RECORD_COUNTS
Provide number of records in the DataStage Director Log

APT_PM_PLAYER_TIMING
 CPU time used by each stage

OSH_DUMP
Generate/Display OSH code for DataStage Parallel job including unexpected settings if any set by the GUI.

APT_DUMP_SCORE
Details out all processes and inserted operators in your job
   
Row Generator can be used to generate sample data required by DataStage Parallel Job for consumption.

Phantom files provide details about addition error messages and resides $DSHOME/../DSProjects/<Project_Name>/&PH&

In case we want to see intermediate data can use Copy Stage to dump out data in Peek Stage or Sequential File
...Ongoing
-Ritesh

Disclaimer: "The postings on this site are my own and don't necessarily represent IBM's positions, strategies or opinions.