Thursday, June 30, 2011

Optimal Processing of Data ....Modular Development with InfoSphere DataStage - Series 2

When designing Jobs in InfoSphere DataStage aim should be to use modular development techniques. Modular Development maximizes reuse of parallel jobs and components and save development time.
  • Parametrize the Job while designing and supply values at run time. This allows a single job design to process different data in different circumstances or say share the Job Design and can access its multiple invocations processing different set of data. It saves designer from producing multiple copies of the same job with slightly different arguments.
  • Use shared containers to share common logic across a number of jobs. Shared containers are inserted when a job is compiled. If the shared container is changed, the jobs using it will need recompiling.
Disclaimer: The postings on this site are my own and don't necessarily represent IBM's positions, strategies or opinions

No comments:

Post a Comment