For any task related to Data Mining and cleansing it is must to have knowledge of the overall workflow helps you streamline your data cleansing implementation. From InfoSphere Quality Stage perspective creating cleansed data is a four-phase and iterative approach as shown in the following diagram:
- Understand organizational goals and how they determine your requirements
- Understand and analyze the nature and content of the source data
- Design and develop the jobs that cleanse the data
- Evaluate the results
Will cover each of these Phases in upcoming blogs.