Data Cleansing & Analyzing source data - QualityStage-III

Once we understand the organization's data quality goals, we need to gain insight into the source data, because whatever is in the source is reflected in every downstream result. The Investigate stage can be used to analyze the quality of the source data, and it helps you determine the business rules to use when designing a data cleansing project.
The Investigate stage indicates the degree of processing needed to create the cleansed target data. Investigating data identifies errors, validates the contents of fields in a data file, and lets the team identify and correct data problems before they infect new systems. The Investigate stage analyzes data by determining the number and frequency of unique values, and by classifying or assigning a business meaning to each occurrence of a value within a column. The Investigate stage has the following capabilities:
The Investigation reports, which you can generate from the IBM® InfoSphere Information Server Web console by using data processed in the Investigate job, can help you evaluate your data and develop better business practices. Please refer to the link below for more details.
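
To make the idea of counting unique values and classifying each occurrence more concrete, here is a minimal Python sketch. It is not QualityStage code and does not use any IBM API; the helper names (type_pattern, investigate_column) and the sample phone values are made up for illustration. It only mimics the general idea: count how often each value occurs, then reduce each value to a simple character pattern so that unexpected formats stand out.

```python
from collections import Counter


def type_pattern(value: str) -> str:
    """Hypothetical helper: map letters to 'a' and digits to 'n'.

    Any other character (dash, slash, space) is kept as-is, so the
    result describes the shape of the value rather than its content.
    """
    out = []
    for ch in value:
        if ch.isalpha():
            out.append("a")
        elif ch.isdigit():
            out.append("n")
        else:
            out.append(ch)
    return "".join(out)


def investigate_column(values):
    """Return (value frequencies, pattern frequencies) for one column."""
    # Trim surrounding whitespace so "555-9876 " and "555-9876" match.
    cleaned = [v.strip() for v in values]
    value_freq = Counter(cleaned)
    pattern_freq = Counter(type_pattern(v) for v in cleaned)
    return value_freq, pattern_freq


# Made-up sample column; real source data would come from a file or table.
phones = ["555-1234", "555-1234", "5551234", "N/A", "555-9876 "]
value_freq, pattern_freq = investigate_column(phones)

print("Unique values:", len(value_freq))
for pattern, count in pattern_freq.most_common():
    print(f"{pattern!r}: {count}")
```

In this sample, most values share one pattern, and the rare patterns point at rows that probably need a cleansing rule (a missing separator, a placeholder such as "N/A"). That is the kind of insight an investigation is meant to give before any cleansing logic is designed.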

 -Ritesh
Disclaimer: The postings on this site are my own and don't necessarily represent IBM's positions, strategies or opinions