Talk:Data cleansing

From Wikipedia, the free encyclopedia

This article is within the scope of the Business and Economics WikiProject.
This article has been rated as Stub-Class on the assessment scale.
This article has been rated as Low-importance on the assessment scale.

This step aims to eliminate duplicate records, repair records containing junk characters, and remove any blank lines. Once these four steps are complete, the populated data load files are ready to be loaded into the SAP system.
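As a rough illustration of the clean-up step described above, here is a minimal Python sketch. It is not taken from any SAP tool: the function name `clean_lines` is hypothetical, "junk characters" is assumed to mean non-printable control characters, and duplicates are assumed to be exact matches after cleaning.

```python
import re

# Assumption: "junk characters" are non-printable control characters
# (tab and newline are left alone; tabs may be field delimiters).
_JUNK = re.compile(r"[\x00-\x08\x0b-\x1f\x7f]")


def clean_lines(lines):
    """Repair junk characters, drop blank lines, and drop exact duplicates.

    The first occurrence of each record is kept and input order is preserved.
    """
    seen = set()
    cleaned = []
    for line in lines:
        # Remove control characters, then trim surrounding whitespace.
        record = _JUNK.sub("", line).strip()
        if not record:
            continue  # blank line
        if record in seen:
            continue  # duplicate of an earlier record
        seen.add(record)
        cleaned.append(record)
    return cleaned
```

A real load file would probably need field-level rules (for example, comparing records on key fields rather than on the whole line), so this should be read only as a sketch of the idea.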

Note on MDM: If multiple legacy systems hold data about the same data object under different identifiers, SAP MDM can be used to resolve such inconsistencies. SAP MDM provides for three scenarios: (1) data consolidation, (2) master data harmonization, and (3) central data management. The content checker of MDM can identify duplicates based on business rules. For example, if two customers have the same VAT registration number, they are treated as a single customer even if the primary key identifier (the customer number in this case) differs between the source systems.
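The VAT-number rule in the note above can be illustrated with a short Python sketch. This is not the SAP MDM content checker or its API. The field names (`source`, `customer_number`, `vat_number`) and the normalization (removing spaces, upper-casing) are assumptions made only for the example.

```python
from collections import defaultdict


def group_by_vat(records):
    """Group customer records by normalized VAT registration number."""
    groups = defaultdict(list)
    for rec in records:
        # Assumption: spaces and letter case are not significant in a VAT number.
        vat = rec["vat_number"].replace(" ", "").upper()
        groups[vat].append(rec)
    return dict(groups)


def find_duplicates(records):
    """Return only the VAT numbers shared by more than one record."""
    return {
        vat: recs
        for vat, recs in group_by_vat(records).items()
        if len(recs) > 1
    }


# Hypothetical records from two legacy systems.
customers = [
    {"source": "A", "customer_number": "1001", "vat_number": "DE 123456789"},
    {"source": "B", "customer_number": "C-77", "vat_number": "de123456789"},
    {"source": "B", "customer_number": "2002", "vat_number": "FR00111222333"},
]
shared = find_duplicates(customers)
```

Here the first two records would be flagged as the same customer even though their customer numbers differ, which is the situation the note describes.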