Tuesday, 6 August 2013

Data Cleansing


Also referred to as data scrubbing, the act of detecting and removing and/or correcting a database’s dirty data(i.e., data that is incorrect, out-of-date, redundant, incomplete, or formatted incorrectly). The goal of data cleansing is not just to clean up the data in a database but also to bring consistency to different sets of data that have been merged from separate databases. Sophisticated software applications are available to clean a database’s data using algorithms, rules and look-up tables, a task that was once done manually and therefore still subject to human error.

The process of removing inaccurate and historical data from operational systems to use in a data warehouse.

The process to correct data errors in a collection of data in order to bring the level of quality to an acceptable level to meet the information customers' needs.

means a removing errors and inconsistencies from data being imported into a data warehouse.

A processing step during which missing or inaccurate data is replaced with valid values.



Deduping:


'DeDuping' is the process of removing duplicates in Customer and Address records in a Database or Spreadsheet.

No comments:

Post a Comment