Define data cleansing?



Answer / Achin Kumar

Data cleansing, also known as data cleaning or data scrubbing, is the process of detecting and then correcting or removing errors, inaccuracies, and inconsistencies in a dataset, such as duplicate records, missing values, and inconsistently formatted fields. The goal is to improve data quality so that downstream processing and analysis give reliable results, for example by reducing noise and limiting the effect of outliers.
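
As a rough illustration of the definition above, here is a minimal sketch in plain Python. The field names (name, email, age), the 0 to 120 age range, and the helper names clean_records and parse_age are assumptions made up for this example; they do not come from Hadoop or any particular library. It shows the three typical actions: correcting (trimming and normalizing text), nulling out implausible values, and removing incomplete or duplicate records.

```python
from typing import Optional


def parse_age(value) -> Optional[int]:
    """Convert value to an int age, or None if it is missing or implausible."""
    try:
        age = int(str(value).strip())
    except (TypeError, ValueError):
        return None
    # Assumed plausibility range for this example only.
    return age if 0 <= age <= 120 else None


def clean_records(rows: list) -> list:
    """Return cleaned copies of rows: normalized, validated, de-duplicated."""
    cleaned = []
    seen_emails = set()
    for row in rows:
        # Correct: trim whitespace and normalize letter case.
        name = (row.get("name") or "").strip().title()
        email = (row.get("email") or "").strip().lower()
        age = parse_age(row.get("age"))
        # Remove: records with no name or an unusable email.
        if not name or "@" not in email:
            continue
        # Remove: duplicates, keyed on the normalized email.
        if email in seen_emails:
            continue
        seen_emails.add(email)
        cleaned.append({"name": name, "email": email, "age": age})
    return cleaned


# Hypothetical raw input with typical quality problems.
raw = [
    {"name": "  alice SMITH ", "email": "Alice@Example.com ", "age": "34"},
    {"name": "Alice Smith", "email": "alice@example.com", "age": "34"},  # duplicate
    {"name": "Bob", "email": "bob@example.com", "age": "-5"},            # bad age
    {"name": "", "email": "nobody@example.com", "age": "20"},            # no name
    {"name": "Carol", "email": "not-an-email", "age": "41"},             # bad email
]

cleaned = clean_records(raw)
for record in cleaned:
    print(record)
```

With this input, only two records should survive: the first Alice record (normalized) and Bob, whose implausible age is replaced by None rather than dropping the whole record. Whether to drop or repair a bad value is a design choice that depends on the dataset.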

