How do you cleanse data?
Answer / navin
Data cleansing means converting data held in non-uniform formats into a single, uniform format. This is performed in the Transformer stage.
Answer / satyanarayana
Data cleansing removes unwanted data (bad records or NULL values), then finds inconsistent data and makes it consistent.
Example:
Before cleansing
Loc
---
Hyd
Hyderabad
hyde
After Cleansing
Loc
---
Hyderabad
Hyderabad
Hyderabad
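The standardization in this example can be sketched outside DataStage as a simple lookup. The Python snippet below is only an illustration, not a DataStage job: the `CANONICAL_LOC` mapping and the `cleanse_loc` helper are hypothetical names, and the list of variants is assumed from the example values.

```python
# Hypothetical mapping from known variants (lower-cased) to the canonical value.
CANONICAL_LOC = {
    "hyd": "Hyderabad",
    "hyde": "Hyderabad",
    "hyderabad": "Hyderabad",
}

def cleanse_loc(value):
    """Return the canonical location, or None for NULL, blank, or unknown input."""
    if value is None:
        return None
    key = value.strip().lower()
    # Unknown variants come back as None so they can be handled as bad records.
    return CANONICAL_LOC.get(key)

rows = ["Hyd", "Hyderabad", "hyde"]
cleansed = [cleanse_loc(r) for r in rows]
print(cleansed)  # ['Hyderabad', 'Hyderabad', 'Hyderabad']
```

In a real job the mapping would more likely live in a reference table used by a lookup than in code.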
Answer / usha
Data cleansing includes removing unwanted spaces.
The LTRIM and RTRIM functions remove unwanted leading and trailing spaces.
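As a rough analogy only, Python's built-in `str.lstrip`, `str.rstrip`, and `str.strip` behave like left trim, right trim, and trim on both sides. This is not DataStage syntax, and the sample value is made up for illustration.

```python
raw = "  Hyderabad   "  # hypothetical input with stray spaces

left_trimmed = raw.lstrip()    # removes leading whitespace only
right_trimmed = raw.rstrip()   # removes trailing whitespace only
both_trimmed = raw.strip()     # removes leading and trailing whitespace

print(repr(left_trimmed))   # 'Hyderabad   '
print(repr(right_trimmed))  # '  Hyderabad'
print(repr(both_trimmed))   # 'Hyderabad'
```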
Answer / krish
It is the process of correcting inconsistent data and putting it into a consistent format.
Answer / venkatesh k
Data cleansing means applying all the de-duplication rules that the requirements call for, so that the data is unique. For cleansing operations we mainly use the Transformer, Sort, Aggregator, and Lookup stages.
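A de-duplication rule of the "keep the first record per key" kind can be sketched in Python as follows. The `dedupe` helper, the `id` key, and the sample records are all hypothetical; which record to keep and which fields form the key depend on the actual requirements.

```python
def dedupe(records, key_fields):
    """Keep the first record seen for each combination of key_fields."""
    seen = set()
    unique = []
    for rec in records:
        key = tuple(rec[f] for f in key_fields)
        if key not in seen:
            seen.add(key)
            unique.append(rec)
    return unique

# Hypothetical sample data: id 1 appears twice.
customers = [
    {"id": 1, "loc": "Hyderabad"},
    {"id": 1, "loc": "Hyderabad"},
    {"id": 2, "loc": "Hyderabad"},
]
unique_customers = dedupe(customers, ["id"])
print([c["id"] for c in unique_customers])  # [1, 2]
```

Sorting the input first, as a Sort stage would, makes "first record per key" deterministic when duplicates differ in their non-key fields.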
Answer / b.rambabu
Data cleansing is the process of identifying and correcting data inconsistencies and inaccuracies.
Example:
Data inaccuracy:
hyd
Hydrabad
After cleansing:
hydrabad
hydrabad
Data inconsistency:
10.78
10.23465
After cleansing:
10.78
10.23
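Bringing numeric values to a common precision can be sketched in Python with the standard `decimal` module. The `to_two_places` helper is a hypothetical name, and both the two-decimal target and the half-up rounding mode are assumptions; the answer does not say which rounding rule applies.

```python
from decimal import Decimal, ROUND_HALF_UP

def to_two_places(text):
    """Parse a numeric string and round it to two decimal places (assumed rule)."""
    return Decimal(text).quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)

values = ["10.78", "10.23465"]
cleansed = [str(to_two_places(v)) for v in values]
print(cleansed)  # ['10.78', '10.23']
```

Parsing from strings rather than floats avoids binary floating-point surprises during rounding.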