What is data merging,data cleansing,sampling?
Answers were Sorted based on User's Feedback
Answer / rekha
DATA MERGING : IT IS THE PROCESS OF INTEGRATING THE SOURCES
WITH SIMILAR STRUCTURE AND SIMILAR TYPE
TABLE A
ENO ENAME
100 REKHA VARCHAR2(10)
TABLE B
ENO ENAME
101 SAHAN VARCHAR
SO U CAN MERGE TO COMMAON DATATYPE STRING IN THE ABOVE CASE
dATA CLEANSING :
IT IS THE PROCESS OF IDENTIFING THE INCONSISTANCIES AND
INACCURACIES
DATASAMPLING:
ARBITARILY CHOOSING THE RECORDS FROM GROUP OF RECORDS FOR
TEST
Is This Answer Correct ? | 2 Yes | 0 No |
Data Merging: It is a process of combning Non-Similar
structures or Similar structure data into Target Warehouse
system.
To combine Non Similar we can use Joins Concept,For
Similar We can use Union Concept
Data Cleansing:It is a process of converting Non Unique
data format of the source system into unique data format of
Target Warehouse system.
I dont Know definion for Data Sampling..can anyone plz give
the answer...
Is This Answer Correct ? | 2 Yes | 1 No |
Answer / rakesh
Data Merging:-
Is the process of merging non similar structure data (or)similar structure data into target warehouse system;
Data Cleansing:-
IS the process of converting source Non-Unique data format into unique data format into target warehouse system.
sampling:-
it is the process ,orbitarly reading the data from
group of records.
Is This Answer Correct ? | 0 Yes | 0 No |
Answer / dr.jornalist
Datamerging: This is the process of
Datacleansing:- Removing the Data inconsistencies and
Inaccuracy
Data sampling:- Arbitorily taking the data from the group
of records for the sample purpose.
Is This Answer Correct ? | 2 Yes | 4 No |
What is the difference between OLTP and ODS?
How to recover sessions in concurrent batches?
What is an incremental loading? in which situations we will use incremental loading
Some flat files are there, out of these having some duplicate. How do you eliminate duplicate files while loading into targets?
How can we get two output ports in un-connect transformation?
my source is like this id,name sal 10 abc 1000,10 pqr 2000, 10 xyz 3000 ,10 jkl 4000 and my requirement is like this 10 abc,pqr,xyz,jkl 2000 .... i have try for this by using expression transformatin its ok of the concatenation of second column but the thing is that on third column if u group by using agg t/r the last value will com i.e 4000 but i asked by a interviewer that i dont want the first or last column i want the middle column i.e 2000 .plz reply for the same
Do you have knowledge in ralph kimball methodology
Why we require dwh in particular projects?
On a day i load 10 rows in my target and on nextday i get 10 more rows to add in target. But out of 10 - 5 records are send them to target?how i can insert the remaining records
Without source how to insert record to target?
What is the benefit of session partitioning?
What is target update override