What is data merging, data cleansing, and sampling?
Answers were Sorted based on User's Feedback
Answer / rekha
Data merging: the process of integrating sources that have a similar structure and similar data types.
Table A
ENO  ENAME
100  REKHA   (ENAME is VARCHAR2(10))
Table B
ENO  ENAME
101  SAHAN   (ENAME is VARCHAR)
So in the above case you can merge both sources to a common datatype, string.
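The Table A / Table B case above can be sketched in Python roughly as follows. This is only an illustration, assuming each table is a list of (ENO, ENAME) tuples; the helper name `merge_tables` is hypothetical and is not part of Informatica or any other tool.

```python
# Hypothetical rows for the two sources in the answer above.
table_a = [(100, "REKHA")]  # ENAME declared as VARCHAR2(10) in the source
table_b = [(101, "SAHAN")]  # ENAME declared as VARCHAR in the source


def merge_tables(*tables):
    """Combine rows from similarly structured tables into one list.

    ENAME is cast to str, the common datatype for VARCHAR2 and VARCHAR.
    """
    merged = []
    for table in tables:
        for eno, ename in table:
            merged.append((int(eno), str(ename)))
    return merged


merged_rows = merge_tables(table_a, table_b)
print(merged_rows)
```

The point of the cast is that both sources end up with one agreed type per column before they are combined.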
Data cleansing: the process of identifying inconsistencies and inaccuracies in the data.
Data sampling: arbitrarily choosing records from a group of records for testing.
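As a rough illustration of sampling, the sketch below picks records at random from a larger group. The record list, the sample size, and the helper name `sample_records` are all assumptions made for the example.

```python
import random

# Hypothetical group of 100 record IDs.
records = list(range(1, 101))


def sample_records(rows, k, seed=None):
    """Return k records chosen at random, without repeats."""
    rng = random.Random(seed)  # a fixed seed makes the test sample repeatable
    return rng.sample(rows, k)


test_sample = sample_records(records, 10, seed=42)
print(test_sample)
```

Passing a seed is optional; it is useful when the same test sample has to be reproduced later.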
| Is This Answer Correct ? | 2 Yes | 0 No |
Data merging: the process of combining data with dissimilar or similar structures into the target warehouse system.
To combine dissimilar structures we can use joins; for similar structures we can use a union.
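The join half of this answer can be sketched in Python as below; a union of similar structures is simply the rows of both sources placed in one list. The employee and department rows, the `dno` key, and the helper name `join_on` are hypothetical, chosen only to show two differently structured sources combined on a shared key.

```python
# Two sources with different structures that share the key "dno".
employees = [
    {"eno": 100, "ename": "REKHA", "dno": 10},
    {"eno": 101, "ename": "SAHAN", "dno": 20},
]
departments = [
    {"dno": 10, "dname": "SALES"},
    {"dno": 20, "dname": "HR"},
]


def join_on(left, right, key):
    """Inner join: pair each left row with the right rows sharing its key."""
    index = {}
    for row in right:
        index.setdefault(row[key], []).append(row)
    joined = []
    for left_row in left:
        for right_row in index.get(left_row[key], []):
            joined.append({**left_row, **right_row})
    return joined


joined_rows = join_on(employees, departments, "dno")
print(joined_rows)
```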
Data cleansing: the process of converting the non-uniform data formats of the source system into a uniform data format in the target warehouse system.
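Reading this definition as "bring differently formatted source values into one agreed target format", a minimal sketch for dates might look like this. The list of source formats, the sample values, and the helper name `to_uniform_date` are assumptions for illustration only.

```python
from datetime import datetime

# Assumed formats that the source systems might use for dates.
SOURCE_FORMATS = ("%Y-%m-%d", "%d/%m/%Y", "%Y%m%d")


def to_uniform_date(text):
    """Convert a date string in any known source format to YYYY-MM-DD.

    Returns None when no format matches, so the value can be reviewed.
    """
    for fmt in SOURCE_FORMATS:
        try:
            return datetime.strptime(text.strip(), fmt).strftime("%Y-%m-%d")
        except ValueError:
            continue
    return None


raw_dates = ["2024-01-15", "15/01/2024", "20240115", "not a date"]
clean_dates = [to_uniform_date(d) for d in raw_dates]
print(clean_dates)
```

Values that cannot be converted are returned as None rather than guessed, which also covers the "identify inconsistencies" side of cleansing.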
I don't know the definition of data sampling. Can anyone please give the answer?
| Is This Answer Correct ? | 2 Yes | 1 No |
Answer / rakesh
Data merging:
The process of merging dissimilar or similar structured data into the target warehouse system.
Data cleansing:
The process of converting non-uniform source data formats into a uniform data format in the target warehouse system.
Sampling:
The process of arbitrarily reading data from a group of records.
| Is This Answer Correct ? | 0 Yes | 0 No |
Answer / dr.jornalist
Data merging: This is the process of
Data cleansing: Removing data inconsistencies and inaccuracies.
Data sampling: Arbitrarily taking data from a group of records for sampling purposes.
| Is This Answer Correct ? | 2 Yes | 4 No |
The Normalizer transformation is not involved in a mapplet. Why?
What are the data movement modes in Informatica?
My source table looks like this:
ID  Name  Location  Salary
1   ran   NULL      NULL
1   NULL  DELHI     NULL
1   NULL  NULL      1000
I want my output to look like this:
ID  Name  Location  Salary
1   ran   Delhi     1000
What should my approach be to design an Informatica mapping? Assume my source is a flat file, not a relational table, so I can't use SQL.
There are 2 files, Master and User. We need to compare the 2 files and prepare an output log file which lists the missing Rolename for each UserName between the Master and User files. Please find the sample data:

MASTER.csv
----------
Org|Tmp_UsrID|ShortMark|Rolename
---|---------|---------|------------
AUS|0_ABC_PW |ABC PW   |ABC Admin PW
AUS|0_ABC_PW |ABC PW   |MT Deny all
GBR|0_EDT_SEC|CR Edit  |Editor
GBR|0_EDT_SEC|CR Edit  |SEC MT103
GBR|0_EDT_SEC|CR Edit  |AB User

USER.csv
--------
Org|UserName|ShortMark|Rolename
---|--------|---------|------------
AUS|charls  |ABC PW   |ABC Admin PW
AUS|amudha  |ABC PW   |MT Deny all
GBR|sandya  |CR Edit  |Editor
GBR|sandya  |CR Edit  |SEC MT103
GBR|sandya  |CR Edit  |AB User
GBR|sarkar  |CR Edit  |Editor
GBR|sarkar  |CR Edit  |SEC MT103

Required output file:
---------------------
Org|Tmp_UsrID|UserName|Rolename    |Code
---|---------|--------|------------|-------
AUS|0_ABC_PW |charls  |ABC Admin PW|MATCH
AUS|0_ABC_PW |charls  |MT Deny all |MISSING
AUS|0_ABC_PW |amudha  |ABC Admin PW|MISSING
AUS|0_ABC_PW |amudha  |MT Deny all |MATCH
GBR|0_EDT_SEC|sandya  |Editor      |MATCH
GBR|0_EDT_SEC|sandya  |SEC MT103   |MATCH
GBR|0_EDT_SEC|sandya  |AB User     |MATCH
GBR|0_EDT_SEC|sarkar  |Editor      |MATCH
GBR|0_EDT_SEC|sarkar  |SEC MT103   |MATCH
GBR|0_EDT_SEC|sarkar  |AB User     |MISSING

Both files are mapped through Org and ShortMark. So, for each Org and ShortMark, and for each UserName from USER.csv, we need to find the matching and missing Rolename values. I am able to bring the matching records into the output, but I can't find any concept or logic to produce the "MISSING" records, which are present in Master but not in USER.csv for each UserName. Please help out, guys. Let me know if you need any more information.
Note: In the USER.csv file there are n Organizations, under each of which there are n ShortMarks, each with n UserNames.
One day I load 10 rows into my target, and the next day I get 10 more rows to add to the target, 5 of which are updates to existing rows. How can I send them to the target? How can I both insert and update records?
Two Default User groups created in the repository are ____ and ______
What is the commit type if you have a transaction control transformation in the mapping?
Can we override a native SQL query within Informatica? Where do we do it, and how?
Difference between static and dynamic cache
Suppose we have a set of records. On the first run I need to load the 5th record into the target; on the next run, the 6th record, and so on. What is the procedure to achieve this?
How do we identify which row is for insert and which row is for update in a dynamic lookup cache?
How will you load data into dimension tables and fact tables? What is a hierarchy, and why do we use hierarchies?
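For the Master/User comparison question in the list above, one possible logic for the "MISSING" rows is to pair every user with every master role that shares the same Org and ShortMark, and then check whether the user actually holds that role. The Python sketch below only illustrates that logic on the sample data from the question; the function name `compare` is hypothetical, and how this is turned into an Informatica mapping is left open.

```python
# Sample rows from the question: (Org, Tmp_UsrID, ShortMark, Rolename).
master = [
    ("AUS", "0_ABC_PW", "ABC PW", "ABC Admin PW"),
    ("AUS", "0_ABC_PW", "ABC PW", "MT Deny all"),
    ("GBR", "0_EDT_SEC", "CR Edit", "Editor"),
    ("GBR", "0_EDT_SEC", "CR Edit", "SEC MT103"),
    ("GBR", "0_EDT_SEC", "CR Edit", "AB User"),
]
# Sample rows from the question: (Org, UserName, ShortMark, Rolename).
user = [
    ("AUS", "charls", "ABC PW", "ABC Admin PW"),
    ("AUS", "amudha", "ABC PW", "MT Deny all"),
    ("GBR", "sandya", "CR Edit", "Editor"),
    ("GBR", "sandya", "CR Edit", "SEC MT103"),
    ("GBR", "sandya", "CR Edit", "AB User"),
    ("GBR", "sarkar", "CR Edit", "Editor"),
    ("GBR", "sarkar", "CR Edit", "SEC MT103"),
]


def compare(master_rows, user_rows):
    """Return (Org, Tmp_UsrID, UserName, Rolename, Code) for every
    user and every master role sharing the same Org and ShortMark."""
    held = {}           # (Org, ShortMark, UserName) -> roles the user has
    users_per_key = {}  # (Org, ShortMark) -> user names in first-seen order
    for org, name, mark, role in user_rows:
        held.setdefault((org, mark, name), set()).add(role)
        names = users_per_key.setdefault((org, mark), [])
        if name not in names:
            names.append(name)

    expected = {}       # (Org, ShortMark) -> master (Tmp_UsrID, Rolename)
    for org, tmp_id, mark, role in master_rows:
        expected.setdefault((org, mark), []).append((tmp_id, role))

    output = []
    for (org, mark), names in users_per_key.items():
        for name in names:
            for tmp_id, role in expected.get((org, mark), []):
                code = "MATCH" if role in held[(org, mark, name)] else "MISSING"
                output.append((org, tmp_id, name, role, code))
    return output


result = compare(master, user)
for row in result:
    print("|".join(row))
```

The key step is generating the full user-by-role combination first; the MISSING rows are simply the combinations that have no counterpart in the User file.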