source
1123445
I WANT OUTPUT AS
DUPLICATES TO TARGET1 LIKE
TARGET1
1144
NON-DUPLICATES TO TARGET2
TARGET2
235
Answers were Sorted based on User's Feedback
Source
..|
..|
copy--->agg
..|.....|
..|.....|
join stage
....|
....|
Filter stage -----> target1
..|
..|
target2
the main data is:
1
1
2
3
4
4
5
from aggregator stage, the output is:
1,2
2,1
3,1
4,2
5,1
If you join these two links then the output will be:
1,2
1,2
2,1
3,1
4,2
4,2
5,1
Then specify the count<>1 in the Filter for target1 then you get the duplicate records. means YOU get:
1
1
4
4
in another link for target2, give count=1. means YOU get:
2
3
5
| Is This Answer Correct ? | 8 Yes | 0 No |
Answer / s
agg-->filter-->trg2
^ |
| v
seq-->copy-->join-->trg1
agg:countrows
join:innerjoin
filter:count=1
:count>1
| Is This Answer Correct ? | 0 Yes | 0 No |
Answer / chaint
source1(112345) ----- lookup stage- reference lookup
on(source2 output)--reject link(1144) --output(235)
source2(112345) -- sort( get count) -> filter(only unique)
we would require two source..
one original and other only (non repeated records)
we will have a lookup stage with source1 as input and
source2 as reference lookup..
in lookup stage we will have a reject link(1144) non matched
records.. and output will be(235).
Kindly correct me if i m wrong
| Is This Answer Correct ? | 0 Yes | 2 No |
Answer / nagam
seq.file----->sort------>filter----2datasets
in sort stge create key change column and then filter stage
write the condition on based on keychange column keychange
column =1 uniq data keychange<>0 duplicate data we can get
If wrong please tell me
| Is This Answer Correct ? | 0 Yes | 5 No |
i 10 jobs first two jobs are runing in 2nodes,next 2 jobs are running in 4 nodes, next 4 jobs are running in 6 nodes and the remaining jobs are running on 10 nodes. how to change the node configuration?
What is a merge?
What's the Main Function of the Staging area in DWH
What are constraints and derivations?
How will you load you daily/monthly jobs datas in to Fact and Dimension table using datastage.
Describe the architecture of datastage?
why dataset ?
How can we do null handling in sequential files?
at source level i have 40 columns,i want only 20 cols at target what r the various ways to get it
AGGREGATOR default datatype
what is the difference between == and eq in UNIX shell scripting?
What is PX?