What is the difference between SET and MERGE?
Answer / sweety
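Set : reads observations from one or more data sets. A single
SET statement that lists several data sets concatenates them.
With a relation, i.e. a BY statement, it interleaves them
instead of matching them as MERGE does, and each data set
must already be sorted by the BY variable.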
Ex :
Set statement
data demo;
set dm1;
set dm2;
run;
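It will produce a result similar to a merge without a BY
statement (observations are paired one to one), except that
the step stops as soon as the shorter data set runs out of
observations.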
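Ex :
data demo;
set dm1 dm2;
run;
It will not produce an error. It concatenates the data sets:
all observations of dm1 are followed by all observations of
dm2.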
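Merge : it will combine the data sets side by side, with or
without a relation (a BY statement).
To pair observations with SET, a separate SET statement has
to be written for every data set, whereas a single MERGE
statement lists them all. MERGE also keeps reading until
every data set is exhausted.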
Ex :
Merge (without relation)
data demo;
merge dm1 dm2;
run;
Merge (with relation)
data demo;
merge dm1 dm2;
by <variable name>;
run;
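The two-SET form and MERGE without BY differ when the data sets have different numbers of observations. The following is a minimal sketch of that difference; the data set names long_ds and short_ds and their variables are hypothetical and chosen only for illustration.

```sas
/* Hypothetical input with three observations */
data long_ds;
  input a;
  datalines;
1
2
3
;
run;

/* Hypothetical input with two observations */
data short_ds;
  input b;
  datalines;
10
20
;
run;

/* Two SET statements: the step ends when short_ds is exhausted,
   so via_set has 2 observations. */
data via_set;
  set long_ds;
  set short_ds;
run;

/* MERGE without BY: reading continues to the end of the longer
   data set, so via_merge has 3 observations and b is missing
   on the third. */
data via_merge;
  merge long_ds short_ds;
run;
```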
Answer / sandhya
The SET statement is used for one-to-one reading, concatenation,
and interleaving, whereas MERGE is used for one-to-one merging
and match-merging.
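The methods named above can be sketched side by side. This is only an illustration: dm1 and dm2 are given made-up contents, and the key variable id and the output data set names are assumptions, not part of the original question.

```sas
/* Hypothetical inputs that share the key variable id */
data dm1;
  input id x;
  datalines;
1 10
2 20
3 30
;
run;

data dm2;
  input id y;
  datalines;
1 100
3 300
4 400
;
run;

/* Concatenation: all rows of dm1, then all rows of dm2 (6 observations) */
data concat;
  set dm1 dm2;
run;

/* Interleaving: the same 6 rows, kept in id order.
   Both inputs must already be sorted by id. */
data interleave;
  set dm1 dm2;
  by id;
run;

/* One-to-one reading: row n of dm1 is paired with row n of dm2;
   id from dm2 replaces id from dm1 (3 observations) */
data one_to_one;
  set dm1;
  set dm2;
run;

/* Match-merging: rows are paired on equal values of id
   (4 observations, one per distinct id) */
data matched;
  merge dm1 dm2;
  by id;
run;
```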
Answer / pari
The SET and MERGE statements perform similar functions. With
SET, the two data sets are combined unconditionally, one
after the other. With MERGE, they are usually combined
conditionally: the data sets are first sorted with PROC SORT
and then matched on the variable named in the BY statement.
Answer / kiran kumar yarlagadda
The SET statement is used to create a new data set from an
already existing data set, and it is also used to
concatenate a number of data sets. With SET, the data sets
are combined observation by observation (rows are stacked),
whereas with MERGE they are combined variable by variable
(columns are placed side by side).
Answer / srinivas,korasavada
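Set: concatenates one data set to the end of another. The
observations of the second data set start after the last
observation of the first.
Merge: joins the observations of the data sets side by side
into single observations, rather than adding one data set to
the end of another.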
Answer / chiranjeevi
MERGE: The MERGE statement is used to combine two data sets.
The data sets can be matched on a variable, which is
specified with a BY statement in the DATA step.
SET: The SET statement concatenates the two data sets. It
does not match observations on a variable; with a BY
statement it interleaves them instead.
Difference: With MERGE, when both data sets contain a
variable with the same name (other than the BY variable),
the value from the data set listed last overwrites the
earlier one. A single SET statement does not overwrite
anything, because each observation is written out
separately.
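A small sketch of the overwriting behaviour, assuming two hypothetical data sets a and b that both contain a variable named score; the names and values are made up for illustration.

```sas
/* Hypothetical inputs: both carry id and score */
data a;
  input id score;
  datalines;
1 50
2 60
;
run;

data b;
  input id score;
  datalines;
1 55
2 65
;
run;

/* MERGE: one observation per id; score comes from b, the data
   set listed last, so the values are 55 and 65 */
data merged;
  merge a b;
  by id;
run;

/* SET: four observations; every score value from a and b is kept */
data stacked;
  set a b;
run;
```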
Answer / kishore
The SET statement is used to create a new data set from an
existing data set, so that we can add new variables or
create a subset.
The MERGE statement is used when we want to match the
observations of one data set with the observations of
another data set.
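As a rough illustration of using SET to subset a data set and add a variable, here is a sketch; the data set scores and its variables are hypothetical.

```sas
/* Hypothetical input data set */
data scores;
  input id marks;
  datalines;
1 35
2 60
3 82
;
run;

/* Read scores, keep only rows with marks of at least 50,
   and add a new variable derived from marks */
data passed;
  set scores;
  where marks >= 50;
  percent = marks / 100;
run;
```

The WHERE statement filters observations as they are read, so passed contains only the rows for id 2 and id 3.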
Answer / ashok kore
Set : It appends the data sets one after another at row
level. The data sets do not need to be sorted before
appending (unless a BY statement is used).
Merge : It merges the data sets according to a condition at
column level. The data sets must be sorted by the BY
variable before merging. It works much like a join in PROC
SQL, although many-to-many matches behave differently.
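The sort-then-merge pattern and its rough PROC SQL counterpart can be sketched as follows. The data set names left_ds and right_ds, the key id, and the flag variables in_left and in_right are assumptions made for this example.

```sas
/* Hypothetical unsorted inputs */
data left_ds;
  input id x;
  datalines;
3 30
1 10
2 20
;
run;

data right_ds;
  input id y;
  datalines;
4 400
1 100
3 300
;
run;

/* MERGE with BY requires both inputs to be sorted by the BY variable */
proc sort data=left_ds;  by id; run;
proc sort data=right_ds; by id; run;

/* IN= creates temporary flags that show which input contributed
   to the current observation */
data both;
  merge left_ds(in=in_left) right_ds(in=in_right);
  by id;
  if in_left and in_right;  /* keep matching ids only, like an inner join */
run;

/* Comparable inner join in PROC SQL (no prior sort needed) */
proc sql;
  create table both_sql as
  select l.id, l.x, r.y
  from left_ds as l inner join right_ds as r
    on l.id = r.id
  order by l.id;
quit;
```

With these inputs both steps keep the rows for id 1 and id 3. The two approaches give the same result here because id is unique in each input.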
Answer / chandra sekar
The SET statement is usually used to "append" one data set to
another, and the MERGE statement is used to "merge" two data sets.