What are the methods to identify duplicate observations?
Answer / hari prasad reddy
Three methods:
1. PROC SORT
2. PROC SQL with DISTINCT
3. A DATA step with SET and BY, using FIRST.var and LAST.var
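The third method can be sketched roughly as follows. The dataset name demog and the key subjid are borrowed from the example further down this thread; the output names demog_sorted, uniques and dups are made up for illustration.

```sas
/* Sort a copy so that BY-group processing in the DATA step is valid. */
proc sort data=demog out=demog_sorted;
  by subjid;
run;

/* An observation that is both first and last in its BY group is unique.
   Anything else belongs to a group with two or more observations. */
data uniques dups;
  set demog_sorted;
  by subjid;
  if first.subjid and last.subjid then output uniques;
  else output dups;
run;
```

Unlike NODUPKEY, this keeps every member of a duplicated group in dups, including the first one, which helps when you need to compare the duplicates side by side.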
Answer / ramesh sas trainar in stg
1. PROC SORT
2. PROC SQL with DISTINCT
3. FIRST.byvariable
4. LAST.byvariable
5. PROC FREQ
6. PROC SQL with UNIQUE
7. MERGE statement
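As one possible illustration of the PROC FREQ item, the sketch below counts observations per key and keeps only keys that occur more than once. The names demog and subjid come from the example later in this thread; subjid_counts is a hypothetical output dataset.

```sas
/* Count observations per subjid without printing the frequency table.
   The OUT= dataset contains the variables COUNT and PERCENT;
   the WHERE= option keeps only keys that appear more than once. */
proc freq data=demog noprint;
  tables subjid / out=subjid_counts(where=(count > 1));
run;
```

This only lists the duplicated key values and their counts. To pull the full records, the result would still have to be joined or merged back to demog.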
Answer / sudha ramalingam
You can also use the NODUPKEY and NODUPRECS options in PROC SORT.
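The two options behave differently, which a short sketch may make clearer. It assumes the demog dataset and subjid key used elsewhere in this thread; demog_onekey and demog_norecs are illustrative names only.

```sas
/* NODUPKEY: keeps the first observation for each value of the
   BY variables and drops later ones, even if other variables differ. */
proc sort data=demog out=demog_onekey nodupkey;
  by subjid;
run;

/* NODUPRECS: drops an observation only when it matches the previous
   observation on every variable. Sorting BY _ALL_ places fully
   identical records next to each other so they are caught. */
proc sort data=demog out=demog_norecs noduprecs;
  by _all_;
run;
```

Both options remove duplicates rather than report them, so add DUPOUT= if the dropped observations need to be inspected.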
Answer / natrajboga
You can identify duplicate values in two ways:
1) PROC SORT with the DUPOUT= and NODUPKEY options, followed by a BY statement.
2) PROC SQL with GROUP BY and HAVING COUNT(*) > 1.
For example:
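/* OUT= keeps demog unchanged; without it, PROC SORT would overwrite
   demog and the query below would find no duplicates.
   DUPOUT= receives only the dropped (second and later) observations. */
proc sort data=demog out=demog_nodup dupout=dup_demog nodupkey;
  by subjid;
run;

/* List each subjid that occurs more than once, with its count. */
proc sql;
  select subjid, count(*) as n_obs
  from demog
  group by subjid
  having count(*) > 1;
quit;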