How would you delete duplicate observations?
Answers were Sorted based on User's Feedback
Answer / mohan reddy
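USE THE NODUPRECS OPTION (ALIAS NODUP) IN THE PROC SORT STATEMENT. PROC SORT REQUIRES A BY STATEMENT; SORT BY ALL VARIABLES SO THAT DUPLICATE OBSERVATIONS END UP NEXT TO EACH OTHER.
EX:
PROC SORT DATA=EMP NODUP;
BY _ALL_;
RUN;
THE NODUPKEY OPTION DELETES OBSERVATIONS THAT HAVE DUPLICATE VALUES
OF THE BY VARIABLES, KEEPING THE FIRST OBSERVATION IN EACH BY GROUP.
EX: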
PROC SORT DATA=EMP NODUPKEY;
BY ENO;
RUN;
Answer / vijay
NODUP: in PROC SORT, deletes observations that duplicate an entire record.
NODUPKEY: deletes observations that have duplicate values of the key
(BY) variables.
Answer / ananth
Use the NODUPKEY option in the PROC SORT statement.
Or use FIRST.byvariable or LAST.byvariable in a DATA step.
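The DATA step approach mentioned above can be sketched as follows. This is a minimal example, assuming the EMP data set and ENO key variable from the first answer; the output names EMP_SORTED and EMP_UNIQUE are hypothetical.

```sas
/* Assumption: EMP exists and ENO is the key variable.            */
/* BY-group processing needs the data sorted by the BY variable.  */
proc sort data=emp out=emp_sorted;
    by eno;
run;

data emp_unique;
    set emp_sorted;
    by eno;
    /* FIRST.ENO is 1 on the first observation of each ENO group, */
    /* so this subsetting IF keeps one observation per key value. */
    if first.eno;
run;
```

Replacing `if first.eno;` with `if last.eno;` would keep the last observation of each group instead.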
Answer / prr
In PROC SORT:
NODUPKEY: deletes duplicate observations based on the BY variables.
NODUPRECS: compares the complete observation and deletes
duplicate observations.
NODUP: an alias for NODUPRECS; it tells SAS to delete
duplicate observations and keep only the first one.
In the DATA step: FIRST. and LAST. variables.
In PROC SQL: the DISTINCT keyword.
Process of SQL:
1. Select
2. group by
3. having
4. distinct
5. order by
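As a rough illustration of the PROC SQL route, the sketch below removes rows that are identical across all columns. It assumes the EMP data set from the first answer; the output table name EMP_UNIQUE is hypothetical.

```sas
/* Assumption: EMP exists. DISTINCT applies to the full select    */
/* list, so with * it drops rows that match on every column.      */
proc sql;
    create table emp_unique as
    select distinct *
    from emp;
quit;
```

To deduplicate on a subset of columns instead, list only those columns after `select distinct`.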
Answer / ganesh
When you want to eliminate duplicate observations from a data set, use
the NODUP option in PROC SORT.
When you want to eliminate observations with duplicate key values in the
specified BY variables, use the NODUPKEY option in PROC SORT.
Answer / reddy
NODUP eliminates only successive (adjacent) duplicate observations.
NODUPKEY eliminates all observations with duplicate values in the
specified BY variables, keeping the first one in each group.
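The point about successive duplicates can be seen in a small sketch. The sample data and the output names BY_KEY and BY_ALL are made up for illustration.

```sas
/* Hypothetical sample data: the two "1 A" records are identical  */
/* but are separated by "1 B".                                    */
data emp;
    input eno name $;
    datalines;
1 A
1 B
1 A
2 C
;
run;

/* Sorting by ENO alone can leave the identical records           */
/* non-adjacent, so NODUPRECS may keep both of them.              */
proc sort data=emp out=by_key noduprecs;
    by eno;
run;

/* Sorting by all variables places identical records next to      */
/* each other, so NODUPRECS can remove the repeat.                */
proc sort data=emp out=by_all noduprecs;
    by _all_;
run;
```

Comparing BY_KEY and BY_ALL after running both steps shows whether any non-adjacent duplicate survived the first sort.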
Answer / thirumalesh.e.
We can delete duplicates with the NODUPKEY, NODUPRECS, or
NODUPLICATES option in PROC SORT (together with the SORTDUP= system
option), with FIRST. and LAST. in a DATA step, or in PROC SQL with
CREATE TABLE ... AS SELECT UNIQUE * ...