How would you delete duplicate observations?
Answer / mohan reddy
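Use the NODUP option (an alias for NODUPRECS) in the PROC SORT statement. It drops an observation when every variable matches the previous observation written to the output, so sort by all variables to bring identical records together. PROC SORT also requires a BY statement.
Example:
PROC SORT DATA=EMP NODUP;
BY _ALL_;
RUN;
The NODUPKEY option also deletes duplicates, but it compares only the values of the BY variables and keeps the first observation in each BY group.
Example:
PROC SORT DATA=EMP NODUPKEY;
BY ENO;
RUN;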
Answer / vijay
NODUP: in PROC SORT, deletes observations that are duplicated across all variables.
NODUPKEY: deletes observations that have duplicate values of the key (BY) variables.
Answer / ananth
Use the NODUPKEY option in the PROC SORT statement.
Or use FIRST.byvariable or LAST.byvariable in a DATA step.
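The DATA step approach can be sketched as follows. This is a minimal example that assumes the EMP data set and the ENO key variable used in the first answer; the output names EMP_SORTED and EMP_UNIQUE are hypothetical.

```sas
/* The DATA step BY statement needs the input sorted by the key. */
proc sort data=emp out=emp_sorted;
  by eno;
run;

data emp_unique;
  set emp_sorted;
  by eno;
  /* FIRST.ENO is 1 only on the first observation of each ENO group,
     so this subsetting IF keeps one observation per key value. */
  if first.eno;
run;
```

Replacing the condition with `if last.eno;` would keep the last observation of each group instead.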
Answer / prr
In PROC SORT:
NODUPKEY: deletes duplicate observations based on the BY variables.
NODUPRECS: compares the complete observation with the previous one and deletes it when all values match.
NODUP: an alias for NODUPRECS; it tells SAS to delete duplicate observations and keep only the first one.
In a DATA step: FIRST. and LAST. BY-group variables.
In PROC SQL: the DISTINCT keyword.
Clauses of a PROC SQL query, in the order they are written:
1. SELECT (DISTINCT goes directly after SELECT)
2. FROM
3. WHERE
4. GROUP BY
5. HAVING
6. ORDER BY
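A minimal sketch of the PROC SQL approach, assuming the EMP data set from the first answer; the output table name EMP_UNIQUE is hypothetical.

```sas
proc sql;
  /* SELECT DISTINCT * keeps one copy of each row that is
     identical across all columns. */
  create table emp_unique as
  select distinct *
  from emp;
quit;
```

Listing specific columns after DISTINCT instead of `*` returns the unique combinations of just those columns.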
Answer / ganesh
To eliminate observations that are duplicated in full from a data set, use the NODUP option in PROC SORT.
To eliminate observations with duplicate key values in the specified BY variables, use the NODUPKEY option in PROC SORT.
Answer / reddy
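NODUP eliminates only successive duplicates: an observation is dropped when it is identical to the one just before it in the sorted output.
NODUPKEY eliminates every observation whose BY variable values repeat an earlier observation, keeping the first one for each key.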
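The difference can be illustrated with a small made-up data set. The data set DEMO, its variables, and the output names below are hypothetical and chosen only for this sketch.

```sas
data demo;
  input eno name $;
  datalines;
1 A
1 B
1 A
2 C
;
run;

/* Sorted by ENO only: the two identical "1 A" rows are separated by
   "1 B", so NODUPRECS may leave both of them in the output. */
proc sort data=demo out=by_key noduprecs;
  by eno;
run;

/* Sorted by all variables: identical rows become adjacent,
   so one "1 A" row is removed and 3 observations remain. */
proc sort data=demo out=by_all noduprecs;
  by _all_;
run;

/* NODUPKEY compares ENO only: one observation per ENO value,
   so 2 observations remain. */
proc sort data=demo out=one_per_key nodupkey;
  by eno;
run;
```

This is why sorting with `by _all_;` is the usual companion to NODUPRECS when the goal is to remove every fully identical record.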
Answer / thirumalesh.e.
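Duplicates can be deleted with the NODUPKEY, NODUPRECS, or NODUPLICATES options of PROC SORT (the SORTDUP= system option affects how PROC SORT applies them when variables are dropped or kept), with FIRST. and LAST. processing in a DATA step, or with PROC SQL, using CREATE TABLE ... AS SELECT UNIQUE * ...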