What is the Program Data Vector (PDV)? What are its functions?
Answers were Sorted based on User's Feedback
Answer / nishant
When SAS processes a data step it has two phases.
Compilation phase and execution phase.
During the compilation phase the input buffer is created to
hold a record from external file. After input buffer is
created the PDV is created. The PDV is the area of memory
where sas builds dataset, one observation at a time. The PDV
contains two automatic variables _N_ and _ERROR_.
| Is This Answer Correct ? | 32 Yes | 1 No |
Answer / poornima
PDV (Program Data Vector) is a logical area in memory where
SAS creates a dataset one observation at a time.
| Is This Answer Correct ? | 15 Yes | 0 No |
Answer / cutepa1
To the above answers I like to add a bit of information that
PDV has even more automated variables such as
FIRST.<by-group-variable>, LAST.<by-group-variable>, _END_,
...etc, along with _ERROR_ and _N_ where each has got its
own significance to have a control over the execution of a
SAS Program.
| Is This Answer Correct ? | 6 Yes | 0 No |
Answer / raj
It is a logical area in memory where SAS builds a data set,
one observation at a time. When a program executes, SAS
reads data values from the input buffer or creates them by
executing SAS language statements. The data values are
assigned to the appropriate variables in the program data
vector. From here, SAS writes the values to a SAS data set
as a single observation.
Along with data set variables and computed variables, the
PDV contains two automatic variables, _N_ and _ERROR_. The
_N_ variable counts the number of times the DATA step
begins to iterate. The _ERROR_ variable signals the
occurrence of an error caused by the data during execution.
The value of _ERROR_ is either 0 (indicating no errors
exist), or 1 (indicating that one or more errors have
occurred). SAS does not write these variables to the output
data set.
| Is This Answer Correct ? | 5 Yes | 0 No |
Answer / pambrose
Source: SAS book
The PDV contains all the variables in the input data set, the variables created in DATA step statements, and the two variables, _N_ and _ERROR_, that are automatically generated for every DATA step. The _N_ variable represents the number of times the DATA step has iterated. The _ERROR_ variable acts like a binary switch whose value is 0 if no errors exist in the DATA step, or 1 if one or more errors exist
Variables that are created by the INPUT and the Sum statements are set to missing initially. Note that in this representation, numeric variables are initialized with a period and character variables are initialized with blanks. The automatic variable _N_ is set to 1; the automatic variable _ERROR_ is set to 0.
The variable TeamName is marked Drop in the PDV because of the DROP= data set option in the DATA statement. Dropped variables are not written to the SAS data set. The _N_ and _ERROR_ variables are dropped because automatic variables created by the DATA step are not written to a SAS data set
| Is This Answer Correct ? | 3 Yes | 0 No |
How would you code a merge that will keep only the observations that have matches from both sets?
Are you involved in writing the inferential analysis plan? Tables specifications?
how do i get last 10obs from a dataset when we don't know about the number of obsevations in that dataset?
Explain the use of proc print and proc contents?
Does SAS ?Translate? (compile) or does it ?Interpret?? Explain.
how to debug and test the sas program? : Sas-administrator
what is option year cuttoff in sas
What is the significance of the ‘OF’ in X=SUM (OF a1-a4, a6, a9);?
What is difference between sas rename and lable?
What is the difference between input and infile statement?
How to convert a given date value into SAS date
9 Answers CitiGroup, Quintiles,
What report output formats can you generate using SAS?