SAS: Using the lag function without a set statement (to simulate time series data.)

Question

Could someone explain why the following two pieces of code give different results? I would like to simulate some simple time series processes in SAS, but I'm struggling with the lag function.

Specifically, in program 1, the variable b contains no data, which is unexpected. In program 2, the lag function works as expected.

/*Program 1*/
data lagtest;
a = 1;
b=lag(a);
output;

a = 2;
b= lag(a);
output;

a = 3;
b= lag(a);
output;
run;


/*Program 2*/
data lagtest2;
input a;
datalines;
1
2
3
;
run;

data lagtest2;
set lagtest2;
b= lag(a);
run;

I've been reading about the lag function, but can't find references to its use in a datastep that does not take an input dataset.

Thanks very much for any help.

Longfish Longfish · Accepted Answer · 2014-07-02T11:31:31

The LAG function works on input data, not output data. In your first example there is no input data, just output, therefore the lag value is always blank. In your second example you don't need the 2 sections of code, you could just put :

data lagtest2;
input a;
b= lag(a);
datalines;
1
2
3
;
run;

SAS: Using the lag function without a set statement (to simulate time series data.)

2 Answers