creating two data sets in one data step in sas

Question

This should be very simple, but somehow I confuse myself.

data in_both 
   missing_name (drop = name);

   merge employee (in=in_employee)
         hours (in = in_hours);

         by ID;

   if in_employee and in_hours then output in_both;
   else if in_employee and not in_hours then output missing_name;

run;

I have two questions: (1): For the first statement "missing_name(drop = name)", I understand that, it means keep all the data except the column whose head is name. But keep which data here? What is the input? (2): I know we can create two datasets within one data step, but that means we should use "data in_both missing_name", instead of "data in_both", right?

Many thanks for your time and attention. I appreciate your help.

Quentin Quentin · Accepted Answer · 2015-11-12T15:59:31

(1) The DROP= option refers to dropping variables from the dataset MISSING_NAME. With no drop= or keep= option, all variables that exist in EMPLOYEE or HOURS would be written to MISSING_NAME. You can run PROC CONTENTS on the four datasets to see which variables are included in each.

(2) As written, your code will output two datasets IN_BOTH and MISSING_NAME. As @Tom just commented, your current DATA statement already lists both datasets, because the semicolon ends the statement, not the white space/carriage return.

creating two data sets in one data step in sas

3 Answers