Concatenate duplicate values

Question

I have a table with some variables, say var1 and var2 and an identifier, and for some reasons, some identifiers have 2 observations.

I would like to know if there is a simple way to put back the second observation of the same identifier into the first one, that is

instead of having two observations, each with var1 var2 variables for the same identifier value

ID    var1    var2
------------------
A1    12      13
A1    43      53

having just one, but with something like var1 var2 var1_2 var2_2.

ID    var1    var2    var1_2    var2_2
--------------------------------------
A1    12      13      43        53

I can probably do that with renaming all my variables, then merging the table with the renamed one and dropping duplicates, but I assume there must be a simpler version.

I would suggest that this isn't a great idea, unless there's a very good reason to; it is almost always easier to work with data with fewer variables and more observations. — Joe

DomPazz DomPazz · Accepted Answer · 2014-11-28T15:59:58

Actually, your suggestion of merging the values back is probably the best.

This works if you have, at most, 1 duplicate for any given ID.

data first dups;
set have;
by id;
if first.id then output first;
else output dups;
run;

proc sql noprint;
create table want as
select a.id,
       a.var1,
       a.var2,
       b.var1 as var1_2,
       b.var2 as var2_2
from first as a
  left join
     dups as b
  on a.id=b.id;
quit;

Concatenate duplicate values

3 Answers