3 votes

SAS has a way of creating a library (using LIBNAME). This is helpful in long data-processing jobs because we don't have to keep changing dataset names: if we want to reuse a dataset without renaming it, we can put it in a library. Even if two datasets have the same name, we can work with them together because they live in different libraries.

My question: is there any such option in R that can create a library (or a separate folder within R) where we can save our data?

Here's the example:

Suppose I have a dataset dat1. I summarize variables var1 and var2 in dat1 by var3.

proc summary data=dat1 nway missing;
  var var1 var2;
  class var3;
  output out=tmp.dat1 (drop = _freq_ _type_) sum = ;
  run;

Then I merged dat1 with dat2, another dataset. Both dat1 and dat2 have the common variable var3, which I merged on. I created a new dataset, again called dat1.

proc sql;
   create table dat1 as
   select a.*,b.*
   from dat1 a left join tmp.dat2 b
   on a.var3=b.var3;
  quit;

Now I'm summarizing dataset dat1 again after the merge, to check whether the values of var1 and var2 remain the same before and after merging.

proc summary data=dat1 nway missing;
  var var1 var2;
  class var3;
  output out=tmp1.dat1 (drop = _freq_ _type_) sum = ;
  run;

The equivalent code in R would be:

dat3 <- ddply(dat1, .(var3), summarise, var1 = sum(var1, na.rm = TRUE), var2 = sum(var2, na.rm = TRUE))

dat1 <- sqldf("select a.*, b.* from dat1 a left join dat2 b on a.var3 = b.var3")

dat4 <- ddply(dat1, .(var3), summarise, var1 = sum(var1, na.rm = TRUE), var2 = sum(var2, na.rm = TRUE))

In SAS I used just two dataset names, but in R I'm using four. So if I'm writing 4000 lines of data-processing code, having so many dataset names sometimes becomes overwhelming. In SAS it was easy to keep the same dataset name because I'm using two libraries, tmp and tmp1, in addition to the default work library.

In SAS, library is defined as:

LIBNAME tmp "directory_path\folder_name";

In this folder, dat1 will be stored.

This question may make sense to a SAS user, but it makes no sense to the rest of us. Why don't you explain what you want out of R and how the current way you do things is lacking? Perhaps with a reproducible example? – Ari B. Friedman
To save datasets please see ?save (?load to load them). – sgibb
Your problem is you are writing 4000-line scripts. This may not be a problem in SAS, where anything over five lines is confusing already, but in R you should never write anything more than about ten lines without thinking "hey, this should be wrapped up in a function." – Spacedman
Thanks Spacedman for your comment. My problem is not with writing a 4K-line script. If you look at the example, in SAS I used around 20 lines, but in R I did the same thing in just 3 lines, so R is more efficient for writing code. But in R I have to define more data names, which I don't have to do in SAS just because it has the library option. I just want something equivalent to a library. – Beta
It sounds like you might want to work with different, named environments. – Glen_b

4 Answers

6 votes

From what I can gather from the SAS online help, a SAS library is a set of datasets stored in a folder that can be referenced as a unit. The equivalent in R would be to store the R objects you want to keep using save:

save(obj1, obj2, etc, file = "stored_objects.rda")

Loading the objects can be done using load.
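As a minimal sketch (the object names, values, and file path here are invented for illustration):

```r
# two hypothetical data frames to keep together
dat1 <- data.frame(x = 1:3)
dat2 <- data.frame(y = 4:6)

# save both into one file (roughly analogous to datasets sitting in one SAS library)
path <- file.path(tempdir(), "stored_objects.rda")
save(dat1, dat2, file = path)

# drop them from memory, then restore them under their original names
rm(dat1, dat2)
load(path)
```

You could keep one such .rda file per "library" (e.g. tmp.rda, tmp1.rda) and load whichever one you need before a processing step.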

Edit: I don't really see why having an additional object or two is so much of a problem. However, if you want to reduce the number of objects, just put your results in a list.
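For example, related results can live together in one named list instead of as separate top-level objects (a sketch; the names and values are invented):

```r
# one list holding both versions of the summary, instead of dat3 and dat4
results <- list()
results$before <- data.frame(var3 = c("a", "b"), var1 = c(3, 7))
results$after  <- data.frame(var3 = c("a", "b"), var1 = c(3, 7))

# comparing the two is then a single expression
identical(results$before, results$after)  # TRUE
```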

5 votes

There are two separate aspects of SAS's libraries which (it seems) you are interested in.

  • Specification of the directory in which data files are stored
  • Ability to easily point an analysis to a different set of identically named datasets by just specifying the different location

Taking these in that order.

The problem with answering the first is that R and SAS have different models for how data is stored. R stores data in memory, organized in environments arranged in a particular search order. SAS stores data on disk and the names of datasets correspond to file names within a specified directory (there likely is caching in memory for optimization, but conceptually this is how data is stored). R can store (sets of) objects in a file on disk using save() and bring them back into memory using load(). The filename and directory can be specified in those function calls (hence Paul's answer). You could have several .RData files, each containing objects named dat1, dat2, etc. which can be loaded prior to running an analysis and the results can be written out to (other) .RData files.

An alternative to this would be using one of the extensions which give data types which are backed by disk storage instead of memory. I've not had experience with any of them to talk about how well they would work in this situation, but that is an option. [Edit: mnel's answer has a detailed example of just this idea.]

Your second part can be approached in different ways. Since R uses in-memory data, the answers would focus on arranging different environments (each of which can contain different but identically named data sets) and controlling which one gets accessed via attach()ing and detach()ing the environments from the search path (what Glen_b's answer gets toward). You still don't have the disk backing of the data, but that is the previous problem.
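A small sketch of that idea, referring to each environment explicitly rather than via attach()/detach() (tmp and tmp1 are illustrative names, echoing the SAS libraries in the question):

```r
# two environments, each holding its own object named dat1
tmp  <- new.env()
tmp1 <- new.env()
tmp$dat1  <- data.frame(var1 = 1:3)
tmp1$dat1 <- data.frame(var1 = 4:6)

# the same name, disambiguated by the environment it lives in
sum(tmp$dat1$var1)          # 6
sum(tmp1$dat1$var1)         # 15

# with() evaluates an expression "inside" a chosen environment
with(tmp1, sum(dat1$var1))  # 15
```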

Finally, @joran's admonition is relevant. The solution to the problem of performing a set of tasks on potentially different (but related) sets of data in R is to write a function to do the work. The function has parameters. Within the function, the parameters are referred to by the names given in the argument list. When the function is called, which particular set of data is sent to it is specified by the function call; the names inside and outside the function need not have anything to do with each other. The suggestions about storing the multiple sets of data in a list are implicitly approaching the problem this way; the function is called for each set of data in the list in turn. Names don't matter, then.
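For instance, the repeated summary step from the question could be wrapped up once. Here summarise_by_var3 is a hypothetical helper using base R's aggregate (note that its formula method silently drops rows with NAs, unlike the na.rm = TRUE calls in the question's ddply code):

```r
# one reusable function: sum var1 and var2 within each level of var3
summarise_by_var3 <- function(dat) {
  aggregate(cbind(var1, var2) ~ var3, data = dat, FUN = sum)
}

# illustrative data; the same call works before and after any merge step
dat1 <- data.frame(var1 = c(1, 2, 3), var2 = c(10, 20, 30),
                   var3 = c("a", "a", "b"))
summarise_by_var3(dat1)
```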

4 votes

Here is an example using the SOAR package and named environments.

To quote from the vignette

Objects need not be always held in memory. The function save may be used to save objects on the disc in a file, typically with an .RData extension. The objects may then be removed from memory and later recalled explicitly with the load function.

The SOAR package provides a simple way to store objects on the disc, but in such a way that they remain visible on the search path as promises, that is, if and when an object is needed again it is automatically loaded into memory. It uses the same lazy loading mechanism as packages, but the functionality provided here is more dynamic and flexible.

It will be useful to read the whole vignette.

library(SOAR)
library(plyr)
library(sqldf)
set.seed(1)
# create some dummy data and a named environment to hold results
tmp <- new.env(parent = .GlobalEnv)
dat1 <- data.frame(var1 = rnorm(50), var2 = sample(50, replace = TRUE), var3 = sample(letters[1:5], 
    50, replace = TRUE))
tmp$dat1 <- ddply(dat1, .(var3), summarise, var1 = sum(var1, na.rm = TRUE), 
    var2 = sum(var2, na.rm = TRUE))
tmp$dat2 <- data.frame(Var3 = sample(letters[1:5], 20, replace = TRUE), Var4 = 1:20)
# store as a SOAR cached object (on disc)
Store(tmp, lib = "tmp")

# replace dat1 within the global environment using sqldf; first create a
# new environment to work in with the correct versions of dat1 and dat2
sqlenv <- tmp
sqlenv$dat1 <- dat1

dat1 <- sqldf("select a.*,b.* from dat1 a left join dat2 b on a.var3=b.var3", 
    envir = sqlenv)

# create a new named environment tmp1
tmp1 <- new.env(parent = .GlobalEnv)

tmp1$dat1 <- ddply(dat1, .(var3), summarise, var1 = sum(var1, na.rm = TRUE), 
    var2 = sum(var2, na.rm = TRUE))

# store using a SOAR cache
Store(tmp1, lib = "tmp")


tmp1$dat1

##   var3   var1 var2
## 1    a  1.336  378
## 2    b  8.514 1974
## 3    c  5.795  624
## 4    d -8.828  936
## 5    e 20.846 1490

tmp$dat1

##   var3    var1 var2
## 1    a  0.4454  126
## 2    b  1.4190  329
## 3    c  1.9316  208
## 4    d -2.9427  312
## 5    e  4.1691  298

I'm not sure you should expect tmp1$dat1 and tmp$dat1 to be identical (given my example, anyway).

2 votes

Named environments are one of a number of ways of achieving what it sounds like you want.

Personally, if there aren't a lot of different data frames or lists, I'd lean toward organizing things other ways, such as inside data frames or lists, depending on how your data is structured. But if each unit consists of many different kinds of data and functions, environments may be significantly better. They're described in the help, and a number of R blog posts discuss them.

But on reflection, RStudio projects may be closer to the way you're thinking about the problem (and if you're not using RStudio already, I highly recommend it). Have a look at how projects work.