Purrr map with rename_with

Question

I'm trying to clean a dataset's names. I've used janitor::clean_names() to start, but some names still begin with abbreviations that I would like to separate from the rest of the name with an underscore _. I have code that works using rename_with(~str_replace(.x, "gh", "gh_"), .cols = starts_with("gh")). However, there are many abbreviations, so it would be good to find a way to map or otherwise functionalize this process.

library(tidyverse)

dat <- tibble(ghrisk_value = c(1,2), 
              ghrisk_corrected = c(2,3), 
              devpolicy_value = c(4,5),
              devpolicy_corrected = c(5,6))

# code works but not functionalized
dat %>%
   rename_with(~str_replace(.x, "gh", "gh_"), .cols = starts_with("gh")) %>%
   rename_with(~str_replace(.x, "dev", "dev_"), .cols = starts_with("dev")) %>%
   names()

# attempt at map...
abbr_words <- c("gh", "dev")
map(dat, ~rename_with(str_replace(.x, abbr_words, str_c(abbr_words, "_")))

Darren Tsai · Accepted Answer · 2020-09-03T18:49:13

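You don't need map(). Just use a lookbehind in the regular expression, "(?<=a|b|c)", which matches the zero-width position immediately after a, b, or c, and replace that position with an underscore. Because str_replace() only replaces the first match, only the leading abbreviation is affected. In addition, starts_with() can take a character vector as input and matches the union of all its elements.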

abbr_words <- c("gh", "dev")

pattern <- sprintf("(?<=%s)", str_c(abbr_words, collapse = "|"))
# [1] "(?<=gh|dev)"

dat %>%
  rename_with(~ str_replace(.x, pattern, "_"), starts_with(abbr_words))

# # A tibble: 2 x 4
#   gh_risk_value gh_risk_corrected dev_policy_value dev_policy_corrected
#           <dbl>             <dbl>            <dbl>                <dbl>
# 1             1                 2                4                    5
# 2             2                 3                5                    6
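
If you do want a purrr-based version that keeps your original per-abbreviation logic, one option is to fold over the abbreviations with reduce(), since each rename has to be applied to the result of the previous one (map() would instead run once per column). This is only a sketch: add_underscore() is a hypothetical helper name I made up for illustration, and I've anchored the pattern with "^" on the assumption that the abbreviation is always at the start of the name.

```r
library(dplyr)
library(purrr)
library(stringr)

dat <- tibble(ghrisk_value = c(1, 2),
              ghrisk_corrected = c(2, 3),
              devpolicy_value = c(4, 5),
              devpolicy_corrected = c(5, 6))

abbr_words <- c("gh", "dev")

# Hypothetical helper (not part of any package): inserts an underscore
# after one leading abbreviation, only in columns that start with it.
add_underscore <- function(df, abbr) {
  rename_with(
    df,
    ~ str_replace(.x, str_c("^", abbr), str_c(abbr, "_")),
    .cols = starts_with(abbr)
  )
}

# reduce() threads the data frame through add_underscore() once per
# abbreviation, starting from dat.
renamed <- reduce(abbr_words, add_underscore, .init = dat)

names(renamed)
# [1] "gh_risk_value"        "gh_risk_corrected"    "dev_policy_value"
# [4] "dev_policy_corrected"
```

For a plain "prefix plus underscore" rule the single regex above is shorter, but the reduce() form may be easier to extend if different abbreviations ever need different replacements.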
