Concatenate strings by group with dplyr in R
Hello friends! today we’ll be learning how to Concatenate strings by group with dplyr in R. I’ll be using iris data available in R
Data
Sepal.Length | Sepal.Width | Petal.Length | Petal.Width | Species |
5.1 | 3.5 | 1.4 | 0.2 | setosa |
4.9 | 3 | 1.4 | 0.2 | setosa |
4.7 | 3.2 | 1.3 | 0.2 | setosa |
4.6 | 3.1 | 1.5 | 0.2 | setosa |
5 | 3.6 | 1.4 | 0.2 | setosa |
5.4 | 3.9 | 1.7 | 0.4 | setosa |
4.6 | 3.4 | 1.4 | 0.3 | setosa |
Code
So the key used is Species to group by and Sepal.Length is concatenated with “,”
library(dplyr) df <- iris %>% select(Species,Sepal.Length) %>% group_by(Species) %>% mutate(Grp = paste0(Sepal.Length, collapse = ",")) %>% distinct(Species, Grp, .keep_all = TRUE) df
Keep visiting Analytics Tuts for more tutorials.
Thanks for reading! Comment your suggestions and queries