AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |
Back to Blog
Dplyr summarize multiple columns9/27/2023 They are less likely to purchase online ( store_trans = 6 while online_trans = 3). Price: They are older people with average age 60. 1/3 of them are female, and 2/3 are male. It is a group of middle-age wealthy people. There is a lot of information you can extract from those simple averages.Ĭonspicuous: average age is about 40. online_trans: average times of online transactions.store_trans: average times of transactions in the store.HouseYes: percentage of people who own a house. In the end, we calculate the following for each segment: The rest of the command above is similar. Store the result in a new variable named Age.Round the result to the specified number of decimal places.Calculate the mean of column age ignoring missing value for each customer segment.For example, Age = round(mean(na.omit(age)),0) tell R the following things: Then list the exact actions inside summarise(). The third argument summarise tells R the manipulation(s) to do. Here we only summarize data by one categorical variable, but you can group by multiple variables, such as group_by(segment, house). The second line group_by(segment) tells R that in the following steps you want to summarise by variable segment. Now, let’s look at the code in more details. 14.1 Customer Data for Clothing Companyĭat_summary % dplyr :: group_by(segment) %>% dplyr :: summarise( Age = round( mean( na.omit(age)), 0), FemalePct = round( mean(gender = "Female"), 2), HouseYes = round( mean(house = "Yes"), 2), store_exp = round( mean( na.omit(store_exp), trim = 0.1), 0), online_exp = round( mean(online_exp), 0), store_trans = round( mean(store_trans), 1), online_trans = round( mean(online_trans), 1)) # transpose the data frame for showing purpose # due to the limit of output width cnames % ame() names(tdat_summary) 12.1.1 Logistic Regression as Neural Network.11.4 Regression and Decision Tree Basic.10.4 Penalized Generalized Linear Model.9.2 Principal Component Regression and Partial Least Square.9.1.2 Diagnostics for Linear Regression.
0 Comments
Read More
Leave a Reply. |