Group by to find max per group and then count the groups that have their max values in each year

Question

I'm trying to group the variables by their max values and then count the number of groups that have their max values in a certain period.

The data set looks like this:

Year	Car	Value
1991	A	21
1992	A	19
1993	A	20
1992	B	42
1993	B	17
1991	C	31
1992	C	50
1993	C	23

What I want to do is to find the max values per car and then count how many cars reached their maximum values per year.

So essentially, a table like this

Year	Count
1991	1
1992	2
1993	0

I was able to identify the max values per group using dplyr but cant figure out how to implement the count. Can someone please help? I've also tried top_n but that just gives me the max value per month which isnt what I want!

In your sample data set all cars have their max values in 1992 but the expected result does not reflect this. Please, can you clarify - thank you. — Uwe
Another question: What is the expected answer in case a car hits the maximum value in in more than one year, e.g., Car A has a Value of 21 in 1991 and 1992? — Uwe
@Uwe : Great question! In that case I would want both the years to be counted. — Shruti Mishra

TarJae TarJae · Accepted Answer · 2021-02-21T01:25:01

Very close to Ben's code! Without n_distinct

df1 <- df %>% 
  group_by(Car) %>%  
  mutate(mx = max(Value)) %>% 
  ungroup() %>% 
  group_by(Year) %>% 
  summarise(count=sum(Value >= mx))

Group by to find max per group and then count the groups that have their max values in each year

3 Answers